Published

2025-03-31

Open In Colab

More on Writing Functions

Justin Post

We’ve gone through a lot with python already!

  • JupyterLab & Markdown
  • Basic data types
    • Strings, Floats, Ints, Booleans
  • Compound data types
    • Lists, Tuples, Dictionaries, Numpy arrays, pandas series & data frames
  • Writing Functions
  • Control flow (if/then/else, Looping)
  • Data uses and basic summarizations

Next up we’ll cover a bit more about writing our own functions that will greatly increase their usefulness! After that we’ll talk about how to plot our data using matplotlib and pandas.

Then we’ll really be ready to start talking about modeling, data sources, and generally moving towards doing fun things with big data!

Note: These types of webpages are built from Jupyter notebooks (.ipynb files). You can access your own versions of them by clicking here. It is highly recommended that you go through and run the notebooks yourself, modifying and rerunning things where you’d like!


Writing Functions Recap

  • Writing functions is super cool!
  • Recall the basic syntax
def func_name(args):
    """
    Doc string
    """
    body
    return object
  • We saw that there were many ways to set up your function arguments and to call your function
  • Remember that variables defined within the function are not generally available outside of the function
    • That is, a new symbol table is used when the function is called
    • We can define global variables if we really want to
  • We return what we want from the function with return
    • If we don’t return anything then None is returned!

The topics we’ll cover in this notebook are: - Packing and unpacking with functions + Catching extra arguments given to a function + Passing your arguments to a function from an object - lambda functions - map(), filter(), and functools.reduce()

In a later notebook we’ll talk about how to handle errors or exceptions!


Packing and Unpacking

Reminder: We can pack a list by separating variables to create with commas:

first, second, third = ...

Let’s look at an example:

animals = ["Dog", "Cat", "Horse", "Frog", "Cow", "Buffalo", "Deer", "Fish", "Bird", "Fox", "Racoon"]
short_animals = animals[:3]

first, second, third = short_animals
print(first + " " + second + " " + third)
Dog Cat Horse

We saw that we can pack leftover elements into a list using *variable:

first, second, third, *other = animals
print(first + " " + second + " " + third)
print(other)
Dog Cat Horse
['Frog', 'Cow', 'Buffalo', 'Deer', 'Fish', 'Bird', 'Fox', 'Racoon']

Unlimited Positional Arguments

  • This idea can be used when writing a function!
  • In this case we define an argument to our function with *variable
  • This allows us to pass unlimited positional arguments to our function (variadic arguments)
  • The inputs are handled as a tuple in the function!

Let’s write a silly function to print out all arguments passed via this idea

def basic_print(*args):
  print(type(args))
  print(args)
  return None

We can pass this function as many things as we’d like and it will be accessible within the function body as a tuple. We can see this as the printed values are surrounded by ( and ), which implies we are printing a tuple!

basic_print("hi", ["a list", "how fun"], 3, 10)
<class 'tuple'>
('hi', ['a list', 'how fun'], 3, 10)

As tuples are iterable, we can iterate across these elements via a loop!

def basic_print_elements(*args):
  for i in args:
    print(type(i),i)
  return None
basic_print_elements("hi", ["a list", "how fun"], 3, 10)
<class 'str'> hi
<class 'list'> ['a list', 'how fun']
<class 'int'> 3
<class 'int'> 10

Let’s define a function that takes in as many 1D numpy arrays or pandas Series the user would like and returns the means for each input.

We’ll also take an argument for the number of decimal places to return for the means.

def find_means(*args, decimals = 4):
    """
    Assume that args will be a bunch of numpy arrays (1D) or pandas series
    Return the mean of each, rounded to `decimals` places
    """
    means = []
    for x in args: #iterate over the tuple values
        means.append(np.mean(x).round(decimals))
    return means
  • Create some data with numpy to send to this
import numpy as np
from numpy.random import default_rng
rng = default_rng(3) #set seed to 3

#generate a few means from standard normal data
n5 = rng.standard_normal(5)       #sample size of 5
n25 = rng.standard_normal(25)     #sample size of 25
n100 = rng.standard_normal(100)   #sample size of 100
n1000 = rng.standard_normal(1000) #sample size of 1000

Let’s pass these to our function!

find_means(n5, n25, n100, n1000, decimals = 2)
[-0.22, 0.11, -0.01, 0.04]

Awesome! This gives us a lot more functionality with our function writing.


Unlimited Keyword Arguments

  • You can also pass unlimited keyword arguments if you define the arg with a **
  • Handled as a dictionary in the function

Let’s write a basic function to print out the keywords with their values.

def print_key_value_pairs(**kwargs):
    """
    key word args can be anything
    """
    print(type(kwargs), kwargs)
    for x in kwargs:
        print(x + " : " + str(kwargs.get(x))) #cast the value to a string for printing

Now we pass as many named arguments as we’d like!

print_key_value_pairs(
  name = "Justin",
  job = "Professor",
  phone = 9195150637)
<class 'dict'> {'name': 'Justin', 'job': 'Professor', 'phone': 9195150637}
name : Justin
job : Professor
phone : 9195150637

Unpacking Arguments

  • Suppose we want to call our function but our arguments are stored in a list or tuple
    • We’ll do this a bit when we do our machine learning models!
  • We can unpack this list or tuple to be our function arguments by calling our function in a particular way.
#We want to call our find_means function with these arguments
call_args = [n5, n25, n100, n1000]
  • Call the function using *call_args (unpacking)
find_means(*call_args, decimals = 3)
[-0.223, 0.114, -0.014, 0.04]

Nice! Now we can more easily call our function too!

  • We can do the same thing with our keyword arguments.
  • Suppose our keyword arguments are stored in a dictionary
  • Can call the function using **kw_call_args (unpacking)

Define a quick function.

def print_items(name, job, number):
  print("Name is: ", name)
  print("Job is: ", job)
  print("Phone number is: ", number)
  return

Create a dictionary with key-value pairs corresponding to our inputs.

kw_call_args = {"name": "Justin Post", "job": "Professor", "number": "9195150637"}
kw_call_args
{'name': 'Justin Post', 'job': 'Professor', 'number': '9195150637'}

Call our function using ** with our dictionary!

print_items(**kw_call_args)
Name is:  Justin Post
Job is:  Professor
Phone number is:  9195150637
  • Passing named and unnamed arguments can both be done at once!
  • Recall our find_means function inputs: def find_means(*args, decimals = 4):
dec_dictionary = {"decimals": 6}
find_means(*call_args, **dec_dictionary)
[-0.223413, 0.114454, -0.014443, 0.039762]

Lambda Functions

Another thing that comes up a lot is that we need to create a quick function for a single purpose that we don’t want to reuse for later.

Rather than define a function and storing it as an object the way we’ve been doing it, we can create a lambda function (also sometimes called an in-line function or an anonymous function)

  • Use keyword lambda
  • Define arguments followed by a :
  • Give the action for the function to perform
    • Syntax requires a single line. Cannot use return or some other keywords
square_it = lambda x : x**2
square_it(10)
100
square_then_add = lambda x, y : x**2 + y
square_then_add(10, 5)
105
  • Can still define the arguments in many ways
my_print = lambda x, y = "ho": print(x, y)
my_print("hi")
hi ho
  • Can create the arbitrary positional arguments too (but then why would we be using lambda function!? This is just to show the functionality.)
my_print = lambda *x: [print("Input: " + str(z)) for z in x]
my_print("hi", "ho", "off", "to", "work", "we", "go")
Input: hi
Input: ho
Input: off
Input: to
Input: work
Input: we
Input: go
[None, None, None, None, None, None, None]

Now, saving the function function in an object is really kind of counter to the point of an anonymous (lambda) function. We don’t usually save these for later use! We’ll see many uses for lambda functions. Let’s cover one of those here.

map()

Using lambda functions comes up a lot in the map/reduce idea. This is important for some of the legacy big data operations that we’ll do!

Map/reduce idea: - Apply (or map) a function to each element of an iterable object - Combine (or reduce) the results where able

Example: Counting words - Want to take a list of words and create a tuple with the word and the value 1 - Syntax for map: + map(function, object_to_apply_function_to)

res = map(
    lambda word: (word, 1),
    ["these", "are", "my", "words", "these", "words", "rule"]
    )

Similar to other functions like range or zip, we don’t get back the actual object we think we would. Instead we get a map object that can be used to find the mapped values.

print(type(res))
res
<class 'map'>
<map at 0x7f615a9b1990>

We can convert the map object to a list using list()

list(res)
[('these', 1),
 ('are', 1),
 ('my', 1),
 ('words', 1),
 ('these', 1),
 ('words', 1),
 ('rule', 1)]

Let’s return the square of some values without defining a square function via map()

map(lambda r: r **2, range(0,5))
<map at 0x7f615a9b39d0>
list(map(lambda r: r **2, range(0,5)))
[0, 1, 4, 9, 16]

Note: this can equivalently be done using a list comprehension!

[r ** 2 for r in range(0,5)]
[0, 1, 4, 9, 16]

Another example of using map with a lambda function might be to quickly uppercase a list of strings.

list(map(lambda x: x.upper(), ['cat', 'dog', 'wolf', 'bear', 'parrot']))
['CAT', 'DOG', 'WOLF', 'BEAR', 'PARROT']

Again, this could be done with a list comprehension!

[x.upper() for x in ['cat', 'dog', 'wolf', 'bear', 'parrot']] #equivalent
['CAT', 'DOG', 'WOLF', 'BEAR', 'PARROT']

One interesting use of a lambda function is through the creation of a function generator

  • Create a function that generates functions!
  • Here a function to raise a number to a given power
def raise_power(k):
    return lambda r: r ** k
square = raise_power(2) #creates a function!
square(10)
100
cube = raise_power(3)
cube(5)
125

We can put this together with our packing idea and map!

ident, square, cube = map(raise_power, range(1,4))
ident(4)
4
square(4)
16
cube(4)
64

filter()

  • Lambda functions can be used with filter()
    • filter() takes a predicate (statement to return what you want) as the first arg and an iterable as the second
    • We can give the first argument as a lambda function

Here we want to return only vowels from a string (an iterable).

filter(lambda x: x in "aeiou", "We want to return just the vowels.")
<filter at 0x7f615a9b3f40>
#return in list form!
list(filter(lambda x: x in "aeiou", "We want to return just the vowels."))
['e', 'a', 'o', 'e', 'u', 'u', 'e', 'o', 'e']

Equivalent to doing a list comprehension with an if in there!

[x for x in "We want to return just the vowels." if x in "aeiou"] #equivalent
['e', 'a', 'o', 'e', 'u', 'u', 'e', 'o', 'e']

This time let’s use filter to only return even numbers from an iterable object.

  • Recall the mod operator %, which returns the remainder
  • A number is even if, when we divide by 2, we get 0 as the remainder
list(filter(lambda x: (x % 2) != 0, range(0, 10)))
[1, 3, 5, 7, 9]

Equivalent to this list comprehension with an if:

[x for x in range(0, 10) if (x % 2) != 0]
[1, 3, 5, 7, 9]

functools.reduce()

Lambda functions can be used with functools.reduce()

  • reduce() takes in a function of two variables and an iterable
  • It applies the function repetitively over the iterable, and returns the result

Here, we’ll find the cumulative sum of a bunch of numbers (given as an iterable)

from functools import reduce
reduce(lambda x, y: x + y, range(1,11)) # sum first 10 numbers
55

Here, reduce() works like this: - Takes the first two arguments of the iterable (1 and 2) and adds them - Takes the result of that (3) and adds it to the next argument of the iterable (3) - Repeats until the iterable is exhausted

Again, we could do this kind of thing with a list comprehension. Here we just use the sum function on the result.

sum([x for x in range(1,11)])
55

We can also provide an initial value to reduce() to start the computation at. Here we supply 45.

#add an initial value to the computation
reduce(lambda x, y: x + y, range(1,11), 45) # sum first 10 numbers + 45
100

Ok, that’s a bit silly. We can do more interesting things with this. For instance, here we write a reduce function to find the largest value in a list.

#create a list of numbers to find the max of
my_list = [53, 13, 103, 2, 15, -10, 201, 6]
reduce(lambda x, y: x if x > y else y, my_list)
201

How does that work? - Take x (53) and y (13), if x > y take x (53), otherwise take y (13) - With the result of that as x, take y as the next value in the iterable (103) - Repeat that step. Here it would keep 103 since it is larger - Keep going until the iterable is exhausted

This works with a starting value as well!

reduce(lambda x, y: x if x > y else y, my_list, 500)
500

Quick Video

This video shows an example of writing more involved functions including the use of lambda and map(). Remember to pop the video out into the full player.

The notebook written in the video is available here.

from IPython.display import IFrame
IFrame(src="https://ncsu.hosted.panopto.com/Panopto/Pages/Embed.aspx?id=55af749d-fd9d-4b0d-b2b4-b10301614c9c&autoplay=false&offerviewer=true&showtitle=true&showbrand=true&captions=false&interactivity=all", height="405", width="720")

Recap

  • Unlimited positional and keyword arguments can be handled in your function

    • position as a tuple
    • keyword as a dictionary
  • You can also pass your arguments to a function from an object!

  • lambda functions are useful for writing short functions

  • map(), filter(), and functools.reduce() are common places we use lambda functions

If you are on the course website, use the table of contents on the left or the arrows at the bottom of this page to navigate to the next learning material!

If you are on Google Colab, head back to our course website for our next lesson!