def func_name(args):
"""
Doc string
"""
bodyreturn object
More on Writing Functions
Justin Post
We’ve gone through a lot with python already!
JupyterLab
& Markdown- Basic data types
- Strings, Floats, Ints, Booleans
- Compound data types
- Lists, Tuples, Dictionaries,
Numpy
arrays,pandas
series & data frames
- Lists, Tuples, Dictionaries,
- Writing Functions
- Control flow (if/then/else, Looping)
- Data uses and basic summarizations
Next up we’ll cover a bit more about writing our own functions that will greatly increase their usefulness! After that we’ll talk about how to plot our data using matplotlib
and pandas
.
Then we’ll really be ready to start talking about modeling, data sources, and generally moving towards doing fun things with big data!
Note: These types of webpages are built from Jupyter notebooks (.ipynb
files). You can access your own versions of them by clicking here. It is highly recommended that you go through and run the notebooks yourself, modifying and rerunning things where you’d like!
Writing Functions Recap
- Writing functions is super cool!
- Recall the basic syntax
- We saw that there were many ways to set up your function arguments and to call your function
- Remember that variables defined within the function are not generally available outside of the function
- That is, a new symbol table is used when the function is called
- We can define global variables if we really want to
- We return what we want from the function with
return
- If we don’t return anything then
None
is returned!
- If we don’t return anything then
The topics we’ll cover in this notebook are: - Packing and unpacking with functions + Catching extra arguments given to a function + Passing your arguments to a function from an object - lambda functions - map()
, filter()
, and functools.reduce()
In a later notebook we’ll talk about how to handle errors or exceptions!
Packing and Unpacking
Reminder: We can pack a list by separating variables to create with commas:
first, second, third = ...
Let’s look at an example:
= ["Dog", "Cat", "Horse", "Frog", "Cow", "Buffalo", "Deer", "Fish", "Bird", "Fox", "Racoon"]
animals = animals[:3]
short_animals
= short_animals
first, second, third print(first + " " + second + " " + third)
Dog Cat Horse
We saw that we can pack leftover elements into a list using *variable
:
*other = animals
first, second, third, print(first + " " + second + " " + third)
print(other)
Dog Cat Horse
['Frog', 'Cow', 'Buffalo', 'Deer', 'Fish', 'Bird', 'Fox', 'Racoon']
Unlimited Positional Arguments
- This idea can be used when writing a function!
- In this case we define an argument to our function with
*variable
- This allows us to pass unlimited positional arguments to our function (variadic arguments)
- The inputs are handled as a
tuple
in the function!
Let’s write a silly function to print out all arguments passed via this idea
def basic_print(*args):
print(type(args))
print(args)
return None
We can pass this function as many things as we’d like and it will be accessible within the function body as a tuple. We can see this as the printed values are surrounded by (
and )
, which implies we are printing a tuple!
"hi", ["a list", "how fun"], 3, 10) basic_print(
<class 'tuple'>
('hi', ['a list', 'how fun'], 3, 10)
As tuples are iterable, we can iterate across these elements via a loop!
def basic_print_elements(*args):
for i in args:
print(type(i),i)
return None
"hi", ["a list", "how fun"], 3, 10) basic_print_elements(
<class 'str'> hi
<class 'list'> ['a list', 'how fun']
<class 'int'> 3
<class 'int'> 10
Let’s define a function that takes in as many 1D numpy
arrays or pandas
Series
the user would like and returns the means for each input.
We’ll also take an argument for the number of decimal places to return for the means.
def find_means(*args, decimals = 4):
"""
Assume that args will be a bunch of numpy arrays (1D) or pandas series
Return the mean of each, rounded to `decimals` places
"""
= []
means for x in args: #iterate over the tuple values
round(decimals))
means.append(np.mean(x).return means
- Create some data with
numpy
to send to this
import numpy as np
from numpy.random import default_rng
= default_rng(3) #set seed to 3
rng
#generate a few means from standard normal data
= rng.standard_normal(5) #sample size of 5
n5 = rng.standard_normal(25) #sample size of 25
n25 = rng.standard_normal(100) #sample size of 100
n100 = rng.standard_normal(1000) #sample size of 1000 n1000
Let’s pass these to our function!
= 2) find_means(n5, n25, n100, n1000, decimals
[-0.22, 0.11, -0.01, 0.04]
Awesome! This gives us a lot more functionality with our function writing.
Unlimited Keyword Arguments
- You can also pass unlimited keyword arguments if you define the arg with a
**
- Handled as a dictionary in the function
Let’s write a basic function to print out the keywords with their values.
def print_key_value_pairs(**kwargs):
"""
key word args can be anything
"""
print(type(kwargs), kwargs)
for x in kwargs:
print(x + " : " + str(kwargs.get(x))) #cast the value to a string for printing
Now we pass as many named arguments as we’d like!
print_key_value_pairs(= "Justin",
name = "Professor",
job = 9195150637) phone
<class 'dict'> {'name': 'Justin', 'job': 'Professor', 'phone': 9195150637}
name : Justin
job : Professor
phone : 9195150637
Unpacking Arguments
- Suppose we want to call our function but our arguments are stored in a list or tuple
- We’ll do this a bit when we do our machine learning models!
- We can unpack this list or tuple to be our function arguments by calling our function in a particular way.
#We want to call our find_means function with these arguments
= [n5, n25, n100, n1000] call_args
- Call the function using
*call_args
(unpacking)
*call_args, decimals = 3) find_means(
[-0.223, 0.114, -0.014, 0.04]
Nice! Now we can more easily call our function too!
- We can do the same thing with our keyword arguments.
- Suppose our keyword arguments are stored in a dictionary
- Can call the function using
**kw_call_args
(unpacking)
Define a quick function.
def print_items(name, job, number):
print("Name is: ", name)
print("Job is: ", job)
print("Phone number is: ", number)
return
Create a dictionary with key-value pairs corresponding to our inputs.
= {"name": "Justin Post", "job": "Professor", "number": "9195150637"}
kw_call_args kw_call_args
{'name': 'Justin Post', 'job': 'Professor', 'number': '9195150637'}
Call our function using **
with our dictionary!
**kw_call_args) print_items(
Name is: Justin Post
Job is: Professor
Phone number is: 9195150637
- Passing named and unnamed arguments can both be done at once!
- Recall our
find_means
function inputs:def find_means(*args, decimals = 4):
= {"decimals": 6}
dec_dictionary *call_args, **dec_dictionary) find_means(
[-0.223413, 0.114454, -0.014443, 0.039762]
Lambda Functions
Another thing that comes up a lot is that we need to create a quick function for a single purpose that we don’t want to reuse for later.
Rather than define a function and storing it as an object the way we’ve been doing it, we can create a lambda function (also sometimes called an in-line function or an anonymous function)
- Use keyword
lambda
- Define arguments followed by a
:
- Give the action for the function to perform
- Syntax requires a single line. Cannot use
return
or some other keywords
- Syntax requires a single line. Cannot use
= lambda x : x**2
square_it 10) square_it(
100
= lambda x, y : x**2 + y
square_then_add 10, 5) square_then_add(
105
- Can still define the arguments in many ways
= lambda x, y = "ho": print(x, y)
my_print "hi") my_print(
hi ho
- Can create the arbitrary positional arguments too (but then why would we be using lambda function!? This is just to show the functionality.)
= lambda *x: [print("Input: " + str(z)) for z in x]
my_print "hi", "ho", "off", "to", "work", "we", "go") my_print(
Input: hi
Input: ho
Input: off
Input: to
Input: work
Input: we
Input: go
[None, None, None, None, None, None, None]
Now, saving the function function in an object is really kind of counter to the point of an anonymous (lambda) function. We don’t usually save these for later use! We’ll see many uses for lambda functions. Let’s cover one of those here.
map()
Using lambda functions comes up a lot in the map/reduce
idea. This is important for some of the legacy big data operations that we’ll do!
Map/reduce idea: - Apply (or map) a function to each element of an iterable object - Combine (or reduce) the results where able
Example: Counting words - Want to take a list of words and create a tuple with the word and the value 1 - Syntax for map
: + map(function, object_to_apply_function_to)
= map(
res lambda word: (word, 1),
"these", "are", "my", "words", "these", "words", "rule"]
[ )
Similar to other functions like range
or zip
, we don’t get back the actual object we think we would. Instead we get a map
object that can be used to find the mapped values.
print(type(res))
res
<class 'map'>
<map at 0x7f615a9b1990>
We can convert the map
object to a list using list()
list(res)
[('these', 1),
('are', 1),
('my', 1),
('words', 1),
('these', 1),
('words', 1),
('rule', 1)]
Let’s return the square of some values without defining a square function via map()
map(lambda r: r **2, range(0,5))
<map at 0x7f615a9b39d0>
list(map(lambda r: r **2, range(0,5)))
[0, 1, 4, 9, 16]
Note: this can equivalently be done using a list comprehension!
** 2 for r in range(0,5)] [r
[0, 1, 4, 9, 16]
Another example of using map with a lambda function might be to quickly uppercase a list of strings.
list(map(lambda x: x.upper(), ['cat', 'dog', 'wolf', 'bear', 'parrot']))
['CAT', 'DOG', 'WOLF', 'BEAR', 'PARROT']
Again, this could be done with a list comprehension!
for x in ['cat', 'dog', 'wolf', 'bear', 'parrot']] #equivalent [x.upper()
['CAT', 'DOG', 'WOLF', 'BEAR', 'PARROT']
One interesting use of a lambda function is through the creation of a function generator
- Create a function that generates functions!
- Here a function to raise a number to a given power
def raise_power(k):
return lambda r: r ** k
= raise_power(2) #creates a function!
square 10) square(
100
= raise_power(3)
cube 5) cube(
125
We can put this together with our packing idea and map!
= map(raise_power, range(1,4))
ident, square, cube 4) ident(
4
4) square(
16
4) cube(
64
filter()
- Lambda functions can be used with
filter()
filter()
takes a predicate (statement to return what you want) as the first arg and an iterable as the second- We can give the first argument as a lambda function
Here we want to return only vowels from a string (an iterable).
filter(lambda x: x in "aeiou", "We want to return just the vowels.")
<filter at 0x7f615a9b3f40>
#return in list form!
list(filter(lambda x: x in "aeiou", "We want to return just the vowels."))
['e', 'a', 'o', 'e', 'u', 'u', 'e', 'o', 'e']
Equivalent to doing a list comprehension with an if
in there!
for x in "We want to return just the vowels." if x in "aeiou"] #equivalent [x
['e', 'a', 'o', 'e', 'u', 'u', 'e', 'o', 'e']
This time let’s use filter to only return even numbers from an iterable object.
- Recall the mod operator
%
, which returns the remainder - A number is even if, when we divide by 2, we get 0 as the remainder
list(filter(lambda x: (x % 2) != 0, range(0, 10)))
[1, 3, 5, 7, 9]
Equivalent to this list comprehension with an if
:
for x in range(0, 10) if (x % 2) != 0] [x
[1, 3, 5, 7, 9]
functools.reduce()
Lambda functions can be used with functools.reduce()
reduce()
takes in a function of two variables and an iterable- It applies the function repetitively over the iterable, and returns the result
Here, we’ll find the cumulative sum of a bunch of numbers (given as an iterable)
from functools import reduce
reduce(lambda x, y: x + y, range(1,11)) # sum first 10 numbers
55
Here, reduce()
works like this: - Takes the first two arguments of the iterable (1 and 2) and adds them - Takes the result of that (3) and adds it to the next argument of the iterable (3) - Repeats until the iterable is exhausted
Again, we could do this kind of thing with a list comprehension. Here we just use the sum function on the result.
sum([x for x in range(1,11)])
55
We can also provide an initial value to reduce()
to start the computation at. Here we supply 45.
#add an initial value to the computation
reduce(lambda x, y: x + y, range(1,11), 45) # sum first 10 numbers + 45
100
Ok, that’s a bit silly. We can do more interesting things with this. For instance, here we write a reduce function to find the largest value in a list.
#create a list of numbers to find the max of
= [53, 13, 103, 2, 15, -10, 201, 6]
my_list reduce(lambda x, y: x if x > y else y, my_list)
201
How does that work? - Take x (53) and y (13), if x > y take x (53), otherwise take y (13) - With the result of that as x, take y as the next value in the iterable (103) - Repeat that step. Here it would keep 103 since it is larger - Keep going until the iterable is exhausted
This works with a starting value as well!
reduce(lambda x, y: x if x > y else y, my_list, 500)
500
Quick Video
This video shows an example of writing more involved functions including the use of lambda
and map()
. Remember to pop the video out into the full player.
The notebook written in the video is available here.
from IPython.display import IFrame
="https://ncsu.hosted.panopto.com/Panopto/Pages/Embed.aspx?id=55af749d-fd9d-4b0d-b2b4-b10301614c9c&autoplay=false&offerviewer=true&showtitle=true&showbrand=true&captions=false&interactivity=all", height="405", width="720") IFrame(src
Recap
Unlimited positional and keyword arguments can be handled in your function
- position as a tuple
- keyword as a dictionary
You can also pass your arguments to a function from an object!
lambda functions are useful for writing short functions
map()
,filter()
, andfunctools.reduce()
are common places we use lambda functions
If you are on the course website, use the table of contents on the left or the arrows at the bottom of this page to navigate to the next learning material!
If you are on Google Colab, head back to our course website for our next lesson!