Difference between revisions of "OPS435 Python Lab 4"

From CDOT Wiki
Jump to: navigation, search
(Investigation 2 - Part 1 - String Basics)
(Create a Python Script Demonstrating Comparing Lists)
(317 intermediate revisions by 6 users not shown)
Line 1: Line 1:
 
= OBJECTIVES =
 
= OBJECTIVES =
:The first investigation of this lab we will be working with different data structures. These are stored in a similar way to variables and lists, however they can contain a lot more information and are designed for specific purposes. Each structure has its own advantages and disadvantages, this lab will emphasize where those important differences lay. The second investigation will focus closely on strings. We have been using and storing strings since our first class, however in this lab we will dive into the more complex nature of string manipulation. Finally, this lab will cover how to use a variety of different regular expression functions, for searching and input validation.
 
  
=== Python Reference ===
+
:The '''first investigation''' in this lab will focus on '''Data Structures'''. Each data structure has its own advantages and limitations. This lab will emphasize the most important differences between them.
 +
 
 +
:The '''second investigation''' will focus on strings. You have been using and storing strings since our first class, however in this lab we will dive into the more complex nature of string manipulation. Finally, this lab will cover how to use a variety of regular expression functions for searching and input validation.
 +
 
 +
=== PYTHON REFERENCE ===
 +
 
 +
:As you develop your Python scripting skills, you may start to be "overwhelmed" with the volume of information that you have absorbed over these labs. One way to help is to learn to use online references effectively in order to obtain information regarding Python scripting techniques and tools.
 +
 
 +
:Below is a table with links to useful online Python reference sites (by category). You may find these references useful when performing assignments, etc.
 +
 
 +
{| class="wikitable" | style="margin-left:20px; border: 2px solid black;"
 +
|- style="border: 2px solid black;font-weight:bold;text-align:center;"
 +
| style="border: 2px solid black;" | Data Structures
 +
| style="border: 2px solid black;" | Lists & List Comprehension
 +
| style="border: 2px solid black;" | Strings
 +
| style="border: 2px solid black;" | Regular Expressions
 +
| style="border: 2px solid black;" | Miscellaneous
 +
 
 +
|-  style="background-color:white;border:none;"
 +
| style="border: 2px solid black;" valign="top"| 
 +
:*[https://docs.python.org/3/tutorial/datastructures.html#tuples-and-sequences Tuples]
 +
:*[https://docs.python.org/3/tutorial/datastructures.html#sets Sets]
 +
| style="border: 2px solid black;" | 
 +
:*[https://docs.python.org/3/tutorial/introduction.html#lists Lists]
 +
:*[https://docs.python.org/3/tutorial/datastructures.html#more-on-lists More on Lists]
 +
:*[https://docs.python.org/3/tutorial/datastructures.html#list-comprehensions List Comprehensions]
 +
| style="border: 2px solid black;" valign="top"| 
 +
:*[https://docs.python.org/3/tutorial/introduction.html#strings Strings]
 +
:*[https://docs.python.org/3/library/string.html String Comparisons]
 +
| style="border: 2px solid black;" valign="top"| 
 +
:*[https://docs.python.org/3/library/re.html Regular Expression Operations]
 +
:*[https://docs.python.org/3/howto/regex.html Regular Expressions (HOWTO)]
 +
| style="border: 2px solid black;" valign="top"| 
 +
:*[https://docs.python.org/3/tutorial/datastructures.html#dictionaries Dictionaries]
 +
 
 +
|}
  
 
= INVESTIGATION 1: DATA STRUCTURES =
 
= INVESTIGATION 1: DATA STRUCTURES =
== PART 1 - Tuple ==
 
A Python Tuple is a number of immutable Python values. This is similar to a list in a lot of ways, except that, you cannot change the values inside.
 
<pre>
 
ipython
 
t1 = ('Prime', 'Ix', 'Secundus', 'Caladan')
 
t2 = (1, 2, 3, 4, 5, 6)
 
</pre>
 
  
Values from a tuple can be retreived in the same way as a list.
+
:In this investigation, you will learn several data structures commonly used in Python scripting. These tools include '''lists''', '''tuples''', '''sets''', and '''dictionaries'''.
<pre>
+
t1[0]
+
== PART 1 - Tuples ==
t2[2:4]
 
</pre>
 
  
Or check if a value exists inside a tuple.
+
:Many often confuse a '''tuple''' with a '''list''' (which you learned about in a previous lab). A '''tuple''' is a type of list whose values cannot be changed. In fact, nothing in a tuple can be changed after it's created (like adding, removing list elements).
<pre>
 
'Ix' in t1
 
'Geidi' in t1
 
</pre>
 
  
Try changing a tuple value.
+
:There are many advantages to using tuples when creating Python scripts:
<pre>
 
t2[1] = 10
 
</pre>
 
  
Did it work? Once created the tuple values will not be able to change. If you would like a tuple with different values than the tuple you currently have, you must create a new one.
+
::*'''Data protection''' (eg. values are are NOT allowed to change so you won't modify them accidentally)
<pre>
+
::*Tuples can be used as '''keys in data dictionaries''' (which are NOT allowed to change)
t3 = t2[2:3]
+
::*Tuples allow for '''faster access''' than lists
</pre>
 
  
You however can still use most of the basic operations you might expect from tuples.
+
:The term to indicate that a data structure cannot be changed is called '''immutable''' (as opposed to ''"mutable"'' which means the data structure can be changed).
<pre>
 
len(t1)    # list the length of the tuple
 
t1 * 3      # repitition
 
t1 + t2    # concatenation, remember this is creating a new tuple, not modifying
 
</pre>
 
  
Like lists, you can also loop through the values of tuples.
+
:'''Perform the Following Steps:'''
<pre>
+
:#Let's create two tuples in a temporary Python file, so we can learn how to use them and learn how they differ from lists.<br><br>Note: '''tuples are defined by using parenthesis ( )''' as opposed to '''lists which are defined by using square brackets [ ]'''<source lang="python">
 +
t1 = ('Prime', 'Ix', 'Secundus', 'Caladan')
 +
t2 = (1, 2, 3, 4, 5, 6)
 +
</source>
 +
:#Values from a tuple can be retrieved in the same way as a list. For example:<source lang="python">
 +
print(t1[0])
 +
print(t2[2:4])
 +
</source>
 +
:#You can also check to see whether a value exists inside a tuple or not. To demonstrate try:<source lang="python">
 +
print('Ix' in t1)
 +
print('Geidi' in t1)
 +
</source>Let's now see how a tuple differs from a list. We will now create a list and note the difference between them:<source lang="python">
 +
list2 = [ 'uli101', 'ops235', 'ops335', 'ops435', 'ops535', 'ops635' ]
 +
</source>
 +
:#See if you can change the value of your list:<source lang="python">
 +
list2[0]= 'ica100'
 +
print(list2[0])
 +
print(list2)
 +
</source>You should have been successful in changing the value of your list.<br><br>
 +
:#Now, try changing the value of your previously-created tuple:<source lang="python">
 +
t2[1] = 10
 +
</source>Did it work? Once created the tuple values will not be able to change.<br><br>If you would like a tuple with different values than the tuple you currently have, then you must create a new one.<br><br>
 +
:#The following creates a new tuple (t3) with a contents from a slice of the t2 tuple. Slicing works the same way for tuples as for lists:<source lang="python">
 +
t3 = t2[2:3]
 +
</source>
 +
:#Also, as with lists, you can use for loops to iterate the values of tuples:<source lang="python">
 
for item in t1:
 
for item in t1:
 
     print('item: ' + item)
 
     print('item: ' + item)
</pre>
+
</source>
  
== PART 2 - Set ==
+
== PART 2 - Sets ==
Sets are another very similar structure to lists, they can also be modified and changed, unlike the tuple. But sets have two unique characteristics, they are unordered, and they cannot have duplicate values. The unordered part provides a added performance from hashing the values, but also means we cannot pull out a specific value at a spefici position. Any duplicate entries will immediately be deleted. Sets however are great tools for doing comparisons, finding differences in multiple sets, or finding similarities. The best part about sets are, they are fast!
 
  
Lets create a couple sets to work with.
+
:So far, you have been exposed to two structures that are used to contain data: ''lists'' and ''tuples''. You can modify the values within a list as well as modify the structure of a list (i.e. add and remove elements), whereby you cannot with a tuple.
<pre>
+
 
 +
:In this section, you will learn about '''sets'''. A set has similar characteristics as a list, but there are two major differing characteristics:
 +
 
 +
::*Sets are '''un-ordered'''
 +
::*Sets '''cannot contain duplicate values'''
 +
 
 +
:Since new duplicate entries will be automatically removed when using sets, they are very useful for performing tasks such as '''comparisons''': '''finding similarities or differences in multiple sets'''.
 +
 
 +
:#Create some sets to work with in a temporary Python file:<source lang="python">
 
s1 = {'Prime', 'Ix', 'Secundus', 'Caladan'}
 
s1 = {'Prime', 'Ix', 'Secundus', 'Caladan'}
 
s2 = {1, 2, 3, 4, 5}
 
s2 = {1, 2, 3, 4, 5}
 
s3 = {4, 5, 6, 7, 8}
 
s3 = {4, 5, 6, 7, 8}
</pre>
+
</source>Note: '''Sets are defined by using braces { }''' as opposed to tuples which use parenthesis ( ), or lists which use square brackets [ ]'''<br /><br />
 +
:#Try to access a set through the index:<source lang="python">
 +
print(s1[0])
 +
</source>This should have caused an '''error'''. You cannot access data inside a set this way because the elements inside are '''unordered'''. Instead, you should use the '''in''' method to check to see whether a value is contained in the set:<source lang="python">
 +
print('Ix' in s1)
 +
print('Geidi' in s1)
 +
</source><br>'''Sets can be combined''', but it is important to note that any '''duplicate values (shared among sets) will be deleted'''.<br><br>
 +
:#Print the contents of the sets and note the values that are common:<source lang="python">
 +
print(s2)
 +
print(s3)
 +
</source>
 +
:#This is how you get a set containing only UNIQUE values (no duplicates) from both sets:<source>
 +
print(s2 | s3)        # returns a set containing all values from both sets
 +
print(s2.union(s3))    # same as s2 | s3
 +
</source>Notice that both methods above have the same result, which one you choose depends purely on your style.<br><br>Instead of combining sets, we can display '''values that are common to both sets'''. This is known in mathematical terms as an '''intersection''' between the lists:<source lang="python">
 +
print(s2 & s3)            # returns a set containing all values that s2 and s3 share
 +
print(s2.intersection(s3)) # same as s2 & s3
 +
</source>
 +
:#Sets can also have their values compared against other sets. First find out what items are in '''s2''' but not in '''s3'''. This is also called a '''difference''':<source lang="python">
 +
print(s2)
 +
print(s3)
 +
print(s2 - s3)            # returns a set containing all values in s2 that are not found in s3
 +
print(s2.difference(s3))  # same as s2 - s3
 +
</source>
 +
:#In order to see <u>every</u> difference between both sets, you need to find the '''symmetric difference'''. This will return a set that shows all numbers that both sets do not share together:<source lang="python">
 +
print(s2 ^ s3)                    # returns a set containing all values that both sets DO NOT share
 +
print(s2.symmetric_difference(s3)) # same as s2 ^ s3
 +
</source>Note: the '''set()''' function can convert lists into sets, and the '''list()''' function can convert sets into lists. The operations in this section can only be applied to sets, so if you need to perform a union, intersection, or difference between lists, you need to convert them to sets first. For example:<source lang="python">
 +
l2 = [1, 2, 3, 4, 5]
 +
l3 = [4, 5, 6, 7, 8]
 +
temporary_set = set(l2).intersection(set(l3))
 +
new_list = list(temporary_set)  # '''set()''' can make lists into sets. '''list()''' can make sets into lists.
 +
print(new_list)
 +
</source>
 +
 
 +
=== Create a Python Script Demonstrating Comparing Sets ===
 +
:'''Perform the Following Instructions'''
 +
:#Create the '''~/ops435/lab4/lab4a.py''' script. The purpose of this script will be to demonstrate the different way of comparing sets. There will be three functions, each returning a different set comparison.
 +
:#Use the following template to get started:<source lang="python">
 +
#!/usr/bin/env python3
 +
 
 +
def join_sets(s1, s2):
 +
    # join_sets will return a set that contains every value from both s1 and s2
 +
 
 +
def match_sets(s1, s2):
 +
    # match_sets will return a set that contains all values found in both s1 and s2
 +
 
 +
def diff_sets(s1, s2):
 +
    # diff_sets will return a set that contains all different values which are not shared between the sets
 +
 
 +
if __name__ == '__main__':
 +
    set1 = set(range(1,10))
 +
    set2 = set(range(5,15))
 +
    print('set1: ', set1)
 +
    print('set2: ', set2)
 +
    print('join: ', join_sets(set1, set2))
 +
    print('match: ', match_sets(set1, set2))
 +
    print('diff: ', diff_sets(set1, set2))
 +
 
 +
</source>
 +
 
 +
::*The join_sets() function should return a set that contains all values from both sets
 +
::*The match_sets() function should return a set that contains all values found in both sets
 +
::*The diff_sets() function should return a set that contains all values which are not shared between both sets
 +
::*All three functions should accept '''two arguments''' both are sets
 +
::*The script should show the exact output as the samples
 +
::*The script should contain no errors
 +
 
 +
:::'''Sample Run 1:'''<source>
 +
./lab4a.py
 +
set1:  {1, 2, 3, 4, 5, 6, 7, 8, 9}
 +
set2:  {5, 6, 7, 8, 9, 10, 11, 12, 13, 14}
 +
join:  {1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14}
 +
match:  {8, 9, 5, 6, 7}
 +
diff:  {1, 2, 3, 4, 10, 11, 12, 13, 14}
 +
</source>
 +
:::'''Sample Run 2 (with import):'''<source>
 +
import lab4a
 +
set1 = {1,2,3,4,5}
 +
set2 = {2,1,0,-1,-2}
 +
print(lab4a.join_sets(set1,set2))
 +
# Will output {-2, -1, 0, 1, 2, 3, 4, 5}
 +
print(lab4a.match_sets(set1,set2))
 +
# Will output {1, 2}
 +
print(lab4a.diff_sets(set1,set2))
 +
# Will output {-2, -1, 0, 3, 4, 5}
 +
</source>
 +
<ol><li value='3' style="margin-left:25px;">Download the checking script and check your work. Enter the following commands from the bash shell:<source lang="bash">
 +
cd ~/ops435/lab4/
 +
pwd #confirm that you are in the right directory
 +
ls CheckLab4.py || wget https://raw.githubusercontent.com/Seneca-CDOT/ops435/master/LabCheckScripts/CheckLab4.py
 +
python3 ./CheckLab4.py -f -v lab4a
 +
</source></li>
 +
<li style="margin-left:25px;">Before proceeding, make certain that you identify all errors in lab4a.py. When the checking script tells you everything is OK - proceed to the next step.</li></ol>
  
First, try accessing a set through the index.
+
=== Create a Python Script Demonstrating Comparing Lists ===
<pre>
+
:'''Perform the Following Instructions'''
s1[0]
+
:#Create the '''~/ops435/lab4/lab4b.py''' script. The purpose of this script will be to improve the previous script to perform the same joins, matches, and diffs, but this time on lists.  
</pre>
+
:#Use the following as a template:<source lang="python">
 +
#!/usr/bin/env python3
  
You should have received a error, this is not how you access data inside a set because they are unordered. Instead you can check to see if a value is inside.
+
def join_lists(l1, l2):
<pre>
+
    # join_lists will return a list that contains every value from both l1 and l2
'Ix' in s1
 
'Geidi' in s1
 
</pre>
 
  
If you would like to combine sets together you can. Any duplicates that the 2 sets share, will be deleted. Take a close look at which items are shared between the sets.
+
def match_lists(l1, l2):
<pre>
+
    # match_lists will return a list that contains all values found in both l1 and l2
s2
 
s3
 
s2 | s3        # returns a set containing all values from both sets
 
s2.union(s3)    # same as s2 | s3
 
</pre>
 
  
Instead of combining sets, we can find out what values are in both sets. This is a intersection between the lists.
+
def diff_lists(l1, l2):
<pre>
+
    # diff_lists will return a list that contains all different values, which are not shared between the lists
s2
 
s3
 
s2 & s3            # returns a set containing all values that s2 and s3 share
 
s2.intersection(s3) # same as s2 & s3
 
</pre>
 
  
 +
if __name__ == '__main__':
 +
    list1 = list(range(1,10))
 +
    list2 = list(range(5,15))
 +
    print('list1: ', list1)
 +
    print('list2: ', list2)
 +
    print('join: ', join_lists(list1, list2))
 +
    print('match: ', match_lists(list1, list2))
 +
    print('diff: ', diff_lists(list1, list2))
 +
</source>
 +
::*The match_lists() function should return a list that contains all values found in both lists
 +
::*The diff_lists() function should return a list that contains all values which are not shared between both lists
 +
::*The join_lists() function should return a list that contains all values from both lists
 +
::*All three functions should accept '''two arguments''' both are lists
 +
::*The script should show the exact output as the samples
 +
::*The script should contain no errors
 +
:::'''Sample Run 1:'''<source>
 +
./lab4b.py
 +
list1:  [1, 2, 3, 4, 5, 6, 7, 8, 9]
 +
list2:  [5, 6, 7, 8, 9, 10, 11, 12, 13, 14]
 +
join:  [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14]
 +
match:  [5, 6, 7, 8, 9]
 +
diff:  [1, 2, 3, 4, 10, 11, 12, 13, 14]
 +
</source>
  
Lets look at how we can compare the values inside sets. First lets find out what items are in '''s2''' but not in '''s3'''. This is also called a difference. But notice that it only shows values that '''s2''' contains, specifically values that '''s3''' doesn't have. So this isn't really the true difference between the sets.
+
:::'''Sample Run 2 (with import) under interactive python shell:'''<source>
<pre>
+
import lab4b
s2
+
list1 = [1,2,3,4,5]
s3
+
list2 = [2,1,0,-1,-2]
s2 - s3            # returns a set containing all values in s2 that are not found s3
+
print(lab4b.join_lists(list1,list2)))
s2.difference(s3)  # same as s2 - s3
+
# Will output [0, 1, 2, 3, 4, 5, -2, -1]
</pre>
+
print(lab4b.match_lists(list1,list2))                                                                                                               
 +
# Will output [1, 2]
 +
print(lab4b.diff_lists(list1,list2))                                                                                                                 
 +
# Will output [0, 3, 4, 5, -2, -1]
 +
</source>
 +
::3. Download the checking script and check your work. Enter the following commands from the bash shell.<source lang="bash">
 +
cd ~/ops435/lab4/
 +
pwd #confirm that you are in the right directory
 +
ls CheckLab4.py || wget matrix.senecac.on.ca/~acoatley-willis/CheckLab4.py
 +
python3 ./CheckLab4.py -f -v lab4b
 +
</source>
 +
::4. Before proceeding, make certain that you identify all errors in lab4b.py. When the checking script tells you everything is OK - proceed to the next step.
  
If we want to see every difference between both sets, we can find the symmetric difference. This will return a set that shows all numbers that both sets do not share together.
+
== PART 3 - Dictionaries ==
<pre>
+
 
s2
+
::In Python, a '''dictionary''' is a set of key-value pairs. Dictionaries are '''unordered''', like sets, however any value can be retrieved from a dictionary if you know the key. This section will go over how to create, access, and change dictionaries, giving you a new powerful tool to store and manipulate data.
s3
+
 
s2 ^ s3                    # returns a set containing all values that both sets DO NOT share
+
::'''Perform the Following Steps:'''
s2.symmetric_difference(s3) # same as s2 ^ s3
+
::#Let's begin by creating a new dictionary in a temporary Python file:<source lang="python">
</pre>
+
dict_york = {'Address': '70 The Pond Rd', 'City': 'Toronto', 'Postal Code': 'M3J3M6'}
 +
</source>You should note that the syntax to define a dictionary is similar to defining sets (i.e. using '''{}'''), but unlike sets dictionaries use '''<code>key:value</code>''' pairs within the dictionary, each ''key:value'' pair is separated by commas.
 +
::#All the values in a dictionary can be retrieved by using the '''dictionary.values()''' function. This particular function provides a '''list''' containing all values:<source lang="python">
 +
print(dict_york.values())
 +
</source>All keys to access the ''key:pair'' values within a dictionary can be retrieved using the '''dictionary.keys()''' function:<source lang="python">
 +
dict_york.keys()
 +
</source>We can retrieve <u>individual</u> values from a dictionary by providing the key associated with the value:<source lang="python">
 +
print(dict_york['Address'])
 +
print(dict_york['Postal Code'])
 +
</source>
 +
::#Dictionary keys can be any '''immutable''' values (i.e. not permitted for value to be changed). Types of values include: '''strings''', '''numbers''', and '''tuples'''.
 +
::#Try adding a new key and value to the dictionary:<source lang="python">
 +
dict_york['Country'] = 'Canada'
 +
print(dict_york)
 +
print(dict_york.values())
 +
print(dict_york.keys())
 +
</source>
 +
::#Let's change the province value to BC:<source lang="python">
 +
dict_york['Province'] = 'BC'
 +
print(dict_york)
 +
print(dict_york.values())
 +
print(dict_york.keys())
 +
</source>'''WARNING: Dictionary keys must be unique'''. Attempting to add a key that already exists in the dictionary will <u>overwrite</u> the existing value for that key! For example:<source lang="python">
 +
dict_york['Province'] = 'ON'
 +
print(dict_york)
 +
print(dict_york.values())
 +
print(dict_york.keys())
 +
</source>You should notice that value for the 'Province' key has been changed back to 'ON'.<br><br>The lists that contain the values and keys of the dictionary are not <u>real</u> python lists - they are "views of the dictionary" and therefore are <u>immutable</u>. You could change these views into usable lists by using the '''list()''' function:<source lang="python">
 +
list_of_keys = list(dict_york.keys())
 +
print(list_of_keys[0])
 +
</source>
 +
::#Lists can be used with '''for loops''':<source lang="python">
 +
list_of_keys = list(dict_york.keys())
 +
for key in list_of_keys:
 +
    print(key)
 +
for value in dict_york.values():
 +
    print(value)
 +
</source>
 +
 
 +
=== Create a Python Script for Managing Dictionaries ===
 +
:'''Perform the Following Instructions'''
 +
::#Create the '''~/ops435/lab4/lab4c.py''' script. The purpose of this script will be to create dictionaries, extract data from dictionaries, and to make comparisons between dictionaries.
 +
::#Use the following as a template:<source lang="python">
 +
#!/usr/bin/env python3
 +
 
 +
# Dictionaries
 +
dict_york = {'Address': '70 The Pond Rd', 'City': 'Toronto', 'Country': 'Canada', 'Postal Code': 'M3J3M6', 'Province': 'ON'}
 +
dict_newnham = {'Address': '1750 Finch Ave E', 'City': 'Toronto', 'Country': 'Canada', 'Postal Code': 'M2J2X5', 'Province': 'ON'}
 +
# Lists
 +
list_keys = ['Address', 'City', 'Country', 'Postal Code', 'Province']
 +
list_values = ['70 The Pond Rd', 'Toronto', 'Canada', 'M3J3M6', 'ON']
 +
 
 +
def create_dictionary(keys, values):
 +
    # Place code here - refer to function specifics in section below
 +
 
 +
def shared_values(dict1, dict2):
 +
    # Place code here - refer to function specifics in section below
  
Since these powerful features can be so useful and efficient, you may want to try applying them to lists. There are some added steps required if you want to use these functions on lists. First convert the list to a set, perform the set comparison or function, convert the set back to a list.
 
<pre>
 
l2 = [1, 2, 3, 4, 5]
 
l3 = [4, 5, 6, 7, 8]
 
new_list = list(set(l2).intersection(set(l3)))  # set() can make lists into sets. list() can make sets into lists
 
new_list
 
</pre>
 
  
== PART 3 - Dictionary ==
+
if __name__ == '__main__':
 +
    york = create_dictionary(list_keys, list_values)
 +
    print('York: ', york)
 +
    common = shared_values(dict_york, dict_newnham)
 +
    print('Shared Values', common)
 +
</source>
  
== PART 4 - List Comprehension ==
+
:::*The script should contain '''two''' functions:
 +
:::::'''create_dictionary()'''<ol><li>'''accepts''' two lists as arguments keys and values, '''combines''' these lists together to '''create''' a dictionary<br>('''Tip:''' use a while loop to access elements in both the keys and values lists at the same time)</li><li>'''returns a dictionary''' that has the keys and associated values from the lists</li></ol>
 +
:::::'''shared_values()''' <ol><li>'''accepts''' two dictionaries as arguments and '''finds''' all values that are shared between the two dictionaries<br>('''Tip:''' generate sets containing only values for each dictionary, then use a function mentioned in a previous section to store the values that are common to <u>both</u> lists)</li><li>'''returns a set''' containing '''ONLY values''' found in '''BOTH dictionaries'''</li></ol>
 +
:::*make sure the functions have the correct number of arguments required
 +
:::*The script should show the exact output as the samples
 +
:::*The script should contain no errors
 +
::::'''Sample Run 1:'''<source>
 +
./lab4c.py
 +
York:  {'Country': 'Canada', 'Postal Code': 'M3J3M6', 'Address': '70 The Pond Rd', 'Province': 'ON', 'City': 'Toronto'}
 +
Shared Values {'Canada', 'ON', 'Toronto'}
 +
</source>
 +
::::'''Sample Run 2 (with import):'''<source>
 +
import lab4c
 +
dict_york = {'Address': '70 The Pond Rd', 'City': 'Toronto', 'Country': 'Canada', 'Postal Code': 'M3J3M6', 'Province': 'ON'}
 +
dict_newnham = {'Address': '1750 Finch Ave E', 'City': 'Toronto', 'Country': 'Canada', 'Postal Code': 'M2J2X5', 'Province': 'ON'}
 +
list_keys = ['Address', 'City', 'Country', 'Postal Code', 'Province']
 +
list_values = ['70 The Pond Rd', 'Toronto', 'Canada', 'M3J3M6', 'ON']
  
We've already covered lists to a degree. Lets move into more advanced functions to use and generate lists. This is a very common practice in Python, understanding how to generate, manipulate, and apply functions to items inside a list can be incredibly useful. List comprehension is a way to build new lists from existing list.
+
york = lab4c.create_dictionary(list_keys, list_values)
  
Lets start with creating a list and applying some function to each item in the list. The below will print out the square of each item.
+
print(york)
<pre>
+
# Will print: {'Address': '70 The Pond Rd',
l1 = [1, 2, 3, 4, 5]
+
              'City': 'Toronto',
for item in l1:
+
              'Country': 'Canada',
    print(item ** 2)
+
              'Postal Code': 'M3J3M6',
</pre>
+
              'Province': 'ON'}
  
If we would like to store these squares for later use, we can create a new list and append the squares to it. This will generate a new list that contains squared values in the same positions of the first list. What we are doing is using an existing list to create a new list.
+
common = lab4c.shared_values(dict_york, dict_newnham)
<pre>
 
l1 = [1, 2, 3, 4, 5]
 
l2 = []
 
for item in l1:
 
    l2.append(item ** 2)
 
l1
 
l2
 
</pre>
 
  
Lets take another step here. Lets move the squaring of numbers out into it's own separate function. While the squaring example is a simple function, this example could include a more complex functions that does a lot more processing on each item in the list.
+
print(common)
<pre>
+
# Will print: {'Canada', 'ON', 'Toronto'}
def square(number):
+
</source>
    return number ** 2
+
:::3. Download the checking script and check your work. Enter the following commands from the bash shell.<source lang="bash">
 +
cd ~/ops435/lab4/
 +
pwd #confirm that you are in the right directory
 +
ls CheckLab4.py || wget https://raw.githubusercontent.com/Seneca-CDOT/ops435/master/LabCheckScripts/CheckLab4.py
 +
python3 ./CheckLab4.py -f -v lab4c
 +
</source>
 +
:::4. Before proceeding, make certain that you identify all errors in lab4c.py. When the checking script tells you everything is OK proceed to the next step.
  
l1 = [1, 2, 3, 4, 5]
+
= INVESTIGATION 2: STRINGS =
l2 = []
+
:Strings are basically a list of characters (bits of text). This section will investigate strings in more detail such as '''cutting strings into sub-strings''', '''joining strings''', '''formatting strings''', '''searching through strings''', and '''matching strings against patterns'''.<br><br>Strings are '''immutable''' data objects - this means that once a string is created, it <u>cannot</u> be modified. In order to make a change inside a string, you would first make a copy of the part of the string (i.e. sub-string) for manipulation.
for item in l1:
 
    l2.append(square(item))
 
  
l1
+
== PART 1 - Strings and Substrings ==
l2
+
:This first part will explain basic concepts of using strings, printing strings, and manipulating sub-strings.
</pre>
 
  
 +
:'''Perform the Following Steps:'''
 +
:#Create some strings in a temporary Python file:<source lang="python">
 +
course_name = 'Open System Automation'
 +
course_code = 'OPS435'
 +
course_number = 435
 +
</source>Strings can contain any '''characters''' inside them, whether they are '''letters''', '''numbers''', or '''symbols'''.
 +
:#Strings can also be '''concatenated''' (i.e. "combined together") by using the '''+''' sign, just make sure string are only concatenating strings with strings (no lists, no numbers, no dictionaries, etc.):<source lang="python">
 +
print(course_name)
 +
print(course_code)
 +
print(str(course_number))
 +
print(course_name + ' ' + course_code + ' ' + str(course_number))
 +
</source>When using the '''print()''' function, you can display '''special characters'''. One such special character is the  is the newline character (denoted by the symbol:  '''\n'''). This allows you to separate content between new lines or empty lines:<source lang="python">
 +
print('Line 1\nLine 2\nLine 3\n')
 +
</source>
 +
:#Strings have many built-in functions that we can use to manipulate text. [https://docs.python.org/3/library/stdtypes.html#string-methods Here's a list].
 +
:#Lets try out several different functions:<source lang="python">
 +
print(course_name.lower())        # Returns a string in lower-case letters
 +
print(course_name.upper())        # Returns a string in upper-case letters
 +
print(course_name.swapcase())      # Returns a string with upper-case and lower-case letters swapped
 +
print(course_name.title())        # Returns a string with upper-case first letter of each word, lowercase for remaining text
 +
print(course_name.capitalize())    # Returns a string with upper-case first letter only, lowercase for remaining text
 +
</source>
 +
:#These values can be saved inside new strings and then reused:<source lang="python">
 +
lower_name = course_name.lower()    # Save returned string lower-case string inside new string variable
 +
print(lower_name)
 +
</source>
 +
:#If a string contains many values separated by a single character, such as a space, the string can be split on those values and create a list of values<source>
 +
lower_name.split(' ')      # Provide the split() function with a character to split on
 +
</source>The above example will return a list of strings, which we can access just like all of lists. <br><br>
 +
:#Let's practice more string manipulation:<source lang="python">
 +
list_of_strings = lower_name.split(' ')    # Split string on spaces and store the list in a variable
 +
print(list_of_strings)                      # Display list
 +
print(list_of_strings[0])                  # Display first item in list
 +
</source>Since lists are actually a list of '''strings''', you should be able to use any function that works with a string on a list:<source lang="python">
 +
list_of_strings[0].upper()          # Use the function after the index to affect a single string within a list
 +
first_word = list_of_strings[0]
 +
print(first_word)
 +
</source>The '''index''' that is used to access <u>items</u> within a list, can also be used to access <u>characters</u> within a string. For practice, let's  create a new string, and start accessing the strings index:<source>
 +
course_name = 'Open System Automation'
 +
course_code = 'OPS435'
 +
course_number = 435
 +
print(course_code[0])                          # Print the first character in course_code
 +
print(course_code[2])                          # Print the third character in course_code
 +
print(course_code[-1])                        # Print the last character in course_code
 +
print(str(course_number)[0])                  # Turn the integer into a string, return first character in that string, and print it
 +
print(course_code[0] + course_code[1] + course_code[2])
 +
</source>
 +
:#You can use a technique that uses index numbers of a string to '''cut-out''' or '''"parse"''' smaller portions of text within a string. This term is referred to as a '''substring'''. We can use this to create a new string or display only a small portion of it:<source lang="python">
 +
print(course_name[0:4])                # Print the first four characters (values of index numbers 0,1,2, and 3)
 +
first_word = course_name[0:4]          # Save this substring for later use
 +
print(course_code[0:3])                # Print the first three characters (values of index numbers 0,1,and 2)
 +
</source>
 +
:# The index allows a few '''extra functions''' using the same parsing technique:<source lang="python">
 +
course_name = 'Open System Automation'
 +
print(course_name[12:])                        # Print the substring '12' index until end of string
 +
print(course_name[5:])                        # Print the substring '5' index until end of string
 +
print(course_name[-1])                        # Print the last character
 +
</source>With '''negative indices''', '''-1''' would represent the '''last''' character, '''-2''' index would represent the '''second last''' character, etc.:<source lang="python">
 +
course_name = 'Open System Automation'
 +
print(course_name[-1])
 +
print(course_name[-2])
 +
</source>
 +
:# Practice some of the skills that you have learned in this section:<source>
 +
course_name = 'Open System Automation'
 +
print(course_name[-10:])                            # Return the last ten characters
 +
print(course_name[-10:-6])                          # Try and figure out what this is returning
 +
print(course_name[0:4] + course_name[-10:-6])      # Combine substrings together
 +
substring = course_name[0:4] + course_name[-10:-6]  # Save the combined substring as a new string for later
 +
print(substring)
 +
</source>
 +
:# The real power found in substrings goes beyond just manually writing index values and getting back words. The next part of this investigation will cover how to search through a string for a specific word, letter, number, and return the index to that search result.
  
The map function can be used to apply a function on each item in a list. This is exactly what we did above, however it gives us much better syntax, removes the loop, including the variable we had to create to do the loop. This will make our work a little more efficient while performing the same task.
+
'''Create a Python Script Demostrating Substrings'''
<pre>
+
:'''Perform the Following Instructions'''
def square(number):
+
:#Create the '''~/ops435/lab4/lab4d.py''' script. The purpose of this script is to demonstrate creating and manipulating strings. There will be four functions each will return a single string.
    return number ** 2
+
:#Use the following template to get started:<source>
 +
#!/usr/bin/env python3
 +
# Strings 1
  
l1 = [1,2,3,4,5]
+
str1 = 'Hello World!!'
l2 = list(map(square, l1))
+
str2 = 'Seneca College'
  
l1
+
num1 = 1500
l2
+
num2 = 1.50
</pre>
 
  
The above map function required us to provide it with a function, and a list. This meant that before we could use map we needed to define a function earlier in the script. We can avoid this entire function definition through the use of anonymous functions. This is the ability to create a simple function without defining it, and pass it off for use. Below we will use lambda, which will return a function, and we can use that function immediately. The function takes 1 argument x, and it will perform a single operation on x, square it.
+
def first_five():
<pre>
+
    # Place code here - refer to function specifics in section below
square = lambda x: x ** 2
 
l1 = [1,2,3,4,5]
 
l2 = list(map(square, l1))
 
  
l1
+
def last_seven():
l2
+
    # Place code here - refer to function specifics in section below
</pre>
 
  
The above code is actually not particularly good, the whole purpose of using lambda here is we were avoiding the function definition and just quickly returning a function. However this does break down exactly what lambda does, it returns a function for use. Lets fix this and remove the square function and just use the return function from lambda. Now remember what map requires? map's first argument is a function, and map's second argument is a list. Here lambda will return a function and provide it as the first argument.
+
def middle_number():
<pre>
+
    # Place code here - refer to function specifics in section below
l1 = [1,2,3,4,5]
 
l2 = list(map(lambda x: x ** 2, l1))
 
  
l1
+
def first_three_last_three():
l2
+
    # Place code here - refer to function specifics in section below
</pre>
 
  
= Investigation 2: STRINGS =
 
Strings are in their most basic form a list of characters, or a bit of text. Strings store text so that we can use them later. In this section we will cover more than just displaying that text to the screen. Here, we will go over cutting strings into sub-strings, joining strings together, searching through strings, and matching strings against patterns.
 
  
== PART 1 - String Basics ==
+
if __name__ == '__main__':
 +
    print(first_five(str1))
 +
    print(first_five(str2))
 +
    print(last_seven(str1))
 +
    print(last_seven(str2))
 +
    print(middle_number(num1))
 +
    print(middle_number(num2))
 +
    print(first_three_last_three(str1, str2))
 +
    print(first_three_last_three(str2, str1))
 +
</source>
  
We can concatenate strings using the plus sign. Combining strings together to create a brand new string, strings are immutable just like tuples. This means everytime you change a string, you are actually creating a new string.
+
::*The script should contain '''four''' functions (use your own argument names):
<pre>
+
::::'''first_five()''':<ol><li>Accepts a single string argument</li><li>Returns a string that contains the first five characters of the argument given</li></ol>
str1 = 'Paul'
+
::::'''last_seven()''':<ol><li>Accepts a single string argument</li><li>Returns a string that contains the last seven characters of the argument given</li></ol>
str2 = 'Atreides'
+
::::'''middle_number()''':<ol><li>Accepts a integer as a argument</li><li>Returns a string containing the second and third characters in the number</li></ol>
str3 = str1 + ' ' + str2
+
::::'''first_three_last_three()''':<ol><li>Accepts two string arguments</li><li>Returns a single string that starts with the first three characters of argument1 and ends with the last three characters of argument2</li></ol>
str3
+
::*Example: first_three_last_three('abcdefg', '1234567') returns single string 'abc567'
</pre>
+
:::'''Sample Run 1'''<source>
 +
./lab4d.py
 +
Hello
 +
Senec
 +
World!!
 +
College
 +
50
 +
.5
 +
Helege
 +
Send!!
 +
</source>
 +
:::'''Sample Run 2 (with import)'''<source>
 +
import lab4d
  
Repetition is also a useful tool that can be used with strings. Repetition repeats the string over and over a specific amount of times. This is useful anytime you would manually be typing the same thing over again.
+
str1 = 'Hello World!!'
<pre>
+
str2 = 'Seneca College'
str1 = 'Paul'
+
num1 = 1500
str2 = 'Atreides'
+
num2 = 1.50
str3 = str1 + ' ' + str2 + ' ' + 'I'
 
str3
 
str3 = str1 + ' ' + str2 + ' ' + 'I'*3
 
str3
 
</pre>
 
  
== Investigation 2 - Part 1 - String Manipulation ==
+
print(lab4d.first_five(str1))
== Investigation 2 - Part 1 - Regular Expressions ==
+
# Will output 'Hello'
 +
print(lab4d.first_five(str2))
 +
# Will output 'Senec'
 +
print(lab4d.last_seven(str1))
 +
# Will output 'World!!'
 +
print(lab4d.last_seven(str2))
 +
# Will output 'College'
 +
print(lab4d.middle_number(num1))
 +
# Will output '50'
 +
print(lab4d.middle_number(num2))
 +
# Will output '.5'
 +
print(lab4d.first_three_last_three(str1, str2))
 +
# Will output 'Helege'
 +
print(lab4d.first_three_last_three(str2, str1))
 +
# Will output 'Send!!'
 +
</source>
 +
::3. Download the checking script and check your work. Enter the following commands from the bash shell.<source>
 +
cd ~/ops435/lab4/
 +
pwd #confirm that you are in the right directory
 +
ls CheckLab4.py || wget https://raw.githubusercontent.com/Seneca-CDOT/ops435/master/LabCheckScripts/CheckLab4.py
 +
python3 ./CheckLab4.py -f -v lab4d
 +
</source>
 +
::4. Before proceeding, make certain that you identify all errors in lab4d.py. When the checking script tells you everything is OK - proceed to the next step.
  
= LAB 1 SIGN-OFF (SHOW INSTRUCTOR) =
+
= LAB 4 SIGN-OFF (SHOW INSTRUCTOR) =
 
[[Image:lab1_signoff.png|thumb|right|450px|Students should be prepared with '''all required commands (system information) displayed in a terminal (or multiple terminals) prior to calling the instructor for signoff'''.]]
 
[[Image:lab1_signoff.png|thumb|right|450px|Students should be prepared with '''all required commands (system information) displayed in a terminal (or multiple terminals) prior to calling the instructor for signoff'''.]]
  
Line 211: Line 536:
 
:'''Have Ready to Show Your Instructor:'''
 
:'''Have Ready to Show Your Instructor:'''
  
::<span style="color:green;font-size:1.5em;">&#x2713;</span> x
+
::<span style="color:green;font-size:1.5em;">&#x2713;</span> Output of: <code>./CheckLab4.py -f -v</code>
::<span style="color:green;font-size:1.5em;">&#x2713;</span> x
+
::<span style="color:green;font-size:1.5em;">&#x2713;</span> Output of: <code>cat lab4a.py lab4b.py lab4c.py lab4d.py</code>
::<span style="color:green;font-size:1.5em;">&#x2713;</span> Lab1 logbook notes completed
 
  
 +
= LAB REVIEW =
  
= Practice For Quizzes, Tests, Midterm &amp; Final Exam =
+
# What is the purpose of a '''tuple'''? How does a tuple differ from a list?
 +
# How do you define elements within a tuple?
 +
# Write Python code to confirm if the string ''''OPS435'''' exists within the tuple called '''courses'''.
 +
# What is the purpose of a '''set'''? How do sets differ from lists or tuples?
 +
# How do you define elements within a set?
 +
# Assuming you have defined two sets called '''set1''' and '''set2'''. Write Python code to:<ol type="a"><li>Return a set containing all values of both sets</li><li>Returns a set containing all values in set1 that are not found in set2</li><li>Return a set containing all values that both sets DO NOT share</li></ol>
 +
# What is the purpose of a dictionary?
 +
# How do you define elements within a dictionary?
 +
# Write Python commands to display for a dictionary called '''my_dictionary''' the dictionary key called '''my_key''' and a dictionary value for that key?
 +
# What is the purpose for the '''range()''', '''len()''', '''append()''', and '''map()''' functions for a dictionary?
 +
# List and briefly explain the following functions (methods) that can be used with strings:<br>'''lower()''' , '''upper()''' , '''swapcase()''' , '''title()''' , '''captilize()''' , '''split()'''
 +
# Assume you issued the following command in your ipython3 shell:<br>'''course_name = 'Programming with Python''''<br>What will be the output for each of the following Python commands?<ol type="a"><li>'''course_name[3:11]'''</li><li>'''course_name[10:]'''</li><li>'''course_name[-1]</li></ol>
  
# x
+
[[Category:OPS435-Python]]
# x
 
# x
 

Revision as of 16:39, 5 June 2019

OBJECTIVES

The first investigation in this lab will focus on Data Structures. Each data structure has its own advantages and limitations. This lab will emphasize the most important differences between them.
The second investigation will focus on strings. You have been using and storing strings since our first class, however in this lab we will dive into the more complex nature of string manipulation. Finally, this lab will cover how to use a variety of regular expression functions for searching and input validation.

PYTHON REFERENCE

As you develop your Python scripting skills, you may start to be "overwhelmed" with the volume of information that you have absorbed over these labs. One way to help is to learn to use online references effectively in order to obtain information regarding Python scripting techniques and tools.
Below is a table with links to useful online Python reference sites (by category). You may find these references useful when performing assignments, etc.
Data Structures Lists & List Comprehension Strings Regular Expressions Miscellaneous

INVESTIGATION 1: DATA STRUCTURES

In this investigation, you will learn several data structures commonly used in Python scripting. These tools include lists, tuples, sets, and dictionaries.

PART 1 - Tuples

Many often confuse a tuple with a list (which you learned about in a previous lab). A tuple is a type of list whose values cannot be changed. In fact, nothing in a tuple can be changed after it's created (like adding, removing list elements).
There are many advantages to using tuples when creating Python scripts:
  • Data protection (eg. values are are NOT allowed to change so you won't modify them accidentally)
  • Tuples can be used as keys in data dictionaries (which are NOT allowed to change)
  • Tuples allow for faster access than lists
The term to indicate that a data structure cannot be changed is called immutable (as opposed to "mutable" which means the data structure can be changed).
Perform the Following Steps:
  1. Let's create two tuples in a temporary Python file, so we can learn how to use them and learn how they differ from lists.

    Note: tuples are defined by using parenthesis ( ) as opposed to lists which are defined by using square brackets [ ]
    t1 = ('Prime', 'Ix', 'Secundus', 'Caladan')
    t2 = (1, 2, 3, 4, 5, 6)
  2. Values from a tuple can be retrieved in the same way as a list. For example:
    print(t1[0])
    print(t2[2:4])
  3. You can also check to see whether a value exists inside a tuple or not. To demonstrate try:
    print('Ix' in t1)
    print('Geidi' in t1)
    Let's now see how a tuple differs from a list. We will now create a list and note the difference between them:
    list2 = [ 'uli101', 'ops235', 'ops335', 'ops435', 'ops535', 'ops635' ]
  4. See if you can change the value of your list:
    list2[0]= 'ica100'
    print(list2[0])
    print(list2)
    You should have been successful in changing the value of your list.

  5. Now, try changing the value of your previously-created tuple:
    t2[1] = 10
    Did it work? Once created the tuple values will not be able to change.

    If you would like a tuple with different values than the tuple you currently have, then you must create a new one.

  6. The following creates a new tuple (t3) with a contents from a slice of the t2 tuple. Slicing works the same way for tuples as for lists:
    t3 = t2[2:3]
  7. Also, as with lists, you can use for loops to iterate the values of tuples:
    for item in t1:
        print('item: ' + item)

PART 2 - Sets

So far, you have been exposed to two structures that are used to contain data: lists and tuples. You can modify the values within a list as well as modify the structure of a list (i.e. add and remove elements), whereby you cannot with a tuple.
In this section, you will learn about sets. A set has similar characteristics as a list, but there are two major differing characteristics:
  • Sets are un-ordered
  • Sets cannot contain duplicate values
Since new duplicate entries will be automatically removed when using sets, they are very useful for performing tasks such as comparisons: finding similarities or differences in multiple sets.
  1. Create some sets to work with in a temporary Python file:
    s1 = {'Prime', 'Ix', 'Secundus', 'Caladan'}
    s2 = {1, 2, 3, 4, 5}
    s3 = {4, 5, 6, 7, 8}
    Note: Sets are defined by using braces { } as opposed to tuples which use parenthesis ( ), or lists which use square brackets [ ]

  2. Try to access a set through the index:
    print(s1[0])
    This should have caused an error. You cannot access data inside a set this way because the elements inside are unordered. Instead, you should use the in method to check to see whether a value is contained in the set:
    print('Ix' in s1)
    print('Geidi' in s1)

    Sets can be combined, but it is important to note that any duplicate values (shared among sets) will be deleted.

  3. Print the contents of the sets and note the values that are common:
    print(s2)
    print(s3)
  4. This is how you get a set containing only UNIQUE values (no duplicates) from both sets:
    print(s2 | s3)         # returns a set containing all values from both sets
    print(s2.union(s3))    # same as s2 | s3
    Notice that both methods above have the same result, which one you choose depends purely on your style.

    Instead of combining sets, we can display values that are common to both sets. This is known in mathematical terms as an intersection between the lists:
    print(s2 & s3)             # returns a set containing all values that s2 and s3 share
    print(s2.intersection(s3)) # same as s2 & s3
  5. Sets can also have their values compared against other sets. First find out what items are in s2 but not in s3. This is also called a difference:
    print(s2)
    print(s3)
    print(s2 - s3)             # returns a set containing all values in s2 that are not found in s3
    print(s2.difference(s3))   # same as s2 - s3
  6. In order to see every difference between both sets, you need to find the symmetric difference. This will return a set that shows all numbers that both sets do not share together:
    print(s2 ^ s3)                     # returns a set containing all values that both sets DO NOT share
    print(s2.symmetric_difference(s3)) # same as s2 ^ s3
    Note: the set() function can convert lists into sets, and the list() function can convert sets into lists. The operations in this section can only be applied to sets, so if you need to perform a union, intersection, or difference between lists, you need to convert them to sets first. For example:
    l2 = [1, 2, 3, 4, 5]
    l3 = [4, 5, 6, 7, 8]
    temporary_set = set(l2).intersection(set(l3))
    new_list = list(temporary_set)  # '''set()''' can make lists into sets. '''list()''' can make sets into lists.
    print(new_list)

Create a Python Script Demonstrating Comparing Sets

Perform the Following Instructions
  1. Create the ~/ops435/lab4/lab4a.py script. The purpose of this script will be to demonstrate the different way of comparing sets. There will be three functions, each returning a different set comparison.
  2. Use the following template to get started:
    #!/usr/bin/env python3
    
    def join_sets(s1, s2):
        # join_sets will return a set that contains every value from both s1 and s2
    
    def match_sets(s1, s2):
        # match_sets will return a set that contains all values found in both s1 and s2
    
    def diff_sets(s1, s2):
        # diff_sets will return a set that contains all different values which are not shared between the sets
    
    if __name__ == '__main__':
        set1 = set(range(1,10))
        set2 = set(range(5,15))
        print('set1: ', set1)
        print('set2: ', set2)
        print('join: ', join_sets(set1, set2))
        print('match: ', match_sets(set1, set2))
        print('diff: ', diff_sets(set1, set2))
  • The join_sets() function should return a set that contains all values from both sets
  • The match_sets() function should return a set that contains all values found in both sets
  • The diff_sets() function should return a set that contains all values which are not shared between both sets
  • All three functions should accept two arguments both are sets
  • The script should show the exact output as the samples
  • The script should contain no errors
Sample Run 1:
./lab4a.py
set1:  {1, 2, 3, 4, 5, 6, 7, 8, 9}
set2:  {5, 6, 7, 8, 9, 10, 11, 12, 13, 14}
join:  {1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14}
match:  {8, 9, 5, 6, 7}
diff:  {1, 2, 3, 4, 10, 11, 12, 13, 14}
Sample Run 2 (with import):
import lab4a
set1 = {1,2,3,4,5}
set2 = {2,1,0,-1,-2}
print(lab4a.join_sets(set1,set2))
# Will output {-2, -1, 0, 1, 2, 3, 4, 5}
print(lab4a.match_sets(set1,set2))
# Will output {1, 2}
print(lab4a.diff_sets(set1,set2))
# Will output {-2, -1, 0, 3, 4, 5}
  1. Download the checking script and check your work. Enter the following commands from the bash shell:
    cd ~/ops435/lab4/
    pwd #confirm that you are in the right directory
    ls CheckLab4.py || wget https://raw.githubusercontent.com/Seneca-CDOT/ops435/master/LabCheckScripts/CheckLab4.py
    python3 ./CheckLab4.py -f -v lab4a
  2. Before proceeding, make certain that you identify all errors in lab4a.py. When the checking script tells you everything is OK - proceed to the next step.

Create a Python Script Demonstrating Comparing Lists

Perform the Following Instructions
  1. Create the ~/ops435/lab4/lab4b.py script. The purpose of this script will be to improve the previous script to perform the same joins, matches, and diffs, but this time on lists.
  2. Use the following as a template:
    #!/usr/bin/env python3
    
    def join_lists(l1, l2):
        # join_lists will return a list that contains every value from both l1 and l2
    
    def match_lists(l1, l2):
        # match_lists will return a list that contains all values found in both l1 and l2
    
    def diff_lists(l1, l2):
        # diff_lists will return a list that contains all different values, which are not shared between the lists
    
    if __name__ == '__main__':
        list1 = list(range(1,10))
        list2 = list(range(5,15))
        print('list1: ', list1)
        print('list2: ', list2)
        print('join: ', join_lists(list1, list2))
        print('match: ', match_lists(list1, list2))
        print('diff: ', diff_lists(list1, list2))
  • The match_lists() function should return a list that contains all values found in both lists
  • The diff_lists() function should return a list that contains all values which are not shared between both lists
  • The join_lists() function should return a list that contains all values from both lists
  • All three functions should accept two arguments both are lists
  • The script should show the exact output as the samples
  • The script should contain no errors
Sample Run 1:
./lab4b.py
list1:  [1, 2, 3, 4, 5, 6, 7, 8, 9]
list2:  [5, 6, 7, 8, 9, 10, 11, 12, 13, 14]
join:  [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14]
match:  [5, 6, 7, 8, 9]
diff:  [1, 2, 3, 4, 10, 11, 12, 13, 14]
Sample Run 2 (with import) under interactive python shell:
import lab4b
list1 = [1,2,3,4,5]
list2 = [2,1,0,-1,-2]
print(lab4b.join_lists(list1,list2)))
# Will output [0, 1, 2, 3, 4, 5, -2, -1]
print(lab4b.match_lists(list1,list2))                                                                                                                 
# Will output [1, 2]
print(lab4b.diff_lists(list1,list2))                                                                                                                  
# Will output [0, 3, 4, 5, -2, -1]
3. Download the checking script and check your work. Enter the following commands from the bash shell.
cd ~/ops435/lab4/
pwd #confirm that you are in the right directory
ls CheckLab4.py || wget matrix.senecac.on.ca/~acoatley-willis/CheckLab4.py
python3 ./CheckLab4.py -f -v lab4b
4. Before proceeding, make certain that you identify all errors in lab4b.py. When the checking script tells you everything is OK - proceed to the next step.

PART 3 - Dictionaries

In Python, a dictionary is a set of key-value pairs. Dictionaries are unordered, like sets, however any value can be retrieved from a dictionary if you know the key. This section will go over how to create, access, and change dictionaries, giving you a new powerful tool to store and manipulate data.
Perform the Following Steps:
  1. Let's begin by creating a new dictionary in a temporary Python file:
    dict_york = {'Address': '70 The Pond Rd', 'City': 'Toronto', 'Postal Code': 'M3J3M6'}
    You should note that the syntax to define a dictionary is similar to defining sets (i.e. using {}), but unlike sets dictionaries use key:value pairs within the dictionary, each key:value pair is separated by commas.
  2. All the values in a dictionary can be retrieved by using the dictionary.values() function. This particular function provides a list containing all values:
    print(dict_york.values())
    All keys to access the key:pair values within a dictionary can be retrieved using the dictionary.keys() function:
    dict_york.keys()
    We can retrieve individual values from a dictionary by providing the key associated with the value:
    print(dict_york['Address'])
    print(dict_york['Postal Code'])
  3. Dictionary keys can be any immutable values (i.e. not permitted for value to be changed). Types of values include: strings, numbers, and tuples.
  4. Try adding a new key and value to the dictionary:
    dict_york['Country'] = 'Canada'
    print(dict_york)
    print(dict_york.values())
    print(dict_york.keys())
  5. Let's change the province value to BC:
    dict_york['Province'] = 'BC'
    print(dict_york)
    print(dict_york.values())
    print(dict_york.keys())
    WARNING: Dictionary keys must be unique. Attempting to add a key that already exists in the dictionary will overwrite the existing value for that key! For example:
    dict_york['Province'] = 'ON'
    print(dict_york)
    print(dict_york.values())
    print(dict_york.keys())
    You should notice that value for the 'Province' key has been changed back to 'ON'.

    The lists that contain the values and keys of the dictionary are not real python lists - they are "views of the dictionary" and therefore are immutable. You could change these views into usable lists by using the list() function:
    list_of_keys = list(dict_york.keys())
    print(list_of_keys[0])
  6. Lists can be used with for loops:
    list_of_keys = list(dict_york.keys())
    for key in list_of_keys:
        print(key)
    for value in dict_york.values():
        print(value)

Create a Python Script for Managing Dictionaries

Perform the Following Instructions
  1. Create the ~/ops435/lab4/lab4c.py script. The purpose of this script will be to create dictionaries, extract data from dictionaries, and to make comparisons between dictionaries.
  2. Use the following as a template:
    #!/usr/bin/env python3
    
    # Dictionaries
    dict_york = {'Address': '70 The Pond Rd', 'City': 'Toronto', 'Country': 'Canada', 'Postal Code': 'M3J3M6', 'Province': 'ON'}
    dict_newnham = {'Address': '1750 Finch Ave E', 'City': 'Toronto', 'Country': 'Canada', 'Postal Code': 'M2J2X5', 'Province': 'ON'}
    # Lists
    list_keys = ['Address', 'City', 'Country', 'Postal Code', 'Province']
    list_values = ['70 The Pond Rd', 'Toronto', 'Canada', 'M3J3M6', 'ON']
    
    def create_dictionary(keys, values):
        # Place code here - refer to function specifics in section below
    
    def shared_values(dict1, dict2):
        # Place code here - refer to function specifics in section below
    
    
    if __name__ == '__main__':
        york = create_dictionary(list_keys, list_values)
        print('York: ', york)
        common = shared_values(dict_york, dict_newnham)
        print('Shared Values', common)
  • The script should contain two functions:
create_dictionary()
  1. accepts two lists as arguments keys and values, combines these lists together to create a dictionary
    (Tip: use a while loop to access elements in both the keys and values lists at the same time)
  2. returns a dictionary that has the keys and associated values from the lists
shared_values()
  1. accepts two dictionaries as arguments and finds all values that are shared between the two dictionaries
    (Tip: generate sets containing only values for each dictionary, then use a function mentioned in a previous section to store the values that are common to both lists)
  2. returns a set containing ONLY values found in BOTH dictionaries
  • make sure the functions have the correct number of arguments required
  • The script should show the exact output as the samples
  • The script should contain no errors
Sample Run 1:
./lab4c.py
York:  {'Country': 'Canada', 'Postal Code': 'M3J3M6', 'Address': '70 The Pond Rd', 'Province': 'ON', 'City': 'Toronto'}
Shared Values {'Canada', 'ON', 'Toronto'}
Sample Run 2 (with import):
import lab4c
dict_york = {'Address': '70 The Pond Rd', 'City': 'Toronto', 'Country': 'Canada', 'Postal Code': 'M3J3M6', 'Province': 'ON'}
dict_newnham = {'Address': '1750 Finch Ave E', 'City': 'Toronto', 'Country': 'Canada', 'Postal Code': 'M2J2X5', 'Province': 'ON'}
list_keys = ['Address', 'City', 'Country', 'Postal Code', 'Province']
list_values = ['70 The Pond Rd', 'Toronto', 'Canada', 'M3J3M6', 'ON']

york = lab4c.create_dictionary(list_keys, list_values)

print(york)
# Will print: {'Address': '70 The Pond Rd',
               'City': 'Toronto',
               'Country': 'Canada',
               'Postal Code': 'M3J3M6',
               'Province': 'ON'}

common = lab4c.shared_values(dict_york, dict_newnham)

print(common)
# Will print: {'Canada', 'ON', 'Toronto'}
3. Download the checking script and check your work. Enter the following commands from the bash shell.
cd ~/ops435/lab4/
pwd #confirm that you are in the right directory
ls CheckLab4.py || wget https://raw.githubusercontent.com/Seneca-CDOT/ops435/master/LabCheckScripts/CheckLab4.py
python3 ./CheckLab4.py -f -v lab4c
4. Before proceeding, make certain that you identify all errors in lab4c.py. When the checking script tells you everything is OK proceed to the next step.

INVESTIGATION 2: STRINGS

Strings are basically a list of characters (bits of text). This section will investigate strings in more detail such as cutting strings into sub-strings, joining strings, formatting strings, searching through strings, and matching strings against patterns.

Strings are immutable data objects - this means that once a string is created, it cannot be modified. In order to make a change inside a string, you would first make a copy of the part of the string (i.e. sub-string) for manipulation.

PART 1 - Strings and Substrings

This first part will explain basic concepts of using strings, printing strings, and manipulating sub-strings.
Perform the Following Steps:
  1. Create some strings in a temporary Python file:
    course_name = 'Open System Automation'
    course_code = 'OPS435'
    course_number = 435
    Strings can contain any characters inside them, whether they are letters, numbers, or symbols.
  2. Strings can also be concatenated (i.e. "combined together") by using the + sign, just make sure string are only concatenating strings with strings (no lists, no numbers, no dictionaries, etc.):
    print(course_name)
    print(course_code)
    print(str(course_number))
    print(course_name + ' ' + course_code + ' ' + str(course_number))
    When using the print() function, you can display special characters. One such special character is the is the newline character (denoted by the symbol: \n). This allows you to separate content between new lines or empty lines:
    print('Line 1\nLine 2\nLine 3\n')
  3. Strings have many built-in functions that we can use to manipulate text. Here's a list.
  4. Lets try out several different functions:
    print(course_name.lower())         # Returns a string in lower-case letters
    print(course_name.upper())         # Returns a string in upper-case letters
    print(course_name.swapcase())      # Returns a string with upper-case and lower-case letters swapped
    print(course_name.title())         # Returns a string with upper-case first letter of each word, lowercase for remaining text
    print(course_name.capitalize())    # Returns a string with upper-case first letter only, lowercase for remaining text
  5. These values can be saved inside new strings and then reused:
    lower_name = course_name.lower()    # Save returned string lower-case string inside new string variable
    print(lower_name)
  6. If a string contains many values separated by a single character, such as a space, the string can be split on those values and create a list of values
    lower_name.split(' ')       # Provide the split() function with a character to split on
    The above example will return a list of strings, which we can access just like all of lists.

  7. Let's practice more string manipulation:
    list_of_strings = lower_name.split(' ')     # Split string on spaces and store the list in a variable
    print(list_of_strings)                      # Display list
    print(list_of_strings[0])                   # Display first item in list
    Since lists are actually a list of strings, you should be able to use any function that works with a string on a list:
    list_of_strings[0].upper()           # Use the function after the index to affect a single string within a list
    first_word = list_of_strings[0]
    print(first_word)
    The index that is used to access items within a list, can also be used to access characters within a string. For practice, let's create a new string, and start accessing the strings index:
    course_name = 'Open System Automation'
    course_code = 'OPS435'
    course_number = 435
    print(course_code[0])                          # Print the first character in course_code
    print(course_code[2])                          # Print the third character in course_code
    print(course_code[-1])                         # Print the last character in course_code
    print(str(course_number)[0])                   # Turn the integer into a string, return first character in that string, and print it
    print(course_code[0] + course_code[1] + course_code[2])
  8. You can use a technique that uses index numbers of a string to cut-out or "parse" smaller portions of text within a string. This term is referred to as a substring. We can use this to create a new string or display only a small portion of it:
    print(course_name[0:4])                 # Print the first four characters (values of index numbers 0,1,2, and 3) 
    first_word = course_name[0:4]           # Save this substring for later use
    print(course_code[0:3])                 # Print the first three characters (values of index numbers 0,1,and 2)
  9. The index allows a few extra functions using the same parsing technique:
    course_name = 'Open System Automation'
    print(course_name[12:])                        # Print the substring '12' index until end of string
    print(course_name[5:])                         # Print the substring '5' index until end of string
    print(course_name[-1])                         # Print the last character
    With negative indices, -1 would represent the last character, -2 index would represent the second last character, etc.:
    course_name = 'Open System Automation'
    print(course_name[-1])
    print(course_name[-2])
  10. Practice some of the skills that you have learned in this section:
    course_name = 'Open System Automation'
    print(course_name[-10:])                            # Return the last ten characters
    print(course_name[-10:-6])                          # Try and figure out what this is returning 
    print(course_name[0:4] + course_name[-10:-6])       # Combine substrings together
    substring = course_name[0:4] + course_name[-10:-6]  # Save the combined substring as a new string for later
    print(substring)
  11. The real power found in substrings goes beyond just manually writing index values and getting back words. The next part of this investigation will cover how to search through a string for a specific word, letter, number, and return the index to that search result.

Create a Python Script Demostrating Substrings

Perform the Following Instructions
  1. Create the ~/ops435/lab4/lab4d.py script. The purpose of this script is to demonstrate creating and manipulating strings. There will be four functions each will return a single string.
  2. Use the following template to get started:
    #!/usr/bin/env python3
    # Strings 1
    
    str1 = 'Hello World!!'
    str2 = 'Seneca College'
    
    num1 = 1500
    num2 = 1.50
    
    def first_five():
        # Place code here - refer to function specifics in section below
    
    def last_seven():
        # Place code here - refer to function specifics in section below
    
    def middle_number():
        # Place code here - refer to function specifics in section below
    
    def first_three_last_three():
        # Place code here - refer to function specifics in section below
    
    
    if __name__ == '__main__':
        print(first_five(str1))
        print(first_five(str2))
        print(last_seven(str1))
        print(last_seven(str2))
        print(middle_number(num1))
        print(middle_number(num2))
        print(first_three_last_three(str1, str2))
        print(first_three_last_three(str2, str1))
  • The script should contain four functions (use your own argument names):
first_five():
  1. Accepts a single string argument
  2. Returns a string that contains the first five characters of the argument given
last_seven():
  1. Accepts a single string argument
  2. Returns a string that contains the last seven characters of the argument given
middle_number():
  1. Accepts a integer as a argument
  2. Returns a string containing the second and third characters in the number
first_three_last_three():
  1. Accepts two string arguments
  2. Returns a single string that starts with the first three characters of argument1 and ends with the last three characters of argument2
  • Example: first_three_last_three('abcdefg', '1234567') returns single string 'abc567'
Sample Run 1
./lab4d.py 
Hello
Senec
World!!
College
50
.5
Helege
Send!!
Sample Run 2 (with import)
import lab4d

str1 = 'Hello World!!'
str2 = 'Seneca College'
num1 = 1500
num2 = 1.50

print(lab4d.first_five(str1))
# Will output 'Hello'
print(lab4d.first_five(str2))
# Will output 'Senec'
print(lab4d.last_seven(str1))
# Will output 'World!!'
print(lab4d.last_seven(str2))
# Will output 'College'
print(lab4d.middle_number(num1))
# Will output '50'
print(lab4d.middle_number(num2))
# Will output '.5'
print(lab4d.first_three_last_three(str1, str2))
# Will output 'Helege'
print(lab4d.first_three_last_three(str2, str1))
# Will output 'Send!!'
3. Download the checking script and check your work. Enter the following commands from the bash shell.
cd ~/ops435/lab4/
pwd #confirm that you are in the right directory
ls CheckLab4.py || wget https://raw.githubusercontent.com/Seneca-CDOT/ops435/master/LabCheckScripts/CheckLab4.py
python3 ./CheckLab4.py -f -v lab4d
4. Before proceeding, make certain that you identify all errors in lab4d.py. When the checking script tells you everything is OK - proceed to the next step.

LAB 4 SIGN-OFF (SHOW INSTRUCTOR)

Students should be prepared with all required commands (system information) displayed in a terminal (or multiple terminals) prior to calling the instructor for signoff.


Have Ready to Show Your Instructor:
Output of: ./CheckLab4.py -f -v
Output of: cat lab4a.py lab4b.py lab4c.py lab4d.py

LAB REVIEW

  1. What is the purpose of a tuple? How does a tuple differ from a list?
  2. How do you define elements within a tuple?
  3. Write Python code to confirm if the string 'OPS435' exists within the tuple called courses.
  4. What is the purpose of a set? How do sets differ from lists or tuples?
  5. How do you define elements within a set?
  6. Assuming you have defined two sets called set1 and set2. Write Python code to:
    1. Return a set containing all values of both sets
    2. Returns a set containing all values in set1 that are not found in set2
    3. Return a set containing all values that both sets DO NOT share
  7. What is the purpose of a dictionary?
  8. How do you define elements within a dictionary?
  9. Write Python commands to display for a dictionary called my_dictionary the dictionary key called my_key and a dictionary value for that key?
  10. What is the purpose for the range(), len(), append(), and map() functions for a dictionary?
  11. List and briefly explain the following functions (methods) that can be used with strings:
    lower() , upper() , swapcase() , title() , captilize() , split()
  12. Assume you issued the following command in your ipython3 shell:
    course_name = 'Programming with Python'
    What will be the output for each of the following Python commands?
    1. course_name[3:11]
    2. course_name[10:]
    3. course_name[-1]