Skip to main content

Section 7.11 String Comparison

The comparison operators also work on strings. To see if two strings are equal you write a boolean expression using the equality operator.
Other comparison operations are useful for putting words in lexicographical order 1 . This is similar to the alphabetical order you would use with a dictionary, except that all the uppercase letters come before all the lowercase letters.
It is probably clear to you that the word apple would be less than (come before) the word banana. After all, a is before b in the alphabet. But what if we consider the words apple and Apple? Are they the same?
It turns out, if you recall from our discussion of variable names, that uppercase and lowercase letters are considered to be different from one another. The way the computer knows they are different is that each character is assigned a unique integer value. “A” is 65, “B” is 66, and “5” is 53. The way you can find out the so-called ordinal value for a given character is to use a character function called ord.
One thing to note in the next to last example is the fact that the space character has an ordinal value (32). Even though you don’t see it, it is an actual character. We sometimes call it a whitespace character.
When you compare characters or strings to one another, Python converts the characters into their equivalent ordinal values and compares the integers from left to right. As you can see from the example above, “a” is greater than “A” so “apple” is greater than “Apple”. (And “apple” is greater than “Zebra”!)
Humans commonly ignore capitalization when comparing two words. However, computers do not. A common way to address this issue is to convert strings to a standard format, such as all lowercase, before performing the comparison.

Checkpoint 7.11.1.

    Evaluate the following comparison:
    "Dog" < "Doghouse"
    
  • True
  • Both match up to the “g”, but “Dog” is shorter than “Doghouse”, so it comes first in the dictionary.
  • False
  • Strings are compared character by character.

Checkpoint 7.11.2.

    Evaluate the following comparison:
    "dog" < "Dog"
    
  • True
  • “d” is greater than “D” according to the ord function (100 versus 68).
  • False
  • Yes, upper case is less than lower case according to the ordinal values of the characters.
  • They are the same word
  • Python is case sensitive, meaning that upper case and lower case characters are different.

Checkpoint 7.11.3.

    Evaluate the following comparison:
    "dog" < "Doghouse"
    
  • True
  • “d” is greater than “D”.
  • False
  • The length does not matter. Lower case “d” is greater than upper case “D”.
http://en.wikipedia.org/wiki/Lexicographic_order