v1.2; 1994/10/12, Author: Chris Hoffmann, choffman@vicorp.com
For:python [-diuv] [-c command | script | - ] [args] -d Turn on parser debugging output (for wizards only, depending on compilation options). -i When a script is passed as first argument or the -c option is used, enter interactive mode after executing the script or the command. It does not read the $PYTHONSTARTUP file. This can be useful to inspect global variables or a stack trace when a script raises an exception. -s Suppress auto-printing of expressions in interactive mode. -u Force stdout and stderr to be totally unbuffered. -v Print a message each time a module is initialized, showing the place (filename or built-in module) from which it is loaded. -c command Specify the command to execute (see next section). This terminates the option list (following options are passed as arguments to the command). - anything afterward is passed as options to python script or command, not interpreted as an option to interpreter itself. script is the name of a python file to execute args are passed to script or command (in "sys.argv")If no script or command, Python enters interactive mode. Uses "readline" package for input, if available.
and elif from lambda return break else global not try class except if or while continue exec import pass def finally in print del for is raise
Illegitimate Tokens (only valid in strings): @ $ ?
A statement must all be on a single line. To break a statement over multiple lines use "\", as with the C preprocessor. Exception: can always break when inside any (), [], or {} pair, or in triple-quoted strings.
More than one statement can appear on a line if they are separated with semicolons (";").
Comments start with "#" and continue to end of line.
Identifiers: (letter|"_") (letter|digit|"_")* Python identifiers keywords, attributes, etc. are case-sensitive.
\newline Ignored (escape newline) \\ Backslash (\) \e Escape (ESC) \v Vertical Tab (VT) \' Single quote (') \f Formfeed (FF) \0OO (zero) char with \" Double quote (") \n Linefeed (LF) octal value OO \a Bell (BEL) \r Carriage Return (CR) \xXX char with \b Backspace (BS) \t Horizontal Tab (TAB) hex value XX \is left as-is
NULL byte (\000) is NOT an end-of-string marker; NULL's may be imbedded in strings
Strings (and tuples) are immutable: they cannot be modified.
long integer (unlimited precision): 1234567890L
octal integer: 0177, 0177777777777777777L hex integer: 0xFF, 0xFFFFFFFFFFFFFFFFFFL
float: 3.14e-10
tuple of length 0, 1, 2, etc:
() (1,) (1,2) # parentheses are optional if len > 0
list of length 0, 1, 2, etc:
[] [1] [1,2]
dictionary of length 0, 1, 2, etc:
{} {1 : 'first'} {1 : 'first', 'next': 'second'}
Indexing is 0-based. Negative indices (usually) mean count backwards from end of sequence.
Sequence slicing [starting-at-index : but-less-than-index] Start defaults to '0'; End defaults to 'sequence-length'.
a = (0,1,2,3,4,5,6,7)
a[3] ==> 3
a[-1] ==> 7
a[2:4] ==> (2, 3)
a[1:] ==> (1, 2, 3, 4, 5, 6, 7)
a[:3] ==> (0, 1, 2)
a[:] ==> (0,1,2,3,4,5,6,7) # makes a copy of the sequence.
< strictly less than <= less than or equal > strictly greater than >= greater than or equal == equal != not equal ( "<>" is also allowed) is object identity (are _objects_ identical, not values) is not negated object identityX < Y < Z < W has expected meaning, unlike C
False values: None, numeric zeros, empty sequences and mappings True values: all other values not X: if X is false then 1, else 0 X or Y: if X is false then Y, else X X and Y: if X is false then X, else Y ('or', 'and' evaluate second arg only if necessary to determine outcome)
Floats are implemented with C doubles.
Integers are implemented with C longs.
Long integers have unlimited size (only limit is system resources)
abs(x) absolute value of x int(x) x converted to integer long(x) x converted to long integer float(x) x converted to floating point -x x negated +x x unchanged x + y sum of x and y x - y difference of x and y x * y product of x and y x / y quotient of x and y x % y remainder of x / y divmod(x, y) the tuple (x/y, x%y) pow(x, y) x to the power y
~x the bits of x inverted x ^ y bitwise exclusive or of x and y x & y bitwise and of x and y x | y bitwise or of x and y x << n x shifted left by n bits x >> n x shifted right by n bits
len(s) length of s min(s) smallest item of s max(s) largest item of s x in s 1 if an item of s is equal to x, else 0 x not in s 0 if an item of s is equal to x, else 1 s + t the concatenation of s and t s * n, n * s n copies of s concatenated s[i] i'th item of s, origin 0 s[i:j] slice of s from i to j slice from index i up to but not including index j i defaults to 0, j to len(s) negative goes from right-end of sequence
s[i] = x item i of s is replaced by x s[i:j] = t slice of s from i to j is replaced by t del s[i:j] delete slice (same as s[i:j] = []) s.append(x) add x to end of s s.count(x) return number of i's for which s[i] == x s.index(x) return smallest i such that s[i] == x1) s.insert(i, x) item i becomes x, old item i is now at i+1, etc. s.remove(x) same as del s[s.index(x)] s.reverse() reverses the items of s (in place) s.sort() sorts the list (in place) Optional parameter: function of two arguments returning -1, 0 or 1 depending on whether arg1 is >, ==, < arg2.IndexError is raised on out-of-range sequence subscript
len(a) the number of items in a a[k] the item of a with key k a[k] = x set a[k] to x del a[k] remove a[k] from a a.items() a copy of a's list of (key, value) pairs a.keys() a copy of a's list of keys a.values() a copy of a's list of values a.has_key(k) 1 if a has a key k, else 0TypeError is raised if key not acceptable
Supports: %, c, s, i, d, u, o, x, X, e, E, f, g, G.
Width and precision may be a * to specify that an integer argument specifies the actual width or precision.
The flag characters -, +, blank, # and 0 are understood.
%s will convert any type argument to string (uses str() function)
a = '%s has %03d quote types.' % ('Python', 2) a ==> 'Python has 002 quote types.'Right-hand-side can be a mapping:
a = '%(lang)s has %(c)03d quote types.' % {'c':2, 'lang':'Python}(vars() function very handy to use on right-hand-side.)
f.close(x) close file f. f.fileno(x) get fileno (fd) for f. f.flush(x) flush file's internal buffer. f.isatty() 1 if file is connected to a tty-like dev, else 0 f.read([size]) read at most mostbytes from file and return as a string object. If omitted, read to EOF. f.readline() read one entire line from file f.readlines() read until EOF with readline() and return list of lines read. f.seek(offset, whence=0) set file's position, like "stdio's fseek()". whence == 0 then use absolute indexing whence == 1 then offset relative to current pos whence == 2 then offset relative to file end f.tell() return file's current position f.write(str) Write string to file. f.writelines(list) Write list of strings to file.
pass -- Null statement = -- Assignment operator. Can unpack tuples, lists, first, second = a[0:2]; [f, s] = range(2) del -- Unbind name from object, or attributes from objects, etc. print [<c1> [, <c2> ]* [,] -- Writes to sys.stdout. Puts spaces between arguments. Puts newline at end unless statement ends with comma. Print is not required when running interactively, simply typing an expression will print its value, unless the value is None. exec <x> [in <globals> [,<locals>]] -- Executes <x> in namespace provided. Defaults to current namespace. <x> can be a string, file object or a function object.
if <condition>: <suite> [elif <condition>: <suite>]* [else: suite] -- usual if/else_if/else statement while <condition>: <suite> [else: <suite>] -- usual while statement. "else" suite is executed after loop exits, unless the loop is exited with "break" for <target> in <condition-list>: <suite> [else: <suite>] -- iterates over sequence "<condition-list>", assigning each element to "<target>". "else" suite executed at end unless loop exited with "break" break -- immediately exit "for" or "while" loop continue -- immediately do next iteration of "for" or "while" loop return [<result>] -- return from function (or method) and return "<result>". If no result given, then returns None.
try: <suite1> [except [<exception> [, <value>]: <suite2>]+ [else: <suite3>] -- statements in <suite1> are executed. If an exception occurs, look in "except" clauses for matching <exception>. If matches or bare "except" execute suite of that clause. If no exception happens suite in "else" clause is executed after <suite1>. If <exception> has a value, it is put in <value>. <exception> can also be tuple of exceptions, e.g. "except (KeyError, NameError), val: print val" try: <suite1> finally: <suite2> -- statements in <suite1> are executed. If no exception, execute <suite2> (even if <suite1> is exited with a "return", "break" or "continue" statement). If exception did occur, executes <suite2> and them immediately reraises exception. raise <exception> [,<value>] -- raises <exception> with optional parameter <value>. An exception is simply a string (object). Create a new one simply by creating a new string: my_exception = 'You did something wrong' try: if bad: raise my_exception, bad except my_exception, value: print 'Oops', value
import <module_id1> [, <module_id2>]* -- imports modules. Members of module must be referred to by qualifying with module name: "import sys; print sys.argv: from <module_id> import <id1> [, <id2>]* -- imports names from module <module_id>. Names are not qualified: "from sys import argv; print argv" from <module_id> import * -- imports all names in module <module_id>, except those starting with "_": "from sys import *; print argv" global <id1> [,<id2>]* -- ids are from global scope (usually meaning from module) rather than local (usually meaning only in function). -- E,g, in fcn, no "global" statements, assume "a" is name that hasn't been used in fcn or module so far. Try to read from "a" -> NameError Try to write to "a" -> creates "a" local to fcn If "a" not defined in fcn, but is in module, then Try to read from "a", gets value from module Try to write to "a", creates "a" local to fcn But note "a[0]=3" starts with search for "a", will use to global "a" if no local "a".
def <func_id> ([<param_list>]): <suite> -- creates a function object and assigns it name <func_id>. <param_list> ==> [<id> [, <id>]*] [<id> = <v> [, <id> = <v>]*] [*<id>] E.g. def test (p1, p2 = 1+1, *rest): -- Parameters with "=" have default value (<v> is evaluated when function defined). If list ends with "*<id>" then <id> is assigned a tuple of all remaining args passed to function. (allows vararg functions).
class <class_id> [(<super_class1> [,<super_class2>]*)]: <suite> -- Creates a class object and assigns it name <class_id> <suite> may contain local "defs" of class methods and assignments to class attributes. E.g. class my_class (class1, class_list[3]): ... Creates a class object inheriting from both "class1" and whatever class object "class_list[3]" evaluates to. Assigns new class object to name "my_class". First arg to class methods is always instance object. By convention this is called "self". Special method "__init__()" called when instance created. Create instance by "calling" class object, possibly with args. In current implementation, can't subclass off built-in classes. But can "wrap" them, see __getattr__() below. E.g. class c (c_parent): def __init__(self, name): self.name = name def print_name(self): print "I'm", self.name def call_parent(self): c_parent.print_name(self) instance = c('tom') print instance.name 'tom' instance.print_name() "I'm tom" Call parent's super class by accessing parent's method directly and passing "self" explicitly (see "call_parent" in example above). Many other special methods available for implementing arithmetic operators, sequence, mapping indexing, etc.
lambda [<param_list>]: <condition> -- Create an anonymous function. <condition> must be an expression not a statement (e.g., not "if xx:...", "print xxx", etc.) and thus can't contain newlines. Used mostly for filter(), map(), reduce() functions.
abs(x) Return the absolute value of a number apply(f, args) Call func/method <f> with args <args> callable(x) Returns 1 if x callable, else 0. chr(i) Return one-character string whose ASCII code is integer i cmp(x,y) Return neg, zero, pos if x <, ==, > to y coerce(x,y) Return a tuple of the two numeric arguments converted to a common type. compile(string, filename, kind) Compile <string> into a code object. <filename> is used for error reporting, can be any string. <kind> is either 'eval' if <string> is a single stmt, else should be 'exec'. dir([object]) If no args, return the list of names in current local symbol table. With a module, class or class instance object as arg, return list of names in its attr dict. divmod(a,b) Returns tuple of (a/b, a%b) eval(s, globals, locals) Eval string <s> in (optional) <globals>, <locals>. <s> must have no NULL's or newlines. <s> can also be a code object. E.g.: x = 1; incr_x = eval('x + 1') filter(function, list) Construct a list from those elements of <list> for which <function> returns true. <function> takes one parameter. float(x) Convert a number to floating point. getattr(object, name) Get attr called <name> from <object>. getattr(x, 'foobar') <=> x.foobar hasattr(object, name) Returns true if <object> has attr called <name>. hash(object) Return the hash value of the object (if it has one) hex(x) Convert a number to a hexadecimal string. id(object) Return a unique 'identity' integer for an object. input([prompt]) Prints prompt, if given. Reads input and evaluates it. int(x) Convert a number to a plain integer. len(s) Return the length (the number of items) of an object. long(x) Convert a number to a long integer. map(function, list, ...) Apply <function> to every item of <list> and return a list of the results. If additional arguments are passed, <function> must take that many arguments and it is given to <function> on each call. max(s) Return the largest item of a non-empty sequence. min(s) Return the smallest item of a non-empty sequence. oct(x) Convert a number to an octal string. open(filename [, mode='r', [bufsize=<implementation dependent>]]) Return a new file object. First two args are same as those for C's "stdio open" function. <bufsize> is 0 for unbuffered, 1 for line-buffered, negative for sys-default, all else, of (about) given size. ord(c) Return integer ASCII value of <c> (str of len 1). pow(x, y [, z]) Return x to power y [modulo z] range(start [,end [, step]]) return list of ints from >= start and < end. With 1 arg, list from 0..<arg>-1 With 2 args, list from <start>..<end>-1 With 3 args, list from <start> up to <end> by <step> raw_input([prompt]) Print prompt if given, then read string from std input. reduce(f, list [, init]) Apply the binary function <f> to the items of <list> so as to reduce the list to a single value. If <init> given, it is "prepended" to <list>. reload(module) Re-parse and re-initialize an already imported module. Useful in interactive mode, if you want to reload a module after fixing it. If module was synactically correct but had an error in initialization, must import it one more time before calling reload(). repr(object) Return a string containing a printable representation of an object. Equivalent to `object` (using backquotes). round(x,n=0) Return the floating point value x rounded to n digits after the decimal point. setattr(object, name, value) This is the counterpart of getattr(). setattr(o, 'foobar', 3) <=> o.foobar = 3 str(object) Return a string containing a nicely printable representation of an object. tuple(list) Creates a tuple with same elements as <list> type(object) Return type of an object. E.g., if type(x) == type(''): print 'It is a string' vars([object]) Without arguments, return a dictionary corresponding to the current local symbol table. With a module, class or class instance object as argument returns a dictionary corresponding to the object's symbol table. Useful with "%" formatting operator. xrange(start [, end [, step]]) Like range(), but doesn't actually store entire list all at once. Good to use in "for" loops when there is a big range and little memory.
E.g. class x: def __init__(self, v): self.value = v def __add__(self, r): return self.value + r a = x(3) # sort of like calling x.__init__(a, 3) a + 4 # is equivalent to a.__add__(4)
__init__(s, args) object instantiation __del__(s) called on object demise (refcount becomes 0) __repr__(s) repr() and `...` conversions __str__(s) str() and 'print' statement __cmp__(s, o) implements <, ==, >, <=, <>, !=, >=, is [not] __hash__(s) hash() and dictionary operations __getattr__(s, name) called when attr lookup doesn't find <name> __setattr__(s, name, val) called when setting an attr (inside, don't use "self.name = value" use "self.__dict__[name] = val") __delattr__(s, name) called to delete attr <name> __call__(self, *args) called when an instance is called as function.
s+o = __add__(s,o) s-o = __sub__(s,o) s*o = __mul__(s,o) s/o = __div__(s,o) s%o = __mod__(s,o) divmod(s,o) = __divmod__(s,o) pow(s,o) = __pow__(s,o) s&o = __and__(s,o) s^o = __xor__(s,o) s|o = __or__(s,o) s<<o = __lshift__(s,o) s>>o = __rshift__(s,o) nonzero(s) = __nonzero__(s) (used in boolean testing) -s = __neg__(s) +s = __pos__(s) abs(s) = __abs__(s) ~s = __invert__(s) (bitwise) int(s) = __int__(s) long(s) = __long__(s) float(s) = __float__(s) oct(s) = __oct__(s) hex(s) = __hex__(s) coerce(s,o) = __coerce__(s,o) Right-hand-side equivalents for all binary operators exist; are called when class instance is on r-h-s of operator: a + 3 calls __add__(a, 3) 3 + a calls __radd__(a, 3)
len(s) = __len__(s) length of object, >= 0. Length 0 == false s[i] = __getitem__(s,i) Element at index/key i, origin 0
s[i]=v = __setitem__(s,i,v) del s[i] = __delitem__(s,i) s[i:j] = __getslice__(s,i,j) s[i:j]=seq = __setslice__(s,i,j,seq) del s[i:j] = __delslice__(s,i,j) == s[i:j] = []
hash(s) = __hash__(s) - hash value for dictionary references s[k]=v = __setitem__(s,k,v) del s[k] = __delitem__(s,k)
X.__dict__ dict used to store object's writeable attributes I.__methods__ list of I's methods; on many built-in types. I.__members__ lists of I's data attributes; on many built-in types I.__class__ class to which instance I belongs C.__bases__ r/o attr, tuple of class C's base classes M.__name__ r/o attr, module M's name as string
sys
Variables:
argv -- The list of command line arguments passed to a Python script. sys.argv[0] is the script name. builtin_module_names -- A list of strings giving the names of all modules written in C that are linked into this interpreter. check_interval -- How often to check for thread switches or signals (measured in number of virtual machine instructions) exc_type exc_value exc_traceback -- Set when in an exception handler. Are last exception, last exception value, and traceback object of call stack when exception occured. exitfunc -- User can set to a parameterless fcn. It will get called before interpreter exits. last_type last_value last_traceback -- Set only when an exception not handled and interpreter prints an error. Used by debuggers. modules -- List of modules that have already been loaded. path -- Search path for external modules. Can be modified by program. ps1 ps2 -- prompts to use in interactive mode. stdin stdout stderr -- File objects used for I/O. User can redirect by assigning a new file object to them (or any object with a method "write()" taking string argument). tracebacklimit -- Maximum levels of tb info printed on error.
Functions:
exit(n) -- Exit with status <n>. Raises SystemExit exception. (Hence can be caught and ignored by program) settrace(func) -- Sets a trace function: called before each line of code is exited. setprofile(func) -- Sets a profile function for performance profiling.
os
"synonym" for whatever O/S-specific module is proper for current environment. Uses "posix" whenever possible.Variables
name -- name of O/S-specific module (e.g. "posix" or "mac") path -- O/S-specific module for path manipulations. on Unix, os.path.split() <=> posixpath.split() curdir -- string used to represent current directory ('.') pardir -- string used to represent parent directory ('..') sep -- string used to separate directories ('/')
posix
Variables:
environ -- dictionary of environment variables, e.g. posix.environ['HOME'] error -- exception raised on POSIX-related error. Corresponding value is tuple of errno code and perror() string.
Some Functions (see doc for more):
chdir(path) -- Go to <path>. close(fd) -- Close file descriptor <fd>. _exit(n) -- Immediate exit, with no cleanups, no SystemExit, etc. Should use this to exit a child process. execv(p, args) -- "Become" executable <p> with args <args> fork() -- Like C's fork(). Returns 0 to child, child pid to parent. kill(pid, signal) -- Like C's kill listdir(path) -- List names of entries in directory <path>. open(file, flags, mode) -- Like C's open(). Returns file descriptor. pipe() -- Creates pipe. Returns pair of file descriptors (r, w). popen(command, mode) -- Open a pipe to or from <command>. Result is a file object to read to or write from, as indicated by <mode> being 'r' or 'w'. read(fd, n) -- Read <n> bytes from <fd> and return as string. stat(path) -- Returns st_mode, st_ino, st_dev, st_nlink, st_uid, st_gid, st_size, st_atime, st_mtime, st_ctime. system(command) -- Execute string <command> in a subhell. Returns exit status of subshell. unlink(path) -- Unlink ("delete") path/file. wait() -- Wait for child process completion. Returns tuple of pid, exit_status waitpid(pid, options) -- Wait for process pid to complete. Returns tuple of pid, exit_status write(fd, str) -- Write <str> to <fd>. Returns num bytes written.
posixpath
Some Functions (see doc for more):
exists(p) -- True if string <p> is an existing path expanduser(p) -- Returns string that is <p> with "~" expansion done. isabs(p) -- True if string <p> is an absolute path. isfile(p) -- True if string <p> is a regular file. isdir(p) -- True if string <p> is a directory. islink(p) -- True if string <p> is a symbolic link. isfile(p) -- True if string <p> is a regular file. ismount(p) -- True if string <p> is a mount point. split(p) -- Splits into (head, tail) where <tail> is last pathname component and <head> is everything leading up to that. splitext(p) -- Splits into (root, ext) where last comp of <root> contains no periods and <ext> is empty or starts with a period. walk(p, visit, arg) -- Calls the function <visit> with arguments (<arg>, <dirname>, <names>) for each directory in the directory tree rooted at <p> The argument <dirname> specifies the visited directory, the argument <names> lists the files in the directory. The <visit> function may modify <names> to influence the set of directories visited below <dirname>, e.g., to avoid visiting certain parts of the tree.
math
Variables:
pi eFunctions (see ordinary C man pages for info):
acos(x) asin(x) atan(x) atan2(x, y) ceil(x) cos(x) cosh(x) exp(x) fabs(x) floor(x) fmod(x, y) frexp(x) -- Unlike C: (float, int) = frexp(float) ldexp(x, y) log(x) log10(x) modf(x) -- Unlike C: (float, float) = modf(float) pow(x, y) sin(x) sinh(x) sqrt(x) tan(x) tanh(x)
getopt
Functions:
getopt(list, optstr) -- Similar to C. <optstr> is option letters to look for. Put ':' after letter if option takes arg. E.g. # invocation was "python test.py -c hi -a arg1 arg2" opts, args = getopt.getopt(sys.argv[1:], 'ab:c:') # opts would be [('-c', 'hi'), ('-a', '')] # args would be ['arg1', 'arg2']
string
Some Variables:
digits -- The string '0123456789' uppercase lowercase whitespace -- Strings containing the appropriate characters index_error -- Exception raised by index() if substr not found.Some Functions:
index(s, sub, i=0) -- Return the lowest index in <s> not smaller than <i> where the substring <sub> is found. lower(s) -- Return a string that is <s> in lowercase splitfields(s, sep) -- Returns a list containing the fields of the string <s>, using the string <sep> as a separator. joinfields(words, sep) -- Concatenate a list or tuple of words with intervening separators. strip(s) -- Return a string that is <s> without leading and trailing whitespace. upper(s) -- Return a string that is <s> in uppercase
regex
Patterns are specified as strings. Default syntax is emacs-style.Special Characters (using default syntax):
. matches any character * 0 or more of preceeding regular expresssion + 1 or more of preceeding regular expresssion ? 0 or 1 of preceeding regular expresssion [ ] defines character set: '[a-zA-Z]' to match all letters [^ ] defines complemented character set: matches if char is NOT in set ^ matches empty str at beginning of line $ matches empty str at end of line \ quoting char: \[ matches char '[' \\ matches '\'; due to Python string rules, write as '\\\\' in the pattern string. \| specifies alternative: 'foo\|bar' matches 'foo' or 'bar' \( \) grouping (for \|, or complicated expr, or substr for future reference by \D character or group() method) \D D is digit: matches substr matched by D'th \( \) in pattern \` empty str at beginning of file \' empty str at end of file \b empty str at beg or end of word: '\bis\b' matches 'is', but not 'his' \B empty str NOT at beginning or end of word \< empty str at beginning of word \> empty str at end of word \w any word constituent \W any non-word constituentVariables:
error -- Exception when pattern string isn't valid regexp.Functions:
match(pattern, string) -- Return how many characters at the beginning of <string> match regexp <pattern>. -1 if none. search(pattern, string [, pos]) -- Return the first position in <string> that matches regexp <pattern>. Return -1 if none. [starting at <pos>.] compile(pattern [,translate]) -- Create regexp object that has methods match() and search() working as above. Also group(i1, [,i2]*) E.g. p = compile('id\([a-z]\)\([a-z]\)') p.match('idab') ==> 4 p.group(1, 2) ==> ('a', 'b') set_sytax(flag) -- Set syntax flags for future calls to match(), search() and compile(). Returns current value. Flags in module regex_syntax. symcomp(pattern [,translate]) -- Like compile but with symbolic group names. Names in angle brackets. Access through group method. E.g. p = symcomp('id\(<l1>[a-z]\)\(<l2>[a-z]\)') p.match('idab') ==> 4 p.group('l1') ==> 'a'
regex_syntax
Flags for regex.set_syntax(). BitOr the flags you want together and pass to function.Variables:
RE_NO_BK_PARENS -- if set, ( means grouping, \( is literal "(" if not, vice versa RE_NO_BK_VBAR -- if set, | means or, \| is literal "|" if not, vice versa RE_BK_PLUS_QM -- if set, + or ? are operator, \+, \? are literal if not, vice versa RE_TIGHT_VBAR -- if set, | binds tighter than ^ or $ if not, vice versa RE_NEWLINE_OR -- if set, \n is an OR operator if not, it is a normal char RE_CONTEXT_INDEP_OPS -- if not set, special chars always have special meaning if set, depends on context: # ^ - only special at the beginning, or after ( or | # $ - only special at the end, or before ) or | # *, +, ? - only special when not after the beginning, (, or | RE_SYNTAX_AWK = (RE_NO_BK_PARENS | RE_NO_BK_VBAR | RE_CONTEXT_INDEP_OPS) RE_SYNTAX_EGREP = (RE_SYNTAX_AWK | RE_NEWLINE_OR) RE_SYNTAX_GREP = (RE_BK_PLUS_QM | RE_NEWLINE_OR) RE_SYNTAX_EMACS = 0
reg_sub
Functions:
sub(pattern, rep, str) -- Replace 1st occur of <pattern> in <str> by <rep> and return this. gsub(pattern, rep, str) -- Replace _all_ occurances of <pattern> in <str> by <rep> and return this. split(str, pattern) -- Split <str> into fields seperated by delimiters matching <pattern> and return as list of strings.
Other Modules In Base Distribution
* Built-ins *
sys Interpreter state vars and functions __built-in__ Access to all built-in python identifiers __main__ Scope of the interpreters main program, script or stdin array Obj efficiently representing arrays of basic values math Math functions of C standard time Time-related functions regex Regular expression matching operations marshal Read and write some python values in binary format struct Convert between python values and C structs* Standard *
getopt Parse cmd line args in sys.argv. A la UNIX 'getopt'. os A more portable interface to OS dependent functionality rand Pseudo-random generator, like C rand() regsub Functions useful for working with regular expressions string Useful string and characters functions and exceptions whrandom Wichmann-Hill pseudo-random number generator* Unix *
dbm Interface to Unix ndbm database library grp Interface to Unix group database posix OS functionality standardized by C and POSIX standards posixpath POSIX pathname functions pwd Access to the Unix password database select Access to Unix select multiplex file synchronization socket Access to BSD socket interface thread Low-level primitives for working with process threads* Multimedia *
audioop Useful operations on sound fragments imageop Useful operations on images jpeg Access to jpeg image compressor and decompressor rgbimg Access SGI imglib image files* Cryptographic Extensions *
md5 Interface to RSA's MD5 message digest algorithm mpz Interface to int part of GNU multiple precision library rotor Implementation of a rotor-based encryption algorithm* Stdwin * Standard Window System
stdwin Standard Window System interface stdwinevents Stdwin event, command, and selection constants rect Rectangle manipulation operations* SGI IRIX * (4 & 5)
al SGI audio facilities AL al constants fl Interface to FORMS library FL fl constants flp Functions for form designer fm Access to font manager library gl Access to graphics library GL Constants for gl DEVICE More constants for gl imgfile Imglib image file interface* Suns *
sunaudiodev Access to sun audio interfaceContents of Lib directory
Queue -- A multi-producer, multi-consumer queue. UserDict -- A wrapper to allow subclassing of built-in dict class UserList -- A wrapper to allow subclassing of built-in list class StringIO -- File-like objects that read/write a string buffer addpack -- Support for "packages", groups of related modules aifc -- Stuff to parse AIFF-C and AIFF files. bdb -- A generic Python debugger base class. bisect -- Bisection algorithms calendar -- Calendar printing functions cmd -- A generic class to build line-oriented command interpreters cmp -- Efficiently compare files, boolean outcome only cmpcache -- Same, but caches 'stat' results for speed codehack -- Extract a function name from a code object compileall -- Force "compilation" of all .py files in a directory colorsys -- Conversion functions between RGB and other color systems. commands -- Tools for executing UNIX commands dircache -- Sorted list of files in a dir, using a cache dircmp -- Defines a class to build directory diff tools on. dis -- Bytecode disassembler dospath -- common operations on DOS pathnames dump -- Print python code that reconstructs a variable emacs -- Execute Emacs code from a Python interpreter. fnmatch -- Filename matching with shell patterns fpformat -- General floating point formatting functions. ftplib -- An FTP client class. Based on RFC 959 getopt -- Standard command line processing. glob -- filename globbing. gopherlib -- Gopher protocol client interface grep -- 'grep' utilities httplib -- HTTP client class imghdr -- Recognizing image files based on their first few bytes. importall -- Utility module to import all modules in the path linecache -- Cache lines from files. lockfile -- Implements file locking maccache -- Mac cache listdir(), isdir(), isfile() or exists( macpath -- Pathname (or related) operations for the Macintosh mailbox -- A class to handle a unix-style or mmdf-style mailbox mhlib -- MH (mailbox) interface mimetools -- Various tools used by MIME-reading or MIME-writing programs. multifile -- Class to make multi-file messages easier to handle mutex -- Mutual exclusion -- for use with module sched newdir -- New dir() function nntplib -- An NNTP client class. Based on RFC 977 ntpath -- Common operations on DOS pathnames os -- Either mac, dos or posix depending system ospath -- ospath.py is obsolete packmail -- Create a self-unpacking shell archive. pdb -- A Python debugger persist -- Implement limited persistence. pipes -- Conversion pipeline templates poly -- Polynomials posixfile -- Extended (posix) file operations posixpath -- Common operations on POSIX pathnames profile -- Class for profiling python code. pstats -- Class for printing reports on profiled python code pty -- Pseudo terminal utilities py_compile -- Routine to "compile" a .py file to a .pyc file rand -- Don't use unless you want compatibility with C's rand() random -- Random variable generators regex_syntax -- Flags for regex.set_syntax(). regexp -- Backward compatibility for module "regexp" using "regex". regsub -- Regular expression subroutines repr -- Redo repr() but with limits on most sizes. rfc822 -- RFC-822 message manipulation class. sched -- A generally useful event scheduler class shutil -- Utility functions usable in a shell-like program sndhdr -- Several routines that help recognizing sound stat -- Constants and functions for interpreting stat/lstat struct statcache -- Maintain a cache of file stats. string -- A collection of string operations sunau -- Stuff to parse Sun and NeXT audio files. sunaudio -- Interpret sun audio headers symbol -- Non-terminal symbols of Python grammar (from "graminit.h") tb -- Print tracebacks, with a dump of local variables. tempfile -- Temporary file name allocation toaiff -- Convert "arbitrary" sound files to AIFF files token -- Tokens (from "token.h") tokenize -- Compiles a regular expression that recognizes Python tokens. traceback -- Format and print Python stack traces tty -- Terminal utilities types -- Define names for all type symbols in the std interpreter tzparse -- Parse a timezone specification. urllib -- Open an arbitrary URL util -- (Obsolete) some useful functions that don't fit elsewhere uu -- UUencode/UUdecode wave -- Stuff to parse WAVE files. whatsound -- Several routines that help recognizing sound files whrandom -- Wichmann-Hill random number generator zmod -- Demonstration of abstruse mathematical concepts
Workspace exploration and idiom hints
dir(<module>) list functions, variables in <module> dir() get object keys, defaults to local name space X.__methods__ list of methods supported by X (if any) X.__members__ List of X's data attributes if __name__ == '__main__': main() invoke main if running as script map(None, lst1, lst2, ...) merge lists b = a[:] create copy of seq structure _ in interactive mode, is last value printed
Python Mode for Emacs
Type C-c ? when in python-mode for extensive help. INDENTATION Primarily for entering new code: TAB indent line appropriately LFD insert newline, then indent DEL reduce indentation, or delete single character Primarily for reindenting existing code: C-c : guess py-indent-offset from file content; change locally C-u C-c : ditto, but change globally C-c TAB reindent region to match its context C-c < shift region left by py-indent-offset C-c > shift region right by py-indent-offset MARKING & MANIPULATING REGIONS OF CODE C-c C-b mark block of lines M-C-h mark smallest enclosing def C-u M-C-h mark smallest enclosing class C-c # comment out region of code C-u C-c # uncomment region of code MOVING POINT C-c C-p move to statement preceding point C-c C-n move to statement following point C-c C-u move up to start of current block M-C-a move to start of def C-u M-C-a move to start of class M-C-e move to end of def C-u M-C-e move to end of class EXECUTING PYTHON CODE C-c C-c sends the entire buffer to the Python interpreter C-c | sends the current region C-c ! starts a Python interpreter window; this will be used by subsequent C-c C-c or C-c | commands VARIABLES py-indent-offset indentation increment py-block-comment-prefix comment string used by py-comment-region py-python-command shell command to invoke Python interpreter py-scroll-process-buffer t means always scroll Python process buffer py-temp-directory directory used for temp files (if needed) py-beep-if-tab-change ring the bell if tab-width is changed
The Python Debugger
Accessing
import pdb (it's a module written in Python) -- defines functions run(string) -- interpret string in the debugger runctx(string, globals, locals) -- interpret string using globals and locals for namespace runcall(fun, arg1, arg2, ...) -- run function object <fun> with args pm() -- run postmortem on last exception (like debugging a core file) post_mortem(t) -- run postmortem on traceback object <t> -- defines class "Pdb" use Pdb to create reusable debugger objects. Object preserves state (i.e. break points) between calls. Pdb defines methods run(string) -- interpret string in the debugger runctx(string, globals, locals) -- interpret string using globals and locals for namespace runcall(fun, arg1, arg2, ...) -- run function object with args runs until a breakpoint hit, exception, or end of program If exception, variable '__exception__' holds (exception,value).Commands
h, help brief reminder of commands b, break [<arg>] if <arg> numeric, break at line <arg> in current file if <arg> is function object, break on entry to fcn <arg> if no arg, list breakpoints cl, clear [<arg>] if <arg> numeric, clear breakpoint at <arg> in current file if no arg, clear all breakpoints after confirmation w, where print current call stack u, up move up one stack frame (to top-level caller) d, down move down one stack frame s, step advance one line in the program, stepping into calls n, next advance one line, stepping over calls r, return continue execution until current function returns (return value is saved in variable "__return__", which can be printed or manipulated from debugger) c, continue continue until next breakpoint a, args print args to current function rv, retval prints return value from last function that returned p, print <arg> prints value of <arg> in current stack frame l, list [<first> [, <last>]] List source code for the current file. Without arguments, list 11 lines around the current line or continue the previous listing. With one argument, list 11 lines starting at that line. With two arguments, list the given range; if the second argument is less than the first, it is a count. whatis <arg> prints type of <arg> ! executes rest of line as a Python statement in the current stack frame q quit immediately stop execution and leave debugger <return> executes last command again Any input debugger doesn't recognize as a command is assumed to be a Python statement to execute in the current stack frame, same as the exclamation mark ("!") command does.Example
(1394) python Python 1.0.3 (Sep 26 1994) Copyright 1991-1994 Stichting Mathematisch Centrum, Amsterdam >>> import rm >>> rm.run() Traceback (innermost last): File "<stdin>", line 1 File "./rm.py", line 7 x = div(3) File "./rm.py", line 2 return a / r ZeroDivisionError: integer division or modulo >>> import pdb >>> pdb.pm() > ./rm.py(2)div: return a / r (Pdb) list 1 def div(a): 2 -> return a / r 3 4 def run(): 5 global r 6 r = 0 7 x = div(3) 8 print x [EOF] (Pdb) print r 0 (Pdb) q >>> pdb.runcall(rm.run) etc.Quirks
Breakpoints are stored as filename, line number tuples. If a module is reloaded after editing, any remembered breakpoints are likely to be wrong.
Always single-steps through top-most stack frame. That is, "c" acts like "n".