code view::python/PyMOTW/multiprocessing/basics.rst

[top] / python / PyMOTW / multiprocessing / basics.rst
     ######################
     multiprocessing Basics
     ######################
     
     The simplest way to spawn a second is to instantiate a
     :class:`Process` object with a target function and call :func:`start`
     to let it begin working.
     
     .. include:: multiprocessing_simple.py
         :literal:
         :start-after: #end_pymotw_header
     
     The output includes the word "Worker" printed five times, although it
     may not be entirely clean depending on the order of execution.
     
     .. {{{cog
     .. cog.out(run_script(cog.inFile, 'multiprocessing_simple.py'))
     .. }}}
     
     ::
     
     	$ python multiprocessing_simple.py
     
     	Worker
     	Worker
     	Worker
     	Worker
     	Worker
     
     .. {{{end}}}
     
     
     It usually more useful to be able to spawn a process with arguments to
     tell it what work to do.  Unlike with :mod:`threading`, to pass
     arguments to a :mod:`multiprocessing` :class:`Process` the argument
     must be able to be serialized using :mod:`pickle`.  This example
     passes each worker a number so the output is a little more interesting.
     
     .. include:: multiprocessing_simpleargs.py
         :literal:
         :start-after: #end_pymotw_header
     
     The integer argument is now included in the message printed by each worker:
     
     .. {{{cog
     .. cog.out(run_script(cog.inFile, 'multiprocessing_simpleargs.py'))
     .. }}}
     
     ::
     
     	$ python multiprocessing_simpleargs.py
     
     	Worker: 0
     	Worker: 1
     	Worker: 2
     	Worker: 4
     	Worker: 3
     
     .. {{{end}}}
     
     Importable Target Functions
     ===========================
     
     One difference between the :mod:`threading` and :mod:`multiprocessing`
     examples is the extra protection for ``__main__`` used in the
     :mod:`multiprocessing` examples.  Due to the way the new processes are
     started, the child process needs to be able to import the script
     containing the target function.  Wrapping the main part of the
     application in a check for ``__main__`` ensures that it is not run
     recursively in each child as the module is imported.  Another approach
     is to import the target function from a separate script.
     
     For example, this main program:
     
     .. include:: multiprocessing_import_main.py
         :literal:
         :start-after: #end_pymotw_header
     
     uses this worker function, defined in a separate module:
     
     .. include:: multiprocessing_import_worker.py
         :literal:
         :start-after: #end_pymotw_header
     
     and produces output like the first example above:
     
     .. {{{cog
     .. cog.out(run_script(cog.inFile, 'multiprocessing_import_main.py'))
     .. }}}
     
     ::
     
     	$ python multiprocessing_import_main.py
     
     	Worker
     	Worker
     	Worker
     	Worker
     	Worker
     
     .. {{{end}}}
     
     Determining the Current Process
     ===============================
     
     Passing arguments to identify or name the process is cumbersome, and
     unnecessary.  Each :class:`Process` instance has a name with a default
     value that can be changed as the process is created. Naming processes
     is useful for keeping track of them, especially in applications with
     multiple types of processes running simultaneously.
     
     .. include:: multiprocessing_names.py
         :literal:
         :start-after: #end_pymotw_header
     
     The debug output includes the name of the current process on each
     line. The lines with ``Process-3`` in the name column correspond to
     the unnamed process ``worker_1``.
     
     .. {{{cog
     .. cog.out(run_script(cog.inFile, 'multiprocessing_names.py'))
     .. }}}
     
     ::
     
     	$ python multiprocessing_names.py
     
     	worker 1 Starting
     	worker 1 Exiting
     	Process-3 Starting
     	Process-3 Exiting
     	my_service Starting
     	my_service Exiting
     
     .. {{{end}}}
     
     
     Daemon Processes
     ================
     
     By default the main program will not exit until all of the children
     have exited. There are times when starting a background process that
     runs without blocking the main program from exiting is useful, such as
     in services where there may not be an easy way to interrupt the
     worker, or where letting it die in the middle of its work does not
     lose or corrupt data (for example, a task that generates "heart beats"
     for a service monitoring tool). 
     
     To mark a process as a daemon, set its :attr:`daemon` attribute with a
     boolean value. The default is for processes to not be daemons, so
     passing True turns the daemon mode on.
     
     .. include:: multiprocessing_daemon.py
         :literal:
         :start-after: #end_pymotw_header
     
     The output does not include the "Exiting" message from the daemon
     process, since all of the non-daemon processes (including the main
     program) exit before the daemon process wakes up from its 2 second
     sleep.
     
     .. {{{cog
     .. cog.out(run_script(cog.inFile, 'multiprocessing_daemon.py'))
     .. }}}
     
     ::
     
     	$ python multiprocessing_daemon.py
     
     	Starting: daemon 13952
     	Starting: non-daemon 13953
     	Exiting : non-daemon 13953
     
     .. {{{end}}}
     
     The daemon process is terminated automatically before the main program
     exits, to avoid leaving orphaned processes running.  You can verify
     this by looking for the process id value printed when you run the
     program, and then checking for that process with a command like
     ``ps``.
     
     Waiting for Processes
     =====================
     
     To wait until a process has completed its work and exited, use the
     :func:`join` method.
     
     .. include:: multiprocessing_daemon_join.py
         :literal:
         :start-after: #end_pymotw_header
     
     Since the main process waits for the daemon to exit using
     :func:`join`, the "Exiting" message is printed this time.
     
     .. {{{cog
     .. cog.out(run_script(cog.inFile, 'multiprocessing_daemon_join.py'))
     .. }}}
     
     ::
     
     	$ python multiprocessing_daemon_join.py
     
     	Starting: non-daemon
     	Exiting : non-daemon
     	Starting: daemon
     	Exiting : daemon
     
     .. {{{end}}}
     
     By default, :func:`join` blocks indefinitely. It is also possible to
     pass a timeout argument (a float representing the number of seconds to
     wait for the process to become inactive). If the process does not
     complete within the timeout period, :func:`join` returns anyway.
     
     .. include:: multiprocessing_daemon_join_timeout.py
         :literal:
         :start-after: #end_pymotw_header
     
     Since the timeout passed is less than the amount of time the daemon
     sleeps, the process is still "alive" after :func:`join` returns.
     
     .. {{{cog
     .. cog.out(run_script(cog.inFile, 'multiprocessing_daemon_join_timeout.py'))
     .. }}}
     
     ::
     
     	$ python multiprocessing_daemon_join_timeout.py
     
     	Starting: non-daemon
     	Exiting : non-daemon
     	d.is_alive() True
     
     .. {{{end}}}
     
     Terminating Processes
     =====================
     
     Although it is better to use the *poison pill* method of signaling to
     a process that it should exit (see :ref:`multiprocessing-queues`), if
     a process appears hung or deadlocked it can be useful to be able to
     kill it forcibly.  Calling :func:`terminate` on a process object kills
     the child process.
     
     .. include:: multiprocessing_terminate.py
         :literal:
         :start-after: #end_pymotw_header
     
     .. note::
     
         It is important to :func:`join` the process after terminating it
         in order to give the background machinery time to update the
         status of the object to reflect the termination.
     
     .. {{{cog
     .. cog.out(run_script(cog.inFile, 'multiprocessing_terminate.py'))
     .. }}}
     
     ::
     
     	$ python multiprocessing_terminate.py
     
     	BEFORE: <Process(Process-1, initial)> False
     	DURING: <Process(Process-1, started)> True
     	TERMINATED: <Process(Process-1, stopped[SIGTERM])> False
     	JOINED: <Process(Process-1, stopped[SIGTERM])> False
     
     .. {{{end}}}
     
     
     Process Exit Status
     ===================
     
     The status code produced when the process exits can be accessed via
     the :attr:`exitcode` attribute.
     
     For :attr:`exitcode` values
     
     * ``== 0`` -- no error was produced
     * ``> 0`` -- the process had an error, and exited with that code
     * ``< 0`` -- the process was killed with a signal of ``-1 * exitcode``
     
     .. include:: multiprocessing_exitcode.py
         :literal:
         :start-after: #end_pymotw_header
     
     Processes that raise an exception automatically get an
     :attr:`exitcode` of 1.
     
     .. {{{cog
     .. cog.out(run_script(cog.inFile, 'multiprocessing_exitcode.py', break_lines_at=68))
     .. }}}
     
     ::
     
     	$ python multiprocessing_exitcode.py
     
     	Starting process for exit_error
     	Starting process for exit_ok
     	Starting process for return_value
     	Starting process for raises
     	Starting process for terminated
     	Process raises:
     	Traceback (most recent call last):
     	  File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python
     	2.7/multiprocessing/process.py", line 232, in _bootstrap
     	    self.run()
     	  File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python
     	2.7/multiprocessing/process.py", line 88, in run
     	    self._target(*self._args, **self._kwargs)
     	  File "multiprocessing_exitcode.py", line 24, in raises
     	    raise RuntimeError('There was an error!')
     	RuntimeError: There was an error!
     	exit_error.exitcode = 1
     	exit_ok.exitcode = 0
     	return_value.exitcode = 0
     	raises.exitcode = 1
     	terminated.exitcode = -15
     
     .. {{{end}}}
     
     
     Logging
     =======
     
     When debugging concurrency issues, it can be useful to have access to
     the internals of the objects provided by :mod:`multiprocessing`.
     There is a convenient module-level function to enable logging called
     :func:`log_to_stderr`.  It sets up a logger object using
     :mod:`logging` and adds a handler so that log messages are sent to the
     standard error channel.
     
     .. include:: multiprocessing_log_to_stderr.py
         :literal:
         :start-after: #end_pymotw_header
     
     By default the logging level is set to ``NOTSET`` so no messages are
     produced.  Pass a different level to initialize the logger to the
     level of detail you want.
     
     .. {{{cog
     .. cog.out(run_script(cog.inFile, 'multiprocessing_log_to_stderr.py'))
     .. }}}
     
     ::
     
     	$ python multiprocessing_log_to_stderr.py
     
     	[INFO/Process-1] child process calling self.run()
     	Doing some work
     	[INFO/Process-1] process shutting down
     	[DEBUG/Process-1] running all "atexit" finalizers with priority >= 0
     	[DEBUG/Process-1] running the remaining "atexit" finalizers
     	[INFO/Process-1] process exiting with exitcode 0
     	[INFO/MainProcess] process shutting down
     	[DEBUG/MainProcess] running all "atexit" finalizers with priority >= 0
     	[DEBUG/MainProcess] running the remaining "atexit" finalizers
     
     .. {{{end}}}
     
     To manipulate the logger directly (change its level setting or add
     handlers), use :func:`get_logger`.
     
     .. include:: multiprocessing_get_logger.py
         :literal:
         :start-after: #end_pymotw_header
     
     The logger can also be configured through the :mod:`logging`
     configuration file API, using the name ``multiprocessing``.
     
     .. {{{cog
     .. cog.out(run_script(cog.inFile, 'multiprocessing_get_logger.py'))
     .. }}}
     
     ::
     
     	$ python multiprocessing_get_logger.py
     
     	[INFO/Process-1] child process calling self.run()
     	Doing some work
     	[INFO/Process-1] process shutting down
     	[INFO/Process-1] process exiting with exitcode 0
     	[INFO/MainProcess] process shutting down
     
     .. {{{end}}}
     
     
     Subclassing Process
     ===================
     
     Although the simplest way to start a job in a separate process is to
     use :class:`Process` and pass a target function, it is also possible
     to use a custom subclass.  
     
     .. include:: multiprocessing_subclass.py
         :literal:
         :start-after: #end_pymotw_header
     
     The derived class should override :meth:`run` to do its work.
     
     .. {{{cog
     .. cog.out(run_script(cog.inFile, 'multiprocessing_subclass.py'))
     .. }}}
     
     ::
     
     	$ python multiprocessing_subclass.py
     
     	In Worker-1
     	In Worker-2
     	In Worker-3
     	In Worker-5
     	In Worker-4
     
     .. {{{end}}}
[top] / python / PyMOTW / multiprocessing / basics.rst
contact | logmethods.com
[code.view]