Python Virtual Environments

A Python virtual environment allows you to create an isolated Python installation with its own set of libraries and packages. This can be useful when you are running multiple tools or have a complex set of packages.

When to Use Virtual Environments

One of the reasons SciComp provides environment modules is to give you an easy way to access software that can frequently be difficult to install- thus, our default answer is to use the fhPython series of environment modules whenever possible. However, we understand that there are many libraries that are not part of our modules. In those cases virtual environments are preferable to installing libraries in your home directory.

Using the stand-alone, upstream Miniforge installer provides you with the greatest control, but can require significant knowledge and maintenance to get to work. Our ability to support these installations is hampered by the fact that these installations can be affected by other elements of your Linux environment.

Python venv

The venv module supports creating lightweight “virtual environments”, each with their own independent set of Python packages installed in their site directories. A virtual environment is created on top of an existing Python installation, known as the virtual environment’s “base” Python, and may optionally be isolated from the packages in the base environment, so only those explicitly installed in the virtual environment are available.

The Python Docs

venv is a python module that will create a virtual environment at a path on the file system. This path will contain any libraries and executables and can be activated by sourcing (the . operator in bash) an activation script.

As with most operations using Python in our environment you need to first load a Python environment module. The fhPython environment modules have many Python libraries already installed- this can present problems if you need to install a different version of a module that’s already a part of the fhPython environment module. Thus we suggest you use the more bare-bones Python environment modules:

rhino03[~/Work]$ ml Python/3.10.8-GCCcore-12.2.0
rhino03[~/Work]$ which python
/app/software/Python/3.10.8-GCCcore-12.2.0/bin/python
rhino03[~/Work]$ python --version
Python 3.10.8
rhino03[~/Work]$ python -m venv testenv

Once created, make sure you have the correct Python environment module loaded and then activate the virtual environment:

rhino03[~/Work]$ ml

Currently Loaded Modules:
  1) GCCcore/12.2.0                   8) SQLite/3.39.4-GCCcore-12.2.0
  2) zlib/1.2.12-GCCcore-12.2.0       9) XZ/5.2.7-GCCcore-12.2.0
  3) binutils/2.39-GCCcore-12.2.0    10) GMP/6.2.1-GCCcore-12.2.0
  4) bzip2/1.0.8-GCCcore-12.2.0      11) libffi/3.4.4-GCCcore-12.2.0
  5) ncurses/6.3-GCCcore-12.2.0      12) OpenSSL/1.1
  6) libreadline/8.2-GCCcore-12.2.0  13) Python/3.10.8-GCCcore-12.2.0  <-- must match python used during create
  7) Tcl/8.6.12-GCCcore-12.2.0

rhino03[~/Work]$ . ./testenv/bin/activate
(testenv) rhino03[~/Work]$ which python
/home/mrg/Work/testenv/bin/python

venv uses symbolic links to create the virtual environment- for example:

(testenv) rhino03[~/Work]$ ls -l /home/mrg/Work/testenv/bin/python
lrwxrwxrwx 1 mrg g_mrg 53 Dec  6 14:28 /home/mrg/Work/testenv/bin/python -> /app/software/Python/3.10.8-GCCcore-12.2.0/bin/python

In my newly created environment the Python interpreter is just a link to the interpreter used to create the environment (the interpreter loaded by the module command). Thus it is imperative that you load the same environment module the virtual environment was created with before you activate the virtual environment. Activating the environment module after activating the environment will have unpredictable effects on how Python runs.

The Conda Installer

“Conda” is a suite of tools for managing independent Python environments a’la venv but with some additional features. The most common Conda installers are Anaconda, Miniconda, and Miniforge. The current recommended Conda installer is Miniforge from the Conda project.

Miniforge from Upstream

Download the appropriate Miniforge for your workstation and install according the the instructions on that page. The installer does have help (bash Miniforge3-$(uname)-$(uname -m).sh -h) if there are additional features you’d like to use during install.

Once installed you will need to configure your channels- please follow the instructions here to complete that configuration (VPN required)

Configuring the Shell Environment

For my own environment, I have elected not to activate the base conda at login (I ran the installer with the -s option skipping install of the init scripts). I’ve created a function in my .bashrc startup file to perform the necessary steps to enable conda:

enable_conda(){
  eval "$(/home/mrg/bin/miniforge3/bin/conda shell.bash hook)"
}

You will need to upate the path to conda with the path to your installation. Once this has been added to your environment, you can just enter the command enable_conda to load the modules and set up conda as your Python interpreter.

Install the Build Tools

You may run into problems installing some packages which require compiling binaries. The options here are to load an Lmod module with a newer compiler or install the build tools from Conda

Activate the base conda environment and install the build tools:

conda install gcc_linux-64 gxx_linux-64 gfortran_linux-64

By default this will install the most current versions available in the Conda Forge. Specific versions can be specified using the == operator:

conda install gcc_linux-64==11.2.0 gxx_linux-64==11.2.0 gfortran_linux-64==11.2.0

NOTE: these are instructions for linux- you will need to check if build tools are available for your workstation.

Edit this Page via GitHub Comment by Filing an Issue Have Questions? Ask them here.