Installation Guide

Installation Guide#

This document provides step-by-step instructions for setting up the audio2face development environment on different operating systems.

Table of Contents#

Linux Environment Setup
Windows Environment Setup

Linux Environment Setup#

Linux Prerequisites#

Before starting, ensure you have the following system requirements:

Ubuntu 20.04 or compatible Linux distribution
Internet connection for downloading packages

Linux Step 1: Install Protocol Buffers Compiler#

Download and install protoc for protocol buffer compilation:

# Create protoc directory
mkdir -p protoc
cd protoc

# Download protoc
curl -LjO https://github.com/protocolbuffers/protobuf/releases/download/v31.1/protoc-31.1-linux-x86_64.zip

# Extract and set permissions
unzip protoc-31.1-linux-x86_64.zip
rm -f protoc-31.1-linux-x86_64.zip
chmod +x bin/protoc

# Verify installation
bin/protoc --version

# Go back to the root directory
cd ..

Linux Step 2: Set Up Python#

You need Python 3.10 or higher to run this project. This document provides one method using conda for Python installation as a reference.

Install Python using Miniconda:

# Download Miniconda installer
wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh

# Install Miniconda
bash Miniconda3-latest-Linux-x86_64.sh

# Clean up installer
rm -f Miniconda3-latest-Linux-x86_64.sh

# Configure conda channels
conda config --add channels conda-forge
conda tos accept

# Create audio2face environment with Python 3.10
conda create -n audio2face python=3.10 -y

# Activate the environment
conda activate audio2face

Linux Step 3: Install PyTorch and ONNX Runtime#

The audio2face service supports both CPU and GPU inference. Choose the appropriate installation method based on your hardware configuration.

Option A: CPU Inference Environment (Default)#

For CPU-only inference, install PyTorch and ONNX Runtime with CPU support:

# Activate the environment
conda activate audio2face

# Install PyTorch with CPU support
conda install pytorch==2.4.1 torchaudio==2.4.1 cpuonly -c pytorch

# Install ONNX Runtime with CPU support
pip install onnxruntime==1.22.0

Option B: GPU Inference Environment (Recommended for CUDA-enabled GPUs)#

For GPU-accelerated inference with faster processing speed, install PyTorch and ONNX Runtime with CUDA support:

# Activate the environment
conda activate audio2face

# Install PyTorch with CUDA 12.1 support
conda install pytorch==2.4.1 torchvision==0.19.1 torchaudio==2.4.1 pytorch-cuda=12.1 -c pytorch -c nvidia -y

# Install ONNX Runtime with GPU support
pip install onnxruntime-gpu==1.22.0

Note: GPU inference requires NVIDIA GPU with CUDA 12.1 support. If you don’t have a compatible GPU, use Option A for CPU inference.

Linux Step 4: Install the Project#

Install the audio2face package:

# Ensure you're in the project root directory
cd /path/to/audio2face

# Activate conda environment
conda activate audio2face

# Install the package
pip install .

Linux Step 5: Verify Installation#

Test that everything is working correctly:

# Activate the environment
conda activate audio2face

# Check if audio2face.apis can be imported
python -c "import audio2face.apis; print('audio2face.apis imported successfully')"

# Check if the main application runs
python main.py --help

Linux Environment Activation#

To work with the audio2face project, always activate the conda environment first:

# Activate the environment
conda activate audio2face

# Your terminal prompt should now show (audio2face)
# You can now run Python scripts and use the audio2face package

Windows Environment Setup#

Windows Prerequisites#

Before starting, ensure you have the following system requirements:

Windows 10/11 or compatible Windows distribution
Internet connection for downloading packages

Windows Step 1: Install Protocol Buffers Compiler#

Download and install protoc for protocol buffer compilation:

Download protoc:
- Visit Protocol Buffers v31.1 Release Page
- Download the Windows version: protoc-31.1-win64.zip
Extract the files:
- Create a protoc folder in your project root directory
- Extract the downloaded protoc-31.1-win64.zip file into the protoc folder
- Ensure the executable file is located at: protoc\bin\protoc.exe

Verify installation:

# Open Command Prompt in your project directory
protoc\bin\protoc.exe --version

Windows Step 2: Set Up Python#

You need Python 3.10 or higher to run this project. This document provides one method using conda for Python installation as a reference.

Install Python using Miniconda:

Download and Install Miniconda:
- Visit Miniconda Installation Guide
- Download the Windows installer from the Anaconda website
- Follow the official installation instructions to install Miniconda
- Important: During installation, make sure to check “Add Miniconda3 to my PATH environment variable” or add the Miniconda3/Scripts directory to the PATH environment variable manually to enable conda commands from any terminal

Create and Activate Environment:

# Create audio2face environment with Python 3.10
conda create -n audio2face python=3.10 -y

# Activate the environment
conda activate audio2face

Windows Step 3: Install PyTorch and ONNX Runtime#

Install PyTorch and ONNX Runtime with CPU support for Windows:

# Activate the environment
conda activate audio2face

# Install PyTorch with CPU support
conda install pytorch==2.4.1 torchaudio==2.4.1 cpuonly -c pytorch

# Install ONNX Runtime with CPU support
pip install onnxruntime==1.22.0

Windows Step 4: Install the Project#

Install the audio2face package:

# Ensure you're in the project root directory
cd /path/to/audio2face

# Activate conda environment
conda activate audio2face

# Temporarily add protoc to PATH for this session
set PATH=%PATH%;%CD%\protoc\bin

# Install the package
pip install .

Windows Step 5: Verify Installation#

Test that everything is working correctly:

# Activate the environment
conda activate audio2face

# Check if audio2face.apis can be imported
python -c "import audio2face.apis; print('audio2face.apis imported successfully')"

# Check if the main application runs
python main.py --help

Windows Environment Activation#

To work with the audio2face project, always activate the conda environment first:

# Activate the environment
conda activate audio2face

# Your terminal prompt should now show (audio2face)
# You can now run Python scripts and use the audio2face package