Environment Setup Guide
This guide will help you set up a complete Python development environment for the Data Science Bootcamp. Follow these steps to install Python, create virtual environments, and install all required dependencies.We recommend creating a dedicated virtual environment called
Bootcamp for all coursework to keep dependencies isolated and organized.Prerequisites
System Requirements
- Python 3.8 or higher (3.10 recommended)
- 4GB RAM minimum (8GB recommended for larger datasets)
- 2GB free disk space for Python, libraries, and datasets
- Terminal access (CMD/PowerShell on Windows, Terminal on macOS/Linux)
- Git (optional, for version control)
Check Python Installation
Before starting, verify Python is installed:Python 3.10.x or higher.
Installation Options
You can set up your environment using either venv (Python’s built-in tool) or Anaconda/Miniconda (data science-focused distribution).Option 1: Using venv (Recommended for Beginners)
venv is lightweight and comes pre-installed with Python 3.3+.Create Virtual Environment
Create a new virtual environment named This creates a
Bootcamp:Bootcamp/ directory containing an isolated Python environment.Activate Virtual Environment
Activate the environment to use it:When activated, you’ll see
On Windows PowerShell, if you get an execution policy error, run:
(Bootcamp) at the start of your terminal prompt.Install Core Dependencies
Install all required packages for the bootcamp:This installs:
- jupyter: Interactive notebook environment
- numpy: Numerical computing library
- pandas: Data manipulation and analysis
- matplotlib, seaborn: Data visualization
- scikit-learn: Machine learning algorithms
- tensorflow, keras: Deep learning framework
- torch, torchvision: PyTorch deep learning framework
- streamlit: Interactive web apps for data science
- lxml, requests: Web scraping and data loading
Option 2: Using Anaconda/Miniconda
Anaconda includes pre-installed data science packages and its own package manager.Install Anaconda or Miniconda
Download and install from:
- Anaconda (full distribution, ~3GB): anaconda.com/download
- Miniconda (minimal installer, ~400MB): docs.conda.io/en/latest/miniconda.html
Create Conda Environment
Create a new environment with Python 3.10:Confirm by typing
y when prompted.Project-Specific Setup
Installing from requirements.txt
Many projects include arequirements.txt file listing all dependencies. To install from this file:
Example: Module A3 Project Setup
For the Data Preparation project (Module A3):requirements.txt for this module includes:
Launching Jupyter Notebook
Jupyter notebooks are the primary tool for interactive data science work.Launch Jupyter
- Start the Jupyter server
- Open your default web browser
- Display the notebook interface at
http://localhost:8888
Create or Open Notebook
- Click New → Python 3 to create a new notebook
- Navigate to existing
.ipynbfiles to open them
Running Python Scripts
Basic Execution
With your virtual environment activated:Example: Running Module 2 Demos
From the Bootcamp repository:Running Streamlit Dashboards
Several projects include interactive Streamlit dashboards.Managing Your Environment
Deactivating Virtual Environment
When you’re done working:Removing Virtual Environment
To completely remove an environment:Exporting Dependencies
To share your environment setup:Common Issues and Solutions
Issue: “Python not found” or “Command not found”
Solution:- Ensure Python is added to your system PATH during installation
- Try
python3instead ofpythonon macOS/Linux - Reinstall Python with “Add to PATH” option checked
Issue: Permission denied when installing packages
Solution:- Ensure your virtual environment is activated
- Don’t use
sudowith pip in a virtual environment - On Windows, run terminal as Administrator if needed
Issue: Module not found after installation
Solution:- Verify virtual environment is activated
- Check you’re using the correct Python interpreter:
which python(macOS/Linux) orwhere python(Windows) - Reinstall the package:
pip install --force-reinstall package-name
Issue: Jupyter kernel not found
Solution:Best Practices
- One environment per project: Create separate environments for different projects to avoid dependency conflicts
-
Keep requirements updated: Regularly update your
requirements.txtas you add packages - Use version control: Git-ignore your virtual environment folder (it can be recreated from requirements.txt)
-
Document dependencies: Always include a
requirements.txtorenvironment.ymlwith your projects -
Regular updates: Periodically update packages for security and bug fixes:
Next Steps
Now that your environment is set up, you’re ready to start coding!Bootcamp Overview
Review the complete curriculum and module structure
Start Module A2
Begin with Python fundamentals and your first project