Overview
The bootcamp includes 111+ Jupyter notebooks organized across 8 modules (A1-A8). Each notebook is designed as an interactive learning experience combining theory, code examples, and hands-on exercises.All notebooks are located in the
source/ directory, organized by module folders: 000_A1 through 007_A8.Notebooks by Module
Module A1: Introduction to Data Science
Module A1: Introduction to Data Science
Focus: Foundations of data science, Python basics, and development environment setupTopics Covered:
- Data science workflow and methodology
- Setting up Python and Jupyter environments
- Introduction to computational thinking
- Overview of the bootcamp structure
Module A2: Python Programming Fundamentals
Module A2: Python Programming Fundamentals
Focus: Core Python programming conceptsTopics Covered:
- Variables, data types, and operators
- Control flow (if/else, loops)
- Functions and modules
- File I/O operations
Module A3: Data Manipulation with NumPy & Pandas
Module A3: Data Manipulation with NumPy & Pandas
Focus: Working with structured data using NumPy arrays and Pandas DataFramesKey Notebooks:
002_A3/L1_analisis_caso/numpy.ipynb- NumPy fundamentals002_A3/L2_analisis_caso/analisis_caso.ipynb- Data analysis case study002_A3/L5_analisis_caso/L5_analisis_caso.ipynb- Advanced data wrangling002_A3/PROYECTO/datos.ipynb- Module project
- NumPy arrays and vectorized operations
- Pandas DataFrames and Series
- Data cleaning and transformation
- Merging, joining, and concatenating datasets
- Working with CSV and Excel files
Module A4: Exploratory Data Analysis & Visualization
Module A4: Exploratory Data Analysis & Visualization
Focus: Statistical analysis and data visualization techniquesKey Notebooks:
003_A4/PROYECTO/comercio_ya.ipynb- E-commerce analysis project
Bootcamp-main/Modulo4/contains 20+ notebooks covering:- Univariate and multivariate analysis
- EDA best practices
- Matplotlib and Seaborn visualizations
- Statistical summaries and distributions
- Descriptive statistics
- Data distributions and outliers
- Correlation analysis
- Creating effective visualizations with Matplotlib and Seaborn
Module A5: Probability & Statistics
Module A5: Probability & Statistics
Focus: Statistical foundations for data scienceKey Notebooks:
004_A5/PROYECTO/simulacion.ipynb- Monte Carlo simulations004_A5/EJERCICIOS/L3_DataSnack_distribuciones.ipynb- Probability distributions004_A5/EJERCICIOS/DataNova.ipynb- Statistical analysis exercises004_A5/EJERCICIOS/Leccion3/Index_M5_AE3.ipynb- Hypothesis testing
Bootcamp-main/Modulo5/contains 5+ notebooks on statistical inference
- Probability theory fundamentals
- Common distributions (Normal, Binomial, Poisson)
- Hypothesis testing
- Confidence intervals
- Statistical inference
Module A6: Machine Learning Fundamentals
Module A6: Machine Learning Fundamentals
Focus: Supervised learning algorithms and model evaluationKey Notebooks:
005_A6/PROYECTO/proyecto_modulo6_gasto_clientes_completo.ipynb- Customer spending prediction005_A6/EJERCICIOS/L3_preprocesamiento_escalamiento.ipynb- Data preprocessing005_A6/EJERCICIOS/L4_regresion_lineal_ejercicio.ipynb- Linear regression005_A6/EJERCICIOS/L5_clasificacion_knn_ejercicio.ipynb- KNN classification005_A6/EJERCICIOS/L6_metricas_regresion_clasificacion.ipynb- Model metrics005_A6/EJERCICIOS/L7_optimizacion_modelo_clasificacion.ipynb- Model optimization005_A6/EJERCICIOS/L8_boosting_gradient_boosting_ejercicio.ipynb- Ensemble methods
Bootcamp-main/Modulo6/contains 19+ notebooks covering:- Train/test splits and cross-validation
- Feature scaling and engineering
- Regression and classification algorithms
- Decision trees and ensemble methods
- Supervised learning workflow
- Linear and logistic regression
- K-Nearest Neighbors (KNN)
- Decision trees and random forests
- Model evaluation metrics
- Hyperparameter tuning
- Gradient boosting
Module A7: Advanced Machine Learning
Module A7: Advanced Machine Learning
Focus: Unsupervised learning and advanced ML techniquesKey Notebooks:
Located in
006_A7/PROYECTO/ and 006_A7/EJERCICIOS/Topics Covered:- Clustering algorithms (K-Means, hierarchical)
- Dimensionality reduction (PCA, t-SNE)
- Anomaly detection
- Feature engineering techniques
- Model pipelines
Module A8: Deep Learning with Neural Networks
Module A8: Deep Learning with Neural Networks
Focus: Introduction to neural networks and deep learning frameworksKey Notebooks:
007_A8/PROYECTO/proyecto_mod8_keras.ipynb- Fashion MNIST with Keras/TensorFlow007_A8/PROYECTO/proyecto_mod8_pytorch.ipynb- Fashion MNIST with PyTorch007_A8/EJERCICIOS/perceptron_AND_paso_a_paso.ipynb- Perceptron fundamentals
- Neural network architecture
- Activation functions and backpropagation
- Building models with Keras/TensorFlow
- Building models with PyTorch
- Image classification with CNNs
- Training, validation, and testing
- Model optimization and regularization
How to Navigate the Notebooks
Directory Structure
Notebook Types
Project Notebooks
Located in
PROYECTO/ folders. These are comprehensive projects that integrate concepts from the entire module.Exercise Notebooks
Located in
EJERCICIOS/ folders. Focused practice on specific topics with guided exercises.Lesson Notebooks
Named with patterns like
Index_M*_AE*.ipynb or L*_analisis_caso.ipynb. Step-by-step lessons with examples.Live Coding
Named
LiveCoding*.ipynb. Real-time coding demonstrations with instructor notes.Using Jupyter Notebooks Effectively
Essential Shortcuts
Command Mode (press Esc)
Command Mode (press Esc)
A- Insert cell aboveB- Insert cell belowD, D- Delete cellM- Convert cell to MarkdownY- Convert cell to CodeShift + Up/Down- Select multiple cells
Edit Mode (press Enter)
Edit Mode (press Enter)
Shift + Enter- Run cell and move to nextCtrl + Enter- Run cell and stayAlt + Enter- Run cell and insert belowTab- Code completionShift + Tab- Show function documentation
Best Practices
Run Cells Sequentially
Always run cells in order from top to bottom, especially when learning. Later cells often depend on variables defined in earlier cells.
Restart Kernel Regularly
Use
Kernel > Restart & Run All to ensure your notebook runs cleanly from start to finish. This catches hidden dependencies.Document Your Work
Add Markdown cells to explain your thought process, findings, and conclusions. This helps with learning and future reference.
Save Often
Use
Ctrl + S frequently. Jupyter auto-saves, but manual saves ensure you don’t lose work.Troubleshooting Common Issues
Tips for Learning
Active Learning Approach
- Read First: Go through the entire notebook to understand the flow
- Type, Don’t Copy: Retype code examples rather than copy-pasting
- Experiment: Modify parameters and see what changes
- Break Things: Try to make code fail, then fix it - you’ll learn faster
- Add Comments: Explain code in your own words
Project-Based Learning
Start with Exercises
Begin with EJERCICIOS notebooks to practice individual concepts before tackling full projects.
Complete All Projects
The PROYECTO notebooks in each module integrate all concepts and prepare you for real-world work.
Review Case Studies
The
analisis_caso notebooks show how to apply techniques to realistic business scenarios.Compare Approaches
In Module A8, compare the Keras and PyTorch implementations to understand different frameworks.
Additional Resources
Reference Notebooks
TheBootcamp-main/ directory contains additional reference materials:
- Modulo3: 35+ notebooks on Pandas operations
- Modulo4: 20+ notebooks on visualization techniques
- Modulo5: 5+ notebooks on statistical methods
- Modulo6: 19+ notebooks on machine learning algorithms
Automatic Pipeline Notebooks
Theautomatico/ directory contains:
modulos_3_a_7_plan_estudio.ipynb- Study plan overviewauto_pipeline_csv_paso_a_paso.ipynb- Automated data pipeline example
Next Steps
Datasets Reference
Learn about the datasets used in these notebooks
Tools & Libraries
Set up your development environment
Glossary
Look up data science terms and concepts
Contact Manager Project
Start with the first module project