This repository contains scripts, notebooks, and reports for data analysis activities related to the DEVinHouse 2025 course.
DEVinHouse 2025: Course Link
- Python 3.8+
- Jupyter Notebook or Jupyter Lab
- PostgreSQL 16+ (for SQL exercises)
pip install pandas numpy matplotlib seaborn scipy faker requests
For Jupyter support:
pip install jupyter
- data/: Raw and processed data (CSV, images, text files)
- notebooks/: Jupyter Notebooks for data exploration and analysis
- reports/: Reports and presentations
- scripts/: Python scripts for data processing and analysis
- sql/: Database modeling, schemas, queries, and procedures
- Clone the repository
  git clone https://github.com/gustavofisica/data-analysis-devinhouse.git
  cd data-analysis-devinhouse
- Install Python dependencies
  pip install pandas numpy matplotlib seaborn scipy faker requests jupyter
- Set up PostgreSQL (for SQL exercises)
  - Follow the detailed guide in POSTGRESQL_SETUP.md
- Run Jupyter Notebooks
  jupyter lab # or jupyter notebook
- Execute Python Scripts
  python scripts/M1S2/guessing_game.py
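
After the steps above, a quick sanity check can confirm that the interpreter and packages are in place. The following is a minimal sketch, not part of the repository: the function names `missing_packages` and `python_ok` are hypothetical, and it assumes each package's import name matches its pip name (true for the packages listed here as far as we know).

```python
import importlib.util
import sys

# Packages named in the install command above (import names assumed
# to match the pip names).
REQUIRED = ["pandas", "numpy", "matplotlib", "seaborn", "scipy", "faker", "requests"]


def missing_packages(names):
    """Return the names that cannot be located by the current interpreter."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def python_ok(minimum=(3, 8)):
    """Check that the running interpreter meets the minimum version."""
    return sys.version_info >= minimum


if __name__ == "__main__":
    if not python_ok():
        print("Python 3.8+ is required, found", sys.version.split()[0])
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All listed packages were found.")
```

If any package is reported missing, rerunning the pip install command from the steps above should resolve it.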
- 📊 Project Analysis - Detailed project structure and completion status
- 🐘 PostgreSQL Setup - Database configuration guide
- 🤝 Contributing Guidelines - How to contribute to this project
- 📋 Changelog - Project version history and changes
- 📁 Module READMEs - Each directory contains specific instructions
- Week 2 (M1S2) - Basic Python scripts
- Week 3 (M1S3) - Data structures
- Week 4 (M1S4) - File I/O and modularization
- Week 5 (M1S5) - Pandas & NumPy analysis
- Week 6 (M1S6) - Healthcare data insights project
- Week 7 (M1S7) - ER modeling
- Week 8 (M1S8) - DDL, DML, normalization
- Week 9 (M1S9) - Advanced queries
- Week 2 (M2S2) - Advanced data cleaning and statistical analysis with real datasets
# Common dependency conflicts
pip install --upgrade pip
pip install -r requirements.txt --force-reinstall
# Cannot find data files
# Ensure you run jupyter from project root
cd /path/to/data-analysis-devinhouse
jupyter lab
- Refer to POSTGRESQL_SETUP.md for authentication issues
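
The "cannot find data files" problem usually comes from relative paths resolving against the wrong working directory. As an alternative to changing directories, a notebook or script could locate the project root itself. This is a hedged sketch, assuming the root is the first ancestor directory containing both `data/` and `notebooks/`; the helper name `find_project_root` is hypothetical and not provided by the repository.

```python
from pathlib import Path


def find_project_root(start=None, markers=("data", "notebooks")):
    """Walk upward from `start` (default: current working directory)
    and return the first directory that contains every marker folder."""
    current = Path(start or Path.cwd()).resolve()
    for candidate in (current, *current.parents):
        if all((candidate / marker).is_dir() for marker in markers):
            return candidate
    raise FileNotFoundError(
        f"No directory containing {markers} found above {current}"
    )
```

Paths can then be built from the returned root, for example `find_project_root() / "data"`, so a notebook behaves the same regardless of where Jupyter was launched.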
- Fork this repository
- Create a feature branch (git checkout -b feature/new-analysis)
- Commit your changes (git commit -am 'Add new analysis')
- Push to the branch (git push origin feature/new-analysis)
- Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- 🎓 DEVinHouse 2025 for the comprehensive data analysis curriculum
- 📚 Course instructors and materials creators
- 🤝 Fellow students for collaboration and knowledge sharing