Skip to content

Data analysis projects and resources developed for the DEVinHouse 2025 course. Includes scripts, Jupyter notebooks, and reports for educational and practical data analysis activities.

License

Notifications You must be signed in to change notification settings

gustavofisica/data-analysis-devinhouse

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Analysis - DEVinHouse

License: MIT Python Version Course

This repository contains scripts, notebooks, and reports for data analysis activities related to the DEVinHouse 2025 course.

Course Reference

DEVinHouse 2025: Course Link

Prerequisites

  • Python 3.8+
  • Jupyter Notebook or Jupyter Lab
  • PostgreSQL 16+ (for SQL exercises)

Required Python Libraries

pip install pandas numpy matplotlib seaborn scipy faker requests

For Jupyter support:

pip install jupyter

Structure

  • data/: Raw and processed data (CSV, images, text files)
  • notebooks/: Jupyter Notebooks for data exploration and analysis
  • reports/: Reports and presentations
  • scripts/: Python scripts for data processing and analysis
  • sql/: Database modeling, schemas, queries, and procedures

Quick Start

  1. Clone the repository

    git clone https://github.com/gustavofisica/data-analysis-devinhouse.git
    cd data-analysis-devinhouse
  2. Install Python dependencies

    pip install pandas numpy matplotlib seaborn scipy faker requests jupyter
  3. Setup PostgreSQL (for SQL exercises)

  4. Run Jupyter Notebooks

    jupyter lab
    # or
    jupyter notebook
  5. Execute Python Scripts

    python scripts/M1S2/guessing_game.py

Documentation

Course Modules

Module 1: Python Fundamentals & Data Analysis

  • Week 2 (M1S2) - Basic Python scripts
  • Week 3 (M1S3) - Data structures
  • Week 4 (M1S4) - File I/O and modularization
  • Week 5 (M1S5) - Pandas & NumPy analysis
  • Week 6 (M1S6) - Healthcare data insights project

Module 1: Database & SQL

  • Week 7 (M1S7) - ER modeling
  • Week 8 (M1S8) - DDL, DML, normalization
  • Week 9 (M1S9) - Advanced queries

Module 2: Advanced Analysis

  • Week 2 (M2S2) - Advanced data cleaning and statistical analysis with real datasets

Common Issues & Solutions

Python Environment

# Common dependency conflicts
pip install --upgrade pip
pip install -r requirements.txt --force-reinstall

Jupyter Notebooks

# Cannot find data files
# Ensure you run jupyter from project root
cd /path/to/data-analysis-devinhouse
jupyter lab

PostgreSQL Connection

Contributing

  1. Fork this repository
  2. Create a feature branch (git checkout -b feature/new-analysis)
  3. Commit your changes (git commit -am 'Add new analysis')
  4. Push to the branch (git push origin feature/new-analysis)
  5. Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

  • 🎓 DEVinHouse 2025 for the comprehensive data analysis curriculum
  • 📚 Course instructors and materials creators
  • 🤝 Fellow students for collaboration and knowledge sharing

About

Data analysis projects and resources developed for the DEVinHouse 2025 course. Includes scripts, Jupyter notebooks, and reports for educational and practical data analysis activities.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published