Stellar Object Classification Project

Overview

This project classifies stellar objects into three categories (Galaxy, Star, Quasar) using observations from the Sloan Digital Sky Survey (SDSS). I compared the performance of three models: LightGBM, Random Forest, and XGBoost.

The Data

Source: https://www.kaggle.com/datasets/fedesoriano/stellar-classification-dataset-sdss17
Size: 100,000 rows, 17 features (only 8 were used in training).
Preprocessing: standard scaling applied to the features; target labels encoded as integers.
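
The preprocessing step could look roughly like the sketch below. It assumes scikit-learn's StandardScaler and LabelEncoder; the notebook may do this differently. The `preprocess` helper, the two toy columns, and the label strings are made up for illustration and are not taken from the notebook.

```python
import numpy as np
from sklearn.preprocessing import LabelEncoder, StandardScaler


def preprocess(features, labels):
    """Scale features to zero mean and unit variance; encode string labels as integers.

    Hypothetical helper: the name and signature are illustrative only.
    """
    scaler = StandardScaler()
    encoder = LabelEncoder()
    X = scaler.fit_transform(np.asarray(features, dtype=float))
    y = encoder.fit_transform(labels)
    return X, y, scaler, encoder


# Tiny made-up sample with two numeric columns (not real survey values).
toy_features = [[23.9, 0.63], [18.1, 0.00], [21.5, 1.42], [19.4, 0.12]]
toy_labels = ["GALAXY", "STAR", "QSO", "GALAXY"]

X, y, scaler, encoder = preprocess(toy_features, toy_labels)

# LabelEncoder assigns integers in sorted order of the class names.
print(dict(zip(encoder.classes_, range(len(encoder.classes_)))))
print(y)
```

When a train/test split is involved, fitting the scaler on the training portion only and reusing it on the test portion (`scaler.transform`) avoids leaking test statistics into training.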

Results & Visualizations

1. Model Comparison

Here is the performance comparison across Accuracy, F1, and Precision.
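
A comparison like this can be produced with a small loop over the models. The sketch below is an assumption about how such a table might be computed, not the notebook's actual code: `compare_models` is a hypothetical helper, the data is synthetic (generated with `make_classification`), macro averaging is an assumed choice, and only Random Forest is instantiated to keep the example dependency-light.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, f1_score, precision_score
from sklearn.model_selection import train_test_split


def compare_models(models, X_train, X_test, y_train, y_test):
    """Fit each model and collect accuracy, macro F1, and macro precision on the test split."""
    results = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        pred = model.predict(X_test)
        results[name] = {
            "accuracy": accuracy_score(y_test, pred),
            "f1_macro": f1_score(y_test, pred, average="macro"),
            "precision_macro": precision_score(
                y_test, pred, average="macro", zero_division=0
            ),
        }
    return results


# Synthetic 3-class data with 8 features, standing in for the real dataset.
X, y = make_classification(
    n_samples=600, n_features=8, n_informative=5, n_classes=3, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

models = {
    "Random Forest": RandomForestClassifier(n_estimators=50, random_state=0),
    # lightgbm.LGBMClassifier() and xgboost.XGBClassifier() expose the same
    # fit/predict interface and could be added here if those packages are installed.
}

results = compare_models(models, X_train, X_test, y_train, y_test)
for name, metrics in results.items():
    print(name, {k: round(v, 3) for k, v in metrics.items()})
```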

2. PairPlot

The pairplot for the data.

Key Findings

LightGBM achieved the best balance of speed and accuracy.
Random Forest had slightly higher recall and accuracy but was significantly slower to train.
The "Quasar" class was the hardest to predict due to class imbalance.
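
One way to check a finding like the last one is to look at per-class recall next to each class's sample count. The sketch below assumes scikit-learn's `recall_score`; `per_class_recall` is a hypothetical helper, and the labels and predictions are made up rather than taken from the notebook's results.

```python
from collections import Counter

from sklearn.metrics import recall_score


def per_class_recall(y_true, y_pred, class_names):
    """Return the support (true sample count) and recall for each class."""
    recalls = recall_score(
        y_true, y_pred, labels=class_names, average=None, zero_division=0
    )
    counts = Counter(y_true)
    return {
        name: {"support": counts[name], "recall": float(r)}
        for name, r in zip(class_names, recalls)
    }


# Made-up labels: the majority class is predicted perfectly, a minority class is not.
y_true = ["GALAXY"] * 6 + ["STAR"] * 2 + ["QSO"] * 2
y_pred = ["GALAXY"] * 6 + ["STAR"] * 2 + ["QSO", "GALAXY"]

report = per_class_recall(y_true, y_pred, ["GALAXY", "STAR", "QSO"])
for name, row in report.items():
    print(name, row)
```

A class with low support and low recall is consistent with an imbalance effect, though it does not prove that imbalance is the cause.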

How to Run

Clone the repo.
Install requirements.
Run the notebook: stellar_classification.ipynb
