AI Answer Evaluator Dashboard

This project is a Flask-based web application that provides a dashboard to display, evaluate, and analyze student answers for various subjects using an AI model. It calculates evaluation accuracy, tracks token usage, and estimates costs associated with the AI evaluation service.

Features

Subject Dashboard: View a list of all subjects from the database.
Detailed Subject View: For each subject, view all questions, student-provided answers, the AI's evaluation, and the ground truth.
AI Evaluation: Trigger an AI-powered evaluation for all answers of a specific subject.
Accuracy Calculation: Automatically calculates the accuracy of the LLM's evaluation against the ground truth.
Request & Cost Logging: A dedicated page to view aggregated statistics on API requests, including token counts, inference time, and estimated costs, broken down by subject.

Prerequisites

Before you begin, ensure you have the following installed:

Python 3.7+
pip
A running PostgreSQL instance

Getting Started

Follow these instructions to get the project up and running on your local machine.

1. Clone the Repository

git clone <your-repository-url>
cd the-reaper

2. Set Up and Activate the Virtual Environment

Using a virtual environment is highly recommended.

For macOS and Linux users:

python3 -m venv venv
source venv/bin/activate

For Windows users:

python -m venv venv
.\venv\Scripts\activate

After running the activation command, you should see (venv) at the beginning of your terminal prompt.

3. Install Dependencies

With your virtual environment active, install the required packages:

pip install -r requirements.txt

4. Configure Environment Variables

Create a new file named .env in the root of your project directory. Copy the following and replace the placeholder values with your actual database credentials.

# The connection string for your PostgreSQL database
DATABASE_URL="postgresql://<user>:<password>@<host>:<port>/<database_name>"

# The URL of the separate AI evaluator service
EVALUATOR_URL="http://localhost:8000"

5. Running the Application

Once the setup is complete, run the Flask application:

python app.py

The application will start in debug mode and be accessible at [suspicious link removed].

How to Use

Open your web browser and navigate to http://127.0.0.1:5000.
The main dashboard will show a list of subjects.
Click on a subject to view its details, including questions and student answers.
On the subject detail page, click the "Evaluate All Answers (AI)" button to send the answers to the evaluation service.
Navigate to the "Request Logs" page from the header to see a summary of API usage and costs.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
templates		templates
.gitignore		.gitignore
README.md		README.md
app.py		app.py
evaluator_proxy.py		evaluator_proxy.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Answer Evaluator Dashboard

Features

Prerequisites

Getting Started

1. Clone the Repository

2. Set Up and Activate the Virtual Environment

3. Install Dependencies

4. Configure Environment Variables

5. Running the Application

How to Use

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

altf4m88/the-reaper

Folders and files

Latest commit

History

Repository files navigation

AI Answer Evaluator Dashboard

Features

Prerequisites

Getting Started

1. Clone the Repository

2. Set Up and Activate the Virtual Environment

3. Install Dependencies

4. Configure Environment Variables

5. Running the Application

How to Use

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages