KNAI - Natural Language Data Analytics Platform

About The Project

KNAI is an advanced AI-powered platform that revolutionizes data analysis by enabling natural language interactions with structured databases. Our solution transforms everyday language questions into optimized SQL queries, making data insights accessible to everyone in your organization, regardless of technical expertise.

Technical Architecture

Natural Language Processing Pipeline

Our backend implements a sophisticated NLP pipeline leveraging IBM WatsonX with the Granite-3.1-8b-instruct model. The pipeline consists of several key components:

Query Preprocessing

Input validation and sanitization
Context extraction from conversation history
Schema metadata integration

NLP-to-SQL Conversion

Semantic parsing using Granite-3.1-8b-instruct
SQL query generation with schema validation
Query optimization and safety checks

Response Processing

Result formatting and aggregation
Natural language response generation
Context persistence in Redis

Data Flow Architecture

The system implements a robust data flow process:

The FastAPI backend receives natural language queries via REST endpoints
Queries are enriched with conversation context from Redis
The enhanced query is processed through the WatsonX pipeline
Generated SQL is validated and optimized
Queries are executed against the PostgreSQL database
Results are processed and returned with both raw data and natural language insights

Security Implementation

Input validation and SQL injection prevention
Query validation framework preventing destructive operations
Role-based access control
Data encryption in transit and at rest

Caching and Performance

Redis implementation for conversation history
24-hour context retention with automatic cleanup
Query result caching for common requests
Asynchronous processing for long-running queries

API Documentation

Endpoints

Natural Language Query

POST /natural_language

Request Body:
{
    "query": str,          # Natural language query
    "conversation_id": str, # Optional: For context retention
    "metadata": {          # Optional: Additional context
        "schema": str,
        "filters": dict
    }
}

Response:
{
    "status": str,
    "response": {
        "natural_language": str,  # Generated response
        "sql_query": str,         # Generated SQL
        "results": dict,          # Query results
        "conversation_id": str    # For context tracking
    }
}

Error Handling

The API implements comprehensive error handling:

Input validation errors (400)
Authentication/Authorization errors (401/403)
Processing errors (500)
Query timeout handling (504)

Built With

Frontend
- Tailwind CSS
- Next.js
Backend
- Fast API
- Python
- IBM Code Engine
AI/ML
- IBM WatsonX
- Granite-3.1-8b-instruct Model
Databases
- PostgreSQL
- Redis

Getting Started

To get a local copy up and running, follow these simple steps.

Prerequisites

Python API
PostgreSQL 14+
Redis 6+
IBM Cloud account with WatsonX access

Installation

Clone the repository

git clone https://github.com/yourusername/knai.git

Navigate to the project directory

cd knai

Install dependencies for both frontend and backend

# Install frontend dependencies
cd frontend
npm install

# Install backend dependencies
cd ../backend
pip install -r requirements.txt

Set up environment variables

# Create .env files from examples
cp frontend/.env.example frontend/.env
cp backend/.env.example backend/.env

License

Distributed under the MIT License. See LICENSE for more information.

Acknowledgments

IBM for WatsonX platform support
The open-source community
Our early adopters and beta testers

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
backend		backend
frontend @ 19b7783		frontend @ 19b7783
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KNAI - Natural Language Data Analytics Platform

About The Project

Technical Architecture

Natural Language Processing Pipeline

Data Flow Architecture

Security Implementation

Caching and Performance

API Documentation

Endpoints

Natural Language Query

Error Handling

Built With

Getting Started

Prerequisites

Installation

License

Acknowledgments

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

Ivi-SCD/knai

Folders and files

Latest commit

History

Repository files navigation

KNAI - Natural Language Data Analytics Platform

About The Project

Technical Architecture

Natural Language Processing Pipeline

Data Flow Architecture

Security Implementation

Caching and Performance

API Documentation

Endpoints

Natural Language Query

Error Handling

Built With

Getting Started

Prerequisites

Installation

License

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages