Available for collaboration

Fiza Khan.

Data Scientist  /  ML Engineer  /  B.Tech CSE

Engineering insights at the crossroads of product, ML, and business impact. Turning raw data into decisions that actually matter — with precision, curiosity, and a forward-thinking mindset.

Scroll

Building at the edge of
data & intelligence.

Currently pursuing B.Tech in Computer Science while actively diving deep into data science and machine learning. With a solid foundation in statistics, SQL, Python, and deep learning, I love turning raw data into meaningful insights.

I'm passionate about solving real-world problems and creating data-driven solutions that leave an impact. I bring curiosity, precision, and a forward-thinking mindset to every project.

"I don't believe in overfitting — neither in models, nor in life. ⚡"

10+
Projects Built
94%
Best Model Accuracy
5+
Tech Stacks
Curiosity

My Tech Arsenal.

// Languages & Core
🐍 Python
☕ Java
🗄️ SQL
// Data & Machine Learning
🐼 Pandas
🔢 NumPy
⚙️ Scikit-Learn
🧠 TensorFlow
📊 Seaborn
📈 Matplotlib
📊 Power BI
// Frameworks & APIs
⚡ FastAPI
🌶️ Flask
☁️ AWS
🎈 Streamlit
// Databases & Tools
🐘 PostgreSQL
🔀 Git
📓 Jupyter
💻 VS Code
📗 Excel

Selected Projects.

// Power BI Dashboards

Amazon Sales & Insights Dashboard

Dynamic Power BI dashboard analyzing customer retention trends, repeat purchase behavior, and high-value segments. Automates CLV, AOV, and repeat purchase rate via DAX calculations.

View on GitHub

HR Attrition Dashboard

Interactive HR analytics dashboard breaking down attrition by age, salary, role, and education. DAX-powered KPIs including a 16.1% attrition rate for data-driven HR decisions.

View on GitHub

Inventory Cost Efficiency Dashboard

Supply chain analytics dashboard analyzing supplier performance, stock turnover, and shipping costs. Designed for retail, e-commerce, and manufacturing applications.

View on GitHub
// Machine Learning

Salary Prediction — Flask App

Deployed Flask web application estimating salaries from experience, education, and job title. Covers predictive modeling, outlier detection, and lightweight ML service architecture.

View on GitHub

Flight Price Prediction

85%+ accurate ML model forecasting airline ticket prices. Full-stack web app merging deep feature engineering with an intuitive HTML/CSS frontend for travelers.

View on GitHub

Customer Segmentation & Classification

Hybrid ML pipeline: DBSCAN clustering + LightGBM classifier on 2200+ records achieving 89% accuracy. Focused on actionable business insights and strategic feature engineering.

View on GitHub

Potato Disease Classifier

CNN deep learning model classifying potato leaf diseases with 94.65% validation accuracy. Lightweight Streamlit app for real-time image-based predictions.

View on GitHub
// Python Tools & Utilities

Automated EDA Tool

Streamlit app for instant exploratory data analysis — upload a CSV, get univariate/bivariate visualizations, correlation heatmaps, and outlier detection automatically.

View on GitHub

AI-Powered Dataset Generator

Customizable synthetic data generator for healthcare, sales, and e-commerce domains. Features row/column control, datatype flexibility, and instant summary statistics.

View on GitHub

Customer Segmentation Chatbot

AI-powered Streamlit chatbot classifying customers using LightGBM. Combines DBSCAN clustering and classification for fast, minimal-input, clean-result predictions.

View on GitHub
// Excel Analytics

Vrinda Store Sales Analysis

Deep-dive Excel report on customer demographics, regional sales, and order status. Females drive 64% of sales; pivot tables and visualizations surface actionable retail strategies.

View on GitHub

Let's build something
remarkable.

I'm always open to interesting data challenges, collaborations, and new opportunities. Drop a message — I'd love to connect.