Obesity Prediction Web App using Machine Learning
A Streamlit-based interactive web application that predicts an individual's obesity level based on lifestyle and physical metrics using a machine learning model trained on an international dataset.
Role
Data Scientist, Machine Learning Engineer
Partner
Independent (Solo Project)
Platform
Web and Mobile Apps
Intro




Building an Accessible Obesity Prediction Tool
This project was initiated to create an accessible web-based tool that can estimate an individual's obesity level using machine learning. By leveraging survey data from Latin American countries, the model helps users understand their risk category based on inputs such as age, eating habits, physical activity, and water intake. The application is built for general users and health education purposes.
Building an Accessible Obesity Prediction Tool
This project was initiated to create an accessible web-based tool that can estimate an individual's obesity level using machine learning. By leveraging survey data from Latin American countries, the model helps users understand their risk category based on inputs such as age, eating habits, physical activity, and water intake. The application is built for general users and health education purposes.
Building an Accessible Obesity Prediction Tool
This project was initiated to create an accessible web-based tool that can estimate an individual's obesity level using machine learning. By leveraging survey data from Latin American countries, the model helps users understand their risk category based on inputs such as age, eating habits, physical activity, and water intake. The application is built for general users and health education purposes.
Dataset Explanations
The dataset contains 2,111 entries from Mexico, Peru, and Colombia, collected from online surveys and synthetic data via SMOTE. It includes 17 features such as demographic attributes, eating habits, and physical activity. The target variable, NObeyesdad, represents 7 obesity levels:
Insufficient Weight
Normal Weight
Overweight Level I & II
Obesity Type I, II, III
Dataset Explanations
The dataset contains 2,111 entries from Mexico, Peru, and Colombia, collected from online surveys and synthetic data via SMOTE. It includes 17 features such as demographic attributes, eating habits, and physical activity. The target variable, NObeyesdad, represents 7 obesity levels:
Insufficient Weight
Normal Weight
Overweight Level I & II
Obesity Type I, II, III
Dataset Explanations
The dataset contains 2,111 entries from Mexico, Peru, and Colombia, collected from online surveys and synthetic data via SMOTE. It includes 17 features such as demographic attributes, eating habits, and physical activity. The target variable, NObeyesdad, represents 7 obesity levels:
Insufficient Weight
Normal Weight
Overweight Level I & II
Obesity Type I, II, III
Features
Key Features
Real-time prediction of obesity level based on 16 lifestyle related inputs.
Responsive and userfriendly interface using Streamlit.
Includes categorical and numerical input fields such as:
Age, Gender, Height, Weight
Frequency of eating vegetables
Alcohol and fast food consumption
Physical activity level
Technology usage and transportation habits
Model performance was improved through preprocessing, balancing (SMOTE), and hyperparameter tuning.
Key Features
Real-time prediction of obesity level based on 16 lifestyle related inputs.
Responsive and userfriendly interface using Streamlit.
Includes categorical and numerical input fields such as:
Age, Gender, Height, Weight
Frequency of eating vegetables
Alcohol and fast food consumption
Physical activity level
Technology usage and transportation habits
Model performance was improved through preprocessing, balancing (SMOTE), and hyperparameter tuning.
Key Features
Real-time prediction of obesity level based on 16 lifestyle related inputs.
Responsive and userfriendly interface using Streamlit.
Includes categorical and numerical input fields such as:
Age, Gender, Height, Weight
Frequency of eating vegetables
Alcohol and fast food consumption
Physical activity level
Technology usage and transportation habits
Model performance was improved through preprocessing, balancing (SMOTE), and hyperparameter tuning.
Tools





Conclusion
Outcome & Deployment
The machine learning model (Random Forest) achieved reliable results and was deployed as a user-friendly web application. Users can receive personalized predictions in seconds. This project demonstrates end-to-end data science capability from data wrangling, modeling, and evaluation, to realtime deployment using Streamlit Cloud.
The tool can be extended for real-world health monitoring use cases or educational purposes.
Outcome & Deployment
The machine learning model (Random Forest) achieved reliable results and was deployed as a user-friendly web application. Users can receive personalized predictions in seconds. This project demonstrates end-to-end data science capability from data wrangling, modeling, and evaluation, to realtime deployment using Streamlit Cloud.
The tool can be extended for real-world health monitoring use cases or educational purposes.
Outcome & Deployment
The machine learning model (Random Forest) achieved reliable results and was deployed as a user-friendly web application. Users can receive personalized predictions in seconds. This project demonstrates end-to-end data science capability from data wrangling, modeling, and evaluation, to realtime deployment using Streamlit Cloud.
The tool can be extended for real-world health monitoring use cases or educational purposes.
CATEGORY
Obesity Prediction
Obesity Prediction
Obesity Prediction
Streamlit
Streamlit
Streamlit
Data Science
Data Science
Data Science
Machine Learning
Machine Learning
Machine Learning
DURATION
May 2025


