Intro

Building an Accessible Obesity Prediction Tool

This project was initiated to create an accessible web-based tool that can estimate an individual's obesity level using machine learning. By leveraging survey data from Latin American countries, the model helps users understand their risk category based on inputs such as age, eating habits, physical activity, and water intake. The application is built for general users and health education purposes.

Building an Accessible Obesity Prediction Tool

This project was initiated to create an accessible web-based tool that can estimate an individual's obesity level using machine learning. By leveraging survey data from Latin American countries, the model helps users understand their risk category based on inputs such as age, eating habits, physical activity, and water intake. The application is built for general users and health education purposes.

Building an Accessible Obesity Prediction Tool

This project was initiated to create an accessible web-based tool that can estimate an individual's obesity level using machine learning. By leveraging survey data from Latin American countries, the model helps users understand their risk category based on inputs such as age, eating habits, physical activity, and water intake. The application is built for general users and health education purposes.

Dataset Explanations

The dataset contains 2,111 entries from Mexico, Peru, and Colombia, collected from online surveys and synthetic data via SMOTE. It includes 17 features such as demographic attributes, eating habits, and physical activity. The target variable, NObeyesdad, represents 7 obesity levels:

  • Insufficient Weight

  • Normal Weight

  • Overweight Level I & II

  • Obesity Type I, II, III

Dataset Explanations

The dataset contains 2,111 entries from Mexico, Peru, and Colombia, collected from online surveys and synthetic data via SMOTE. It includes 17 features such as demographic attributes, eating habits, and physical activity. The target variable, NObeyesdad, represents 7 obesity levels:

  • Insufficient Weight

  • Normal Weight

  • Overweight Level I & II

  • Obesity Type I, II, III

Dataset Explanations

The dataset contains 2,111 entries from Mexico, Peru, and Colombia, collected from online surveys and synthetic data via SMOTE. It includes 17 features such as demographic attributes, eating habits, and physical activity. The target variable, NObeyesdad, represents 7 obesity levels:

  • Insufficient Weight

  • Normal Weight

  • Overweight Level I & II

  • Obesity Type I, II, III

Features

Key Features

  • Real-time prediction of obesity level based on 16 lifestyle related inputs.

  • Responsive and userfriendly interface using Streamlit.

  • Includes categorical and numerical input fields such as:

    • Age, Gender, Height, Weight

    • Frequency of eating vegetables

    • Alcohol and fast food consumption

    • Physical activity level

    • Technology usage and transportation habits

  • Model performance was improved through preprocessing, balancing (SMOTE), and hyperparameter tuning.

Key Features

  • Real-time prediction of obesity level based on 16 lifestyle related inputs.

  • Responsive and userfriendly interface using Streamlit.

  • Includes categorical and numerical input fields such as:

    • Age, Gender, Height, Weight

    • Frequency of eating vegetables

    • Alcohol and fast food consumption

    • Physical activity level

    • Technology usage and transportation habits

  • Model performance was improved through preprocessing, balancing (SMOTE), and hyperparameter tuning.

Key Features

  • Real-time prediction of obesity level based on 16 lifestyle related inputs.

  • Responsive and userfriendly interface using Streamlit.

  • Includes categorical and numerical input fields such as:

    • Age, Gender, Height, Weight

    • Frequency of eating vegetables

    • Alcohol and fast food consumption

    • Physical activity level

    • Technology usage and transportation habits

  • Model performance was improved through preprocessing, balancing (SMOTE), and hyperparameter tuning.

Tools

Conclusion

Outcome & Deployment

The machine learning model (Random Forest) achieved reliable results and was deployed as a user-friendly web application. Users can receive personalized predictions in seconds. This project demonstrates end-to-end data science capability from data wrangling, modeling, and evaluation, to realtime deployment using Streamlit Cloud.
The tool can be extended for real-world health monitoring use cases or educational purposes.

Outcome & Deployment

The machine learning model (Random Forest) achieved reliable results and was deployed as a user-friendly web application. Users can receive personalized predictions in seconds. This project demonstrates end-to-end data science capability from data wrangling, modeling, and evaluation, to realtime deployment using Streamlit Cloud.
The tool can be extended for real-world health monitoring use cases or educational purposes.

Outcome & Deployment

The machine learning model (Random Forest) achieved reliable results and was deployed as a user-friendly web application. Users can receive personalized predictions in seconds. This project demonstrates end-to-end data science capability from data wrangling, modeling, and evaluation, to realtime deployment using Streamlit Cloud.
The tool can be extended for real-world health monitoring use cases or educational purposes.

CATEGORY

Obesity Prediction

Obesity Prediction

Obesity Prediction

Streamlit

Streamlit

Streamlit

Data Science

Data Science

Data Science

Machine Learning

Machine Learning

Machine Learning

DURATION

May 2025

Create a free website with Framer, the website builder loved by startups, designers and agencies.