This project aims to estimate medical costs based on various factors using regression analysis. We explore a dataset related to health insurance and investigate the impact of age, gender, and body mass index (BMI) on medical expenses.
- "I am not what happened to me, I am what I choose to become." β Christopher Gardner, The Pursuit of Happiness
- Special thanks to Professor James C. Dickens for guiding us during the Regression program.
- Gratitude to our family, friends, and American University for their support and encouragement.
- Data Source: Kaggle Insurance Dataset
- The dataset contains information on health insurance beneficiaries.
- Key variables include:
- Age: Age of the primary beneficiary
- Sex: Gender of the insurance contractor (female or male)
- BMI: Body mass index, providing insights into relative weight compared to height
- Age: Represents the age of the insured individual.
- Sex: Indicates the gender of the insurance contractor (female or male).
- BMI: Body mass index, which helps understand weight relative to height.
We utilized the following R libraries for our analysis:
olsrrtidyversedbplyrdplyrMatrixMASSggplot2tibbledata.tableggmosaicggforceggmapggthemespurrrkeepreadrgridExtrarandomForestcorrplotPerformanceAnalytics
Feel free to explore the code and adapt it to your specific needs. Good luck with your project! πππ©ββοΈπ¨ββοΈ