Lung Cancer Similarity Estimator (2015 - 2025)
A rough profile-based estimator using lung cancer patient attributes. It is a learning project and not medical advice.
Workflow
- Load and parse
lung_cancer_dataset.csvwith risk-factor columns. - Convert user inputs into a weighted similarity score (smoking, exposure, symptoms, demographics).
- Calibrate score against the dataset to compute percentile and a rough chance band.
- Return an estimate, confidence label, and plain-language interpretation.
Disclaimer: This tool is for educational demo purposes only and is not a diagnosis.
Important caveat: this is a demo estimator, not a medical model. The site's dataset (https://www.kaggle.com/datasets/zkskhurram/lung-cancer-clinical-dataset-20152025/data) is patient-only, so it is not a true population probability model.
Dataset
Default dataset path: lung_cancer_dataset.csv
Dataset not loaded yet.
Estimate My Similarity Profile
Fill what you know. Leave the rest at defaults.
Load your dataset to begin.