A Jupyter-based logistic regression model to predict telecom customer churn
Back to PortfolioThis project develops a machine learning model to predict customer churn in the telecom industry. Using historical data—demographics, service usage, billing, and tenure—I built a logistic regression model to identify customers at risk of leaving. The goal? Help telecom companies retain customers proactively, boosting satisfaction and cutting acquisition costs. Implemented in a Jupyter notebook, it combines data cleaning, visualization, and evaluation to deliver actionable insights.
telco-customer-churn.csv
telco-customer-churn.csv
, drop customerID
, encode categorical variables with pd.get_dummies(drop_first=True)
sns.countplot
)LogisticRegression
), predict churnclassification_report
; visualized with confusion matrix and ROC curvedrop_first=True
to avoid multicollinearityClone the repo and run the notebook yourself! See setup instructions on GitHub.