A Jupyter-based logistic regression model to predict telecom customer churn
Back to PortfolioThis project develops a machine learning model to predict customer churn in the telecom industry. Using historical data—demographics, service usage, billing, and tenure—I built a logistic regression model to identify customers at risk of leaving. The goal? Help telecom companies retain customers proactively, boosting satisfaction and cutting acquisition costs. Implemented in a Jupyter notebook, it combines data cleaning, visualization, and evaluation to deliver actionable insights.
telco-customer-churn.csv telco-customer-churn.csv, drop customerID, encode categorical variables with pd.get_dummies(drop_first=True)sns.countplot)LogisticRegression), predict churnclassification_report; visualized with confusion matrix and ROC curvedrop_first=True to avoid multicollinearityClone the repo and run the notebook yourself! See setup instructions on GitHub.