Week 7: Classification and Decision Trees in Finance
Learning Objectives:
- Master classification fundamentals in financial contexts
- Understand decision tree algorithms and their applications
- Learn model evaluation metrics for classification
- Apply classification models to financial problems
- Interpret and validate classification results
Core Resources:
1. Theoretical Foundations
- Classification and Regression Trees - Breiman et al.
- Credit Scoring Using Machine Learning - Hand & Henley
- Bank Rating Prediction - Altman & Rijken
- Regulatory Pressure and Fire Sales in the Corporate Bond Market- Ellul & Jotikasthira & Lundblad
- Fire Sales and Impediments to Liquidity Provision in the Corporate Bond Market- Wang & Zhang & Zhang
2. Technical Implementation
3. Financial Applications
- Data Sources:
- Code Samples:
- Applications:
- Credit Risk Assessment
- Rating Changes Prediction
- Fraud Detection
- Trading Signal Generation
- Default Prediction
Weekly Assignment
Due: End of Week 7
Option 1: Credit Risk Classification
- Data Preparation
- Use Lending Club or WRDS data
- Prepare features: credit scores, debt ratios, employment history
- Handle class imbalance
- Split data respecting time order
- Model Implementation
- Implement decision tree classifier
- Use cross-validation for parameter tuning
- Consider class weights or sampling techniques
- Compare with logistic regression
- Analysis
- Evaluate using precision, recall, F1-score
- Analyze feature importance
- Visualize decision tree structure
- Consider economic implications of misclassification
Option 2: Research-Based Application
Design your own classification application in finance. Some suggestions:
- Rating Change Prediction
- Predict credit rating upgrades/downgrades
- Use financial ratios and market indicators
- Compare with rating agency decisions
- Trading Signal Generation
- Classify market conditions
- Generate buy/sell signals
- Evaluate trading performance
Submission Requirements:
- Code with clear documentation
- Brief report including:
- Problem motivation and relevance
- Methodology and implementation details
- Results and interpretation
- Challenges encountered and solutions
- If choosing Option 2, include references to papers of similar applications
Implementation Tips:
Key Considerations:
- Handle class imbalance appropriately
- Use appropriate evaluation metrics
- Consider interpretability vs performance
- Document hyperparameter selection
- Be mindful of look-ahead bias
- Consider cost of misclassification