Skip to content

Model Training

Planned Architecture (Future Phases)

ML model training on financial datasets is implemented in Phase 6 (Weeks 21–24).


Overview


Problem Formulation

Task Definition

Target Variable

Feature Set


Models

Baseline — XGBoost

Deep Learning — PyTorch


Training Pipeline

flowchart LR
    A[(PostgreSQL)] --> B[Feature Engineering]
    B --> C[Train / Val / Test Split]
    C --> D[Model Training]
    D --> E[Evaluation]
    E --> F[MLflow Registry]

Dataset


Evaluation Metrics

Metric Description
Accuracy / F1 Classification quality
MAE / RMSE Regression error
AUC-ROC Ranking quality for sentiment

Reproducibility