This repository implements a binary classification model using FinBERT to analyze the tone of earnings call transcripts and predict market sentiment (0 = negative/neutral, 1 = positive).
- Task: Binary sentiment classification on earnings call transcripts
- Model:
ProsusAI/finbertfrom Hugging Face - Loss: Weighted
BCEWithLogitsLossto address class imbalance - Data: Custom cleaned JSONL from earnings calls; optional weak labels
pip install requirements.txt
pip install jsonlines
python generate_gold_labels.py
python FinBERTToneEmbeddingClassifier.pyWe used ProsusAI/FinBERT and custom cleaned transcripts from the lamini earnings-calls-qa dataset (transcripts only).
