----------------------------------------| 1. DATA SOURCES |
| Twitter API (academic tier), Reddit |
| API, and HuggingFace multilingual |
| sentiment datasets provide diverse |
| textual content. These sources |
| capture real-time reactions and |
| long-term opinion shifts across |
| regions. Data is assessed for |
| linguistic variety and topic |
| distribution before ingestion. |
---------------------------------------- | v -------------------------------------- | 2. INGESTION & PREPROCESSING |
| Streaming pipelines deduplicate |
| posts, enforce rate limits, and |
| align timestamps for temporal |
| analysis. Text cleaning removes |
| noise, while language detection |
| and emoji normalization retain |
| semantic nuance. Token boundaries |
| and user metadata enrich |
| downstream modeling. |
-------------------------------------- | v---------------------------------------| 3. EMBEDDING & FEATURE EXTRACTION |
| Transformer models (BERT, RoBERTa) |
| generate contextual embeddings |
| enriched with topic clusters and |
| sentiment lexicon scores. Metadata |
| features such as user engagement, |
| location, and hashtag frequency |
| improve robustness. Representations |
| are batched for efficient high- |
| throughput processing. |
--------------------------------------- | v ------------------------------------- | 4. SENTIMENT MODELING |
| Fine-tuned Transformer |
| classifiers predict polarity and |
| emotion intensity. Cross- |
| validation ensures model |
| resilience across languages and |
| dialects. Ensemble blending |
| (BERT + SVM) stabilizes |
| predictions during trending |
| events with high linguistic |
| variance. |
------------------------------------- | v---------------------------------------| 5. AGGREGATION & DASHBOARDING |
| Sentiment signals are aggregated |
| by topics, regions, and time |
| windows to reveal emerging trends. |
| Power BI dashboards update in near |
| real time, enabling brand managers |
| to react early. Historical trends |
| support deeper longitudinal |
| analysis. |
--------------------------------------- | v ------------------------------------- | 6. ALERTING & FEEDBACK LOOP |
| Anomaly detectors highlight |
| sudden sentiment spikes linked |
| to global or local events. |
| Feedback from analysts refines |
| topic models and sentiment |
| lexicons. Performance decay |
| triggers retraining to maintain |
| accuracy. |
-------------------------------------