The is a massive collection of Reddit comments designed for training and evaluating sarcasm detection systems. Release Date: April 2017.
It relies on "self-annotation," where users explicitly mark their own sarcastic comments with a /s tag, ensuring the labels are highly accurate compared to manual annotation by third parties. Kompus 2017 Sarba
You can find more detailed research on this corpus and its applications in the sarcasm identification systematic review on ResearchGate . [1704.05579] A Large Self-Annotated Corpus for Sarcasm The is a massive collection of Reddit comments
A sarcastic sentence might use positive words (e.g., "Great job!") to convey a negative meaning. You can find more detailed research on this
Identifying sarcasm is one of the hardest tasks in AI because it requires understanding: Sarcasm often depends on what was said previously.
It is widely used to develop Transformer-based models and other machine learning techniques to help computers understand nuanced human communication like irony and humor. Why Sarcasm Detection Matters