Reddit “Dead Internet” Comment Dataset (2026)
This analysis examines 500 records of synthetic Reddit comment metadata, generated to simulate the “Dead Internet” environment of 2026.
Data Source: Kaggle — nudratabbas/the-dead-internet-theory-reddit-bot-vs-human
Key Variables:
- subreddit: Community where the comment was posted (e.g., r/funny, r/gaming)
- account_age_days: Account age in days, a signal of account maturity and reputation
- reply_delay_seconds: Seconds between the post and the comment; near-instant replies suggest automation
- is_bot_flag: Ground-truth label — Bot (218) or Human (282)
- bot_type_label: Bot sub-type — AI Summarizer, Engagement Farmer, Reprint Bot, or None (Human)
- bot_probability: Model-estimated probability of being a bot (0–1)
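
As a rough illustration of how these variables might be used together, the sketch below counts comments per label and compares median reply delay for bots and humans. It uses only the Python standard library. The column names come from the list above; everything else is an assumption: the sample rows are invented for illustration and are not taken from the dataset, the encoding of is_bot_flag as the strings "True" / "False" is a guess that may need adjusting, and summarize is a hypothetical helper name.

```python
import csv
import io
from collections import Counter
from statistics import median

# Illustrative rows only: these values are invented, not taken from the dataset.
# Assumption: is_bot_flag is stored as the strings "True" / "False".
SAMPLE_CSV = """subreddit,account_age_days,reply_delay_seconds,is_bot_flag,bot_type_label,bot_probability
r/funny,12,2,True,Engagement Farmer,0.91
r/gaming,840,310,False,None,0.08
r/funny,5,1,True,AI Summarizer,0.87
r/gaming,1500,95,False,None,0.15
"""


def summarize(rows):
    """Count comments per label and compute the median reply delay per label."""
    counts = Counter()
    delays = {}
    for row in rows:
        # Map the assumed string encoding onto the Bot / Human labels.
        label = "Bot" if row["is_bot_flag"] == "True" else "Human"
        counts[label] += 1
        delays.setdefault(label, []).append(float(row["reply_delay_seconds"]))
    medians = {label: median(values) for label, values in delays.items()}
    return counts, medians


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
counts, medians = summarize(rows)
print(counts)
print(medians)
```

To run this on the real file, the io.StringIO(SAMPLE_CSV) argument could be replaced with an opened file handle for the downloaded CSV. If the label encoding matches the assumption, the counts should come out to 218 Bot and 282 Human, which is a quick check that the file loaded correctly.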