2026-03-12
- Voice chat discussion on attack directions for Phase 2 of experiment
- Shift focus from prompt injections → realistic social scenarios (things that happen naturally)
- Prioritization framework: maximize quantity × badness (how many people affected × how bad if affected)
- Key social dynamics to test: peer pressure, bullying, groupthink, pile-ons
- Example: everyone in the group is an ICE protester, then an ICE agent joins; tests social pressure dynamics
- Draw from early Facebook/social network problems that platforms had to address
- Avery interested in off-distribution slang and nihilism attacks (from Bijan’s ideas)
- Brainstorm doc created: https://docs.google.com/document/d/1D96qVVi0hdrOR0WDCnf15wFlHZNi-Hwdi-DjymBpKok/edit
- rjaditya suggested regrouping to sort ideas by impact and plan implementation
- Gio reported that edits made in the website workspace sometimes disappear; prefers workflow: clone repo → edit locally → push → restart from website
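
Possible sketch for the regroup: the quantity × badness framework above could be applied as a simple score when sorting brainstorm ideas by impact. Everything here is an assumption for illustration: the `AttackIdea` class, the `rank_ideas` helper, the 0-10 badness scale, and all numbers are hypothetical placeholders, not estimates from the discussion.

```python
from dataclasses import dataclass


@dataclass
class AttackIdea:
    """One brainstormed scenario (hypothetical structure, not from the notes)."""
    name: str
    people_affected: float  # placeholder estimate of how many people are affected
    badness: float          # placeholder severity if affected, assumed 0-10 scale

    @property
    def priority(self) -> float:
        # quantity x badness, as in the prioritization framework
        return self.people_affected * self.badness


def rank_ideas(ideas: list[AttackIdea]) -> list[AttackIdea]:
    """Return ideas sorted from highest to lowest priority score."""
    return sorted(ideas, key=lambda idea: idea.priority, reverse=True)


# Placeholder numbers only; real values would come from the regroup.
ideas = [
    AttackIdea("peer pressure", people_affected=1000, badness=3),
    AttackIdea("bullying", people_affected=200, badness=8),
    AttackIdea("groupthink", people_affected=1200, badness=2),
    AttackIdea("pile-on", people_affected=50, badness=9),
]

ranked = rank_ideas(ideas)
for idea in ranked:
    print(f"{idea.name}: {idea.priority}")
```

A plain product keeps the ranking easy to argue about; if the group prefers, weights or a log scale on `people_affected` could be swapped in without changing the rest.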