2026-03-12

2026-03-12

  • Voice chat discussion on attack directions for Phase 2 of experiment
    • Shift focus from prompt injections → realistic social scenarios (things that happen naturally)
    • Prioritization framework: maximize quantity × badness (how many people affected × how bad if affected)
    • Key social dynamics to test: peer pressure, bullying, groupthink, pile-ons
    • Example: everyone is an ICE protester, then an ICE agent enters — social pressure dynamics
    • Draw from early Facebook/social network problems that platforms had to address
    • Avery interested in off-distribution slang and nihilism attacks (from Bijan’s ideas)
    • Brainstorm doc created: https://docs.google.com/document/d/1D96qVVi0hdrOR0WDCnf15wFlHZNi-Hwdi-DjymBpKok/edit
    • rjaditya suggested regrouping to sort ideas by impact and plan implementation
  • Gio reported website workspace edits sometimes disappear; prefers clone repo → edit locally → push → restart from website