SimPPL Library

Readings the team comes back to.

Every link we've shared in our monthly newsletter, in one place. Policy briefs, datasets, technical write-ups, news features, tutorials, papers. Use the search box, filter by category, or stack multiple tags to narrow further.

Send this shelf to a friend.

Technical
Tool, skill, or subagent? Decomposing an agent that outgrew its prompt
Anthropic engineers share how to modularize complex prompts by extracting business logic into skills to restore accuracy.
youtube.comJune 2026
agents llms youtube permalink →
Technical
FlashLib: Fast GPU Acceleration for Classical ML
A new GPU-optimized library accelerating classical machine learning operators with speedups up to 208x.
x.comJune 2026
infrastructure training twitter permalink →
Research
How Engagement Algorithms Distort Perceptions of Political Norms
An RCT study on Bluesky demonstrating how engagement algorithms over-amplify moral outrage by up to 79%.
linkedin.comJune 2026
platform-policy trust-and-safety moderation permalink →
Research
stable-worldmodel: An Open Platform for JEPA & World Model Research
Galilai Group & NYU's open-source Python framework to standardize JEPA and world modeling research.
x.comJune 2026
training twitter permalink →
Research
Human mobility in the metaverse mirrors patterns in the physical world
Scientific Reports study showing metaverse navigation patterns match the highly routine behaviors of the physical world.
nature.comJune 2026
metaverse social-networks permalink →
Policy
A Framework for Digital Safety: Designing Social Media Interventions
Action-oriented taxonomy of digital-safety interventions across focus, scope, driver, user journey.
papers.ssrn.comTPRC 2025
platform-policy trust-and-safety moderation permalink →
Research
Bridging Nodes and Narrative Flows on Telegram
Cross-community bridging metric on Telegram disinformation networks. SimPPL paper.
arxiv.orgarXiv preprint
telegram disinformation cib permalink →
News
MIT Delta V Demo Day — Sakhi spinout
MIT News write-up of Sakhi at Delta V Demo Day.
news.mit.eduMIT News, Sep 2024
health-ai india permalink →
Policy
Mozilla Responsible Computing Challenge — first awardees in India
Inaugural India RCC cohort announcement; SimPPL was the first awardee.
mozillafoundation.orgJune 2024
india platform-policy regulator permalink →
News
Rest of World — SimPPL on automation against misinformation
Three-minute interview ahead of Meta's Bangladesh CIB takedown.
restofworld.orgRest of World, 2024
disinformation global-south journalism permalink →
Research
How Facebook Has Become a Political Battleground in Bangladesh
Tech Global Institute / SimPPL Info Lab report on Bangladesh election Pages and Groups.
infolab.techglobalinstitute.comTech Global Institute Info Lab
facebook election-integrity global-south permalink →
Research
Multiagent Simulators for Social Networks
SimPPL position paper. Multiagent + LLM-driven simulation as the public-research bridge.
arxiv.orgICML 2023 workshop
agents llms cib permalink →
Technical
Pro-Russian bot networks — active vs deleted user network graphs
Part 2: how information disseminates through coordinated structures.
jhagrutlalwani.vercel.appSimPPL blog
twitter cib disinformation permalink →
Technical
Pro-Russian bot networks on Twitter — article-sharing analysis
Part 1 of Jhagrut Lalwani's two-part SimPPL blog on coordinated behaviour.
jhagrutlalwani.vercel.appSimPPL blog
twitter cib disinformation permalink →
Research
ICML 2022 — Estimating the Impact of Coordinated Inauthentic Behavior
Multiagent Reddit simulation quantifying CIB's effect on recommender outputs.
ora.ox.ac.ukICML AI4ABM Workshop
cib agents reddit permalink →
News
Marquise Mason on X: \"#UK High Learn Ltd We attacked https://t.co/OEKetEWsFr 01.02.25 We have received numerous data...
Ransomware crew '8BASE' announcing a UK target on X, illustrating how threat actors broadcast on X for OSINT monitors to pick up.
x.com
cybersecurity twitter osint permalink →
Data
deepdarkCTI/twitter_threat_actors.md at main · fastfire/deepdarkCTI · GitHub
Curated index of X accounts run by ransomware groups and cybercriminals — a seed list for OSINT cyber-threat monitoring.
github.com
cybersecurity osint datasets github permalink →
Research
Ethan Mollick on X: \"Everyone is starting to sound like AI, even in spoken language Analysis of 280,000 transcripts o...
Mollick highlights a study on 280K academic talk transcripts showing speakers increasingly use ChatGPT-favored words — model collapse for humans.
x.com
llms evaluation twitter permalink →
Technical
Akshay 🚀 on X: \"Big moment for Postgres! Search has always been Postgres' weak spot, and everyone just accepted it. I...
TigerData open-sourced pg_textsearch, bringing native BM25 ranking to Postgres so teams can drop their separate Elasticsearch clusters.
x.com
vector-search infrastructure twitter permalink →
Technical
Reciprocal rank fusion | Elasticsearch Reference
Elasticsearch reference on Reciprocal Rank Fusion — a tuning-free method for combining multiple retrievers into one ranked result set.
elastic.co
vector-search infrastructure permalink →
Technical
Origon — The Intelligence Infrastructure
Origon's enterprise-AI-agent platform: agents trained on customer data and deployed to private cloud or on-prem for regulated industries.
origon.ai
agents infrastructure permalink →
Technical
Mixedbread on X: \"We build the first production ready multi-vector and multimodal search. Now we are serving over 1 b...
Mixedbread serves 1B+ docs with multi-vector and multimodal search at sub-50ms p50, with a production engineering write-up forthcoming.
x.com
vector-search infrastructure twitter permalink →
Technical
GitHub - aiming-lab/SimpleMem: SimpleMem: Efficient Lifelong Memory for LLM Agents — Text & Multimodal · GitHub
SimpleMem: efficient open-source lifelong-memory implementation for LLM agents across text and multimodal inputs.
github.com
agents llms github permalink →
Technical
Sprites - Stateful sandboxes
Sprites: hardware-isolated persistent Linux sandboxes for running AI-agent code, with checkpoint and restore.
sprites.dev
agents infrastructure permalink →
News
Many Small Steps for Robots, One Giant Leap for Mankind
Packy McCormick co-essay with Standard Bots' Evan Beard on cumulative small advances in robotics adding up to a giant leap.
notboring.co
agents permalink →
Technical
GitHub - braedonsaunders/codeflow: Paste any GitHub URL → interactive architecture map. See how files connect, find w...
Paste any GitHub URL into CodeFlow and get an interactive architecture map showing how files connect — browser-only, no install.
github.com
coding-agents github permalink →
Technical
json-render | The Generative UI Framework
json-render: a generative-UI framework that turns LLM JSON output into safe predefined components and actions.
json-render.dev
llms agents permalink →
News
Deedy (@deedydas) on XClaude Code did a side project that took me ~2 weeks in 2025 in 30mins.
Deedy Das post claiming Claude Code completed in 30 minutes a side project that took him two weeks in 2025.
share.google
coding-agents twitter permalink →
Technical
GitHub - mggrim/scholarly-ideas: Research puzzle development tool - helps academics develop rigorous research puzzles...
Scholarly-ideas: an LLM-driven research-puzzle development tool that grounds academic puzzles in real empirical anomalies.
github.com
llms github permalink →
Technical
Tweet Topic Clusters â @deedydas
Embeddings + k-means + GPT-4o labeling applied to 6,912 of @deedydas's tweets to surface 25 thematic clusters.
debarghyadas.com
llms twitter evaluation permalink →
News
Meet Qualia - YouTube
Quadrillion Labs unveils Qualia, an agentic data-scientist product, in a teaser for its private beta.
youtube.com
agents youtube permalink →
Technical
E2B | The Enterprise AI Agent Cloud
E2B's enterprise AI-agent cloud, used by Perplexity, Hugging Face, Manus, Groq, and Lindy for sandboxed agent execution.
e2b.dev
agents infrastructure permalink →
Technical
GitHub - robert-mcdermott/ai-knowledge-graph: AI Powered Knowledge Graph Generator · GitHub
Open-source pipeline that turns unstructured text documents into LLM-generated knowledge graphs.
github.com
llms github permalink →
News
You are being manipulated: RageCheck is a free tool that analyzes online content for manipulative framings—language d...
Jay Van Bavel promotes RageCheck, a free tool that flags emotionally manipulative framing patterns in online content.
linkedin.com
disinformation fact-checking permalink →
Tutorials
Loss Landscapes: Saddles, Minima & Generalization | TensorTonic
Interactive walk-through of neural-network loss landscape geometry: convexity, saddle points, sharp vs flat minima.
tensortonic.com
training permalink →
Technical
Code Execution with MCP: Fix Tool Token Bloat (Adam Jones, Anthropic) - YouTube
Anthropic engineer Adam Jones on cutting MCP tool-token bloat by having agents write code to call tools rather than register every tool.
youtube.com
agents youtube permalink →
Technical
GitHub - huseyinbabal/taws: Terminal UI for AWS (taws) - A terminal-based AWS resource viewer and manager · GitHub
taws: a Rust-built terminal UI for AWS — navigate, observe, and manage resources without the console.
github.com
github infrastructure permalink →
Technical
AI SDK 6 - Vercel
AI SDK 6 from Vercel ships tool-execution approval, DevTools, native MCP, reranking, and image editing for TypeScript AI apps.
vercel.com
agents llms infrastructure permalink →
Technical
Code execution with MCP: building more efficient AI agents \\ Anthropic
Anthropic engineering case for letting agents call MCP servers via code execution, so tool definitions stay out of the context window.
anthropic.com
agents infrastructure llms permalink →
Technical
Merkle Mountain Ranges - Grin Documentation
Grin's docs on Merkle Mountain Ranges — an append-only alternative to Merkle trees used to store blockchain kernels and proofs.
docs.grin.mw
infrastructure permalink →
Technical
Databases in 2025: A Year in Review // Blog // Andy Pavlo - Carnegie Mellon University
Andy Pavlo's 2025-in-review on databases: Postgres momentum, MCP everywhere, MongoDB v FerretDB, file formats, and Turbopuffer's rise.
cs.cmu.edu
infrastructure vector-search permalink →
News
swyx 🌉 on X: \"the way that @turbopuffer started late but overtook Pinecone and ripped out 4-5m ARR contracts needs to...
swyx on Turbopuffer overtaking Pinecone with $4-5M ARR contracts despite a late start, citing Pavlo's 2025 databases retrospective.
x.com
vector-search twitter permalink →
Technical
Devi Parikh on X: \"Building agents that skim the web is easy. Building agents that go deep, go broad, stay on-point,...
Devi Parikh: Yutori's blog on lessons building deep, broad, on-point, 24/7 agents that power Scouts — useful for anyone building agents.
x.com
agents twitter permalink →
Technical
ChapterPal — AI-assisted reading and note-taking
ChapterPal: AI-assisted reading and note-taking app, marketing landing page.
chapterpal.com
llms permalink →
Technical
Boris Cherny on X: \"I'm Boris and I created Claude Code. Lots of people have asked how I use Claude Code, so I wanted...
Boris Cherny (creator of Claude Code) shares his vanilla setup and notes there's no one correct way to use the tool.
x.com
coding-agents twitter permalink →
Research
Randall Balestriero on X: \"We start having provable measures of alignment between pretraining setups and eval perfs:...
Balestriero highlights provable measures of alignment between pretraining setups and eval performance — early but promising work.
x.com
training evaluation twitter permalink →
Research
Seunghyun Seo on X: \"Inspired by this thread, I'd like to share my slides on training horizon scaling. Lately, lots o...
Seunghyun Seo's slides on training-horizon scaling, focusing on the role of weight decay (not just learning rate) when scaling.
x.com
training twitter permalink →
Research
HumanPlane/LACUNA · Hugging Face
LACUNA: a PPO RL agent trained to trade Polymarket 15-minute crypto markets by fusing Binance order flow with Polymarket orderbook data.
huggingface.co
training huggingface datasets permalink →
Technical
Jaya Gupta on X: \"AI’s trillion-dollar opportunity: Context graphs\" / X
Jaya Gupta argues 'context graphs' that capture decision traces (not just data) are AI's next trillion-dollar opportunity.
x.com
agents twitter permalink →
Technical
Animesh Koratana on X: \"How to build a context graph\" / X
Animesh Koratana on how to actually build a context graph — modeling decision traces is structurally hard, not 'add memory to your agent'.
x.com
agents twitter permalink →
Tutorials
Manthan Gupta on X: \"How to Use LLM as a Judge (Without Getting Burned)\" / X
Practical guide: when to use LLM-as-judge, with five rules (reference-based, debiasing, ensembling, reasoning before scoring, calibration).
x.com
evaluation llms twitter permalink →
Technical
discovering the postrat canon in the community archive | lab notes #4
Lab notes from Epistemic Garden on deriving narrative strands from quote-tweets + semantic search across the postrat community archive.
xiqo.substack.com
llms datasets permalink →
News
AI Agents on WhatsApp: Scalable Support with ElevenLabs - YouTube
ElevenLabs becomes an official WhatsApp technology provider, deploying human-like voice agents inside WhatsApp business chats.
youtube.com
agents voice youtube permalink →
Tutorials
Miguel Ángel Pastor on X: \"A hardware-aware guide to data structures for system software engineers. https://t.co/cW77...
A hardware-aware guide to data structures for systems engineers, shared by Miguel Pastor.
x.com
infrastructure github twitter permalink →
Tutorials
Karan Lokchandani on X: \"@techNmak https://t.co/cexRv5mueF\" / X
Karan Lokchandani shares an open LLM-interview-questions PDF — handy prep for ML hiring loops.
x.com
llms twitter permalink →
Technical
Rate limits | Gemini API | Google AI for Developers
Google Gemini API rate-limit reference: RPM, TPM, RPD across tiers, model-specific caps, and project-level scoping.
ai.google.dev
llms infrastructure permalink →
Technical
Google AI Studio’s Interactions API for Gemini models and agents
Google AI Studio launches the Interactions API, a unified foundation for building Gemini-based models and agents in public beta.
blog.google
agents llms permalink →
News
Perplexity
Perplexity Page on a malicious VPN extension allegedly stealing ChatGPT credentials (paywalled or login-gated).
perplexity.ai
cybersecurity permalink →
Tutorials
LangChain on X: \"🚀☁️ Deploying LangChain Agents on AWS Serverless Made by the LangChain Community Thomas Taylor deplo...
LangChain shares Thomas Taylor's tutorial on deploying stateful LangGraph agents on AWS Lambda with DynamoDB checkpointing and CDK.
x.com
agents infrastructure twitter permalink →
Tutorials
.NET Concurrency: Async/Await Explained | Bharat Biyani posted on the topic | LinkedIn
Bharat Biyani's visual guide to .NET concurrency: state machine, stack vs heap, async vs parallel, the .Result deadlock trap.
linkedin.com
infrastructure permalink →
Tutorials
Design Patterns: Solutions to Common Software Problems | Bharat Biyani posted on the topic | LinkedIn
Bharat Biyani's design-patterns explainer: why they exist, what problems they solve, when (and when not) to use them, interview framing.
linkedin.com
infrastructure permalink →
Research
Belinda on X: \"What if you could watch an AI Scientist think? We built an interface to make @SakanaAILabs’s AI Scient...
Belinda's interpretable-interface project for Sakana's AI Scientist-v2 lets you watch every hypothesis, failed experiment, and 'aha'.
x.com
agents interpretability twitter permalink →
Research
What 2026 looks like — LessWrong
Daniel Kokotajlo's 2021 LessWrong post extrapolating AI futures year-by-year through 2026 — frequently cited AI-timeline vignette.
lesswrong.com
safety llms permalink →
Research
AI 2027
AI 2027: a detailed scenario forecasting superhuman AI's next-decade impact, with slowdown vs race endings and quantitative trend extrapolations.
ai-2027.com
safety agents permalink →
Technical
xjdr on X: \"# Why Training MoEs is So Hard recently, i have found myself wanting a small, research focused training r...
xjdr on why training MoEs under 20B params is hard: flop efficiency, load-balancing/router stability, and data quality/quantity.
x.com
training twitter permalink →
Tutorials
Archie Sengupta on X: \"distributed_gpu_training\" / X
Archie Sengupta's distributed-GPU-training explainer: from a single GPU's streaming multiprocessors to slicing models across racks.
x.com
training infrastructure twitter permalink →
Research
c++ design patterns for high frequency trading
arXiv preprint on C++ design patterns for high-frequency trading systems.
arxiv.org
infrastructure permalink →
Tutorials
eric zakariasson on X: \"the internal guide that technically lite team members go through when joining cursor https://...
Eric Zakariasson shares Cursor's internal onboarding guide for non-technical team members joining the company.
x.com
coding-agents twitter permalink →
Technical
Ax - Build Reliable AI Apps in TypeScript
Ax: a TypeScript DSPy port — declare signatures instead of prompts, with auto-prompt-tuning, GEPA optimizer, and 15+ LLM providers.
axllm.dev
llms agents permalink →
Technical
Preserve progress despite interruptions - AWS Lambda durable functions - AWS
AWS Lambda durable functions: automatic checkpointing, suspend execution up to a year, recover from failures, no infra to manage.
share.google
agents infrastructure permalink →
Technical
Clerk Billing
Clerk Billing: drop-in React components for B2C and B2B subscription billing without writing payment-integration or UI code.
clerk.com
infrastructure permalink →
Technical
Polar — A billing platform for the intelligence era | Polar
Polar: usage-based billing platform for the AI era — meter tokens, API calls, compute, GPU workloads.
polar.sh
infrastructure agents permalink →
Technical
K-Dense on X: \"Go from a dataset to a ready to submit manuscript in 1 day with our Claude Scientific Skills and Claud...
K-Dense ships free Claude Scientific Skills + Writer that take a dataset to a submission-ready manuscript in one day.
x.com
agents coding-agents twitter permalink →
Technical
Thariq on X: \"We built a Deep Research demo for the Claude Agent SDK! It's one our most requested use cases: spawn mu...
Thariq launches a Deep Research demo for the Claude Agent SDK: parallel agents researching a topic and synthesizing into a report.
x.com
agents twitter permalink →
News
refine | Pedro Sant'Anna
Pedro Sant'Anna recommends refine.ink: AI tool that proof-checks academic papers for internal consistency, typos, and equation proofs.
linkedin.com
llms permalink →
Tutorials
psmpy: Propensity Score Matching in Python! | Towards Data SciencePerforming propensity score matching in a python en...
Towards Data Science walkthrough of psmpy, a Python library for propensity-score matching (paywalled, fetch failed).
towardsdatascience.com
evaluation permalink →
Technical
GitHub - leoheuler/flashtensors · GitHub
flashtensors: run 100 large models on a single GPU with minimal time-to-first-token impact via tensor swap.
github.com
infrastructure llms github permalink →
Technical
Lakshya A Agrawal on X: \"GEPA featured in @OpenAI and @BainandCompany new cookbook tutorial, showing how to build sel...
GEPA featured in the OpenAI x Bain cookbook tutorial on building self-evolving agents that move beyond static prompts.
x.com
agents twitter training permalink →
Technical
Modal: High-performance AI infrastructure
Modal: high-performance AI infrastructure with sub-second cold starts, instant autoscaling, elastic GPU access without quotas.
modal.com
infrastructure training permalink →
Technical
GitHub - MoonshotAI/kosong: The LLM abstraction layer for modern AI agent applications. · GitHub
Kosong: Moonshot AI's open-source LLM abstraction layer for modern AI agent applications (now part of the kimi-cli monorepo).
github.com
agents llms github permalink →
Technical
GitHub - microsoft/agent-framework: A framework for building, orchestrating and deploying AI agents and multi-agent w...
Microsoft's Agent Framework: open-source framework for building, orchestrating, and deploying AI agents and multi-agent workflows in Python and .NET.
github.com
agents github permalink →
Technical
Agent Development Kit (ADK) - Agent Development Kit (ADK)
Google's Agent Development Kit (ADK): production-ready open-source agent framework in Python, TypeScript, Go, and Java.
google.github.io
agents permalink →
Research
AI impact on growth: economists vs AI experts | Tom Cunningham posted on the topic | LinkedIn
Tom Cunningham compares AI-impact-on-growth forecasts from economists (0.1-1.5%/yr) vs AI experts (3-30%/yr) — large disagreement, debate why.
linkedin.com
safety permalink →
Technical
Introduction - supermemory | Memory API for the AI era
Supermemory: a memory + RAG API for the AI era with graph memory, content types, multi-tenancy, and SDK integrations.
supermemory.ai
agents vector-search infrastructure permalink →
News
Santiago on X: \"The MiniMax M2 model is mind-blowing! It's open-source. It outperforms Gemini 2.5, Claude 4.1, and Qw...
Santiago: MiniMax M2 (open-source) outperforms Gemini 2.5, Claude 4.1, Qwen3 on coding/tool-use benchmarks at ~8% of Claude's cost.
x.com
llms evaluation twitter permalink →
Technical
Akshay 🚀 on X: \"Microsoft did it again! Building with AI agents almost never works on the first try. You spend days t...
Akshay highlights Microsoft's open-source Agent Lightning: train any AI agent (LangChain, AutoGen, CrewAI, etc.) with RL and prompt optimization.
x.com
agents training twitter permalink →
News
Google's NotebookLM: 8x bigger brain, custom goals, and more | Neil Hoyne posted on the topic | LinkedIn
Neil Hoyne summarizes NotebookLM's upgrade: 1M-token context, 6x longer memory, custom personas, deeper research.
linkedin.com
llms permalink →
Research
How to use behavioural science in social media for lasting change | Aleksandra Kuzmanovic posted on the topic | LinkedIn
WHO's Aleksandra Kuzmanovic on behavioural-science-informed framing for health communication on social media: a viewpoint with Meta researchers.
linkedin.com
health-ai disinformation permalink →
Research
Time to start looking into pangram’s models for AI generated text detection. Seems like it
BFI Chicago working paper on Pangram's models for AI-generated text detection (PDF, page is binary so excerpt unparsed).
bfi.uchicago.edu
llms evaluation permalink →
Technical
Johann Schopplich on X: \"JSON is token‑expensive for LLMs – just like @mattpocockuk frequently mentions. Meet TOON, t...
Johann Schopplich introduces TOON: Token-Oriented Object Notation, a JSON-like format that's 40-60% fewer tokens for LLMs.
x.com
llms infrastructure twitter permalink →
Technical
Elastic Dev on X: \"All you need is a natural language agent definition, and you have a custom AI assistant to help yo...
Elastic Dev shows Agent Builder: define an agent in natural language and get a custom AI assistant for Elastic data.
x.com
agents vector-search twitter permalink →
Technical
LangChain on X: \"🔍🤖 Enterprise Deep Research A multi-agent system leveraging LangGraph to power enterprise research a...
LangChain shares Salesforce AI Research's Enterprise Deep Research, a multi-agent system on LangGraph with streaming and human steering.
x.com
agents twitter permalink →
Research
Ai2 on X: \"On olmOCR-Bench, olmOCR 2 scores 82.4 points, up from 78.5 in our previous release—increasing performance...
AI2 olmOCR 2 scores 82.4 on olmOCR-Bench (up from 78.5), with gains across every document category.
x.com
evaluation llms twitter permalink →
Technical
Meilisearch: Unified Search & AI Retrieval Platform
Meilisearch: unified search and AI-retrieval platform with sub-50ms full-text, semantic, hybrid, and multi-modal search.
meilisearch.com
vector-search infrastructure permalink →
Tutorials
ES|QL query builder for Python Elasticsearch Client - Elasticsearch Labs
Elastic Labs blog: ES|QL query builder for the Python Elasticsearch client (8.19+) with familiar Python syntax.
share.google
vector-search infrastructure permalink →
Technical
GitHub - atiilla/GeoIntel: GeoIntel using Google's Gemini API to uncover the location where photos were taken through...
GeoIntel: Python tool using Google's Gemini API to uncover photo locations through AI-powered geo-location analysis.
github.com
osint github multi-modal permalink →
Technical
OpenAgent - The Open Source Agentic AISearch, think, and complete general tasks — Open-agent is a multimodal, agentic...
OpenAgent: open-source multimodal agentic AI for search, thinking, and general tasks (page returned empty).
open-agent.io
agents multi-modal permalink →
Technical
GitHub - elevenlabs/ui: ElevenLabs UI is a component library and custom registry built on top of shadcn/ui to help yo...
ElevenLabs UI: a shadcn-based component library and registry for building multimodal voice agents faster.
github.com
agents multi-modal github voice permalink →
Tutorials
How BookMyShow handled 1M ColdPlay ticket requests in 10 mins | Animesh Gaitonde posted on the topic | LinkedIn
Animesh Gaitonde on how BookMyShow handled 1M ColdPlay ticket requests in 10 minutes: pessimistic vs optimistic vs in-memory locking.
linkedin.com
infrastructure permalink →
Technical
TwelveLabs: Video Intelligence Platform & API
TwelveLabs: video-intelligence platform with 60x real-time ingest, indexing 10k hours/day; turn raw footage into searchable AI-ready data.
twelvelabs.io
multi-modal vector-search infrastructure permalink →
Technical
Aydyn Tairov on X: \"OpenZL - outperforms zstd, xz, gzip, and Blosc on multiple real-world datasets with 10x (!!!) spe...
Aydyn Tairov highlights Meta's OpenZL data-compression framework — graph-based composition of existing algorithms, 10x speedup over zstd.
x.com
infrastructure twitter permalink →
News
Google removes num=100 search parameter, impacting startups and LLMs | Adarsh Appaiah posted on the topic | LinkedIn
Adarsh Appaiah: Google removed the num=100 search parameter, cutting LLM-accessible long-tail results 90% and dropping Reddit's stock 15%.
linkedin.com
llms platform-policy permalink →
News
Maxime Labonne on X: \"LFM2-Audio just dropped! It's a 1.5B model that understands and generates both text and audio I...
Liquid AI ships LFM2-Audio, a 1.5B model handling text and audio with 10x faster inference and parity with 10x larger models.
x.com
llms voice twitter permalink →
Data
ToolUniverse — 1,000+ Scientific Tools for AI Scientists
ToolUniverse: a registry of 1,000+ scientific tools for AI Scientist agents.
aiscientist.tools
agents datasets permalink →
Technical
Lingo.dev – The localization engineering platform
Lingo.dev: localization-engineering platform that persists glossaries, brand voice, and per-locale model chains as a stateful translation API.
lingo.dev
llms infrastructure permalink →
Technical
Parsera - Transform Websites into Data
Parsera: agent-based and API-based scraping that turns any website into a custom dataset via natural-language prompts.
parsera.org
agents datasets infrastructure permalink →
Technical
Refine - AI-Powered Research Assistant
Refine.ink: AI-powered peer-review tool that flags accuracy, math, and internal-reference errors in research papers.
refine.ink
llms evaluation permalink →
Technical
Turing Post on X: \"An open-source extension for LLM serving engines – LMCache It's like a caching layer for large-sca...
Turing Post on LMCache, an open-source KV-cache management layer for LLM serving — 4-10x reduction in RAG, lower TTFT, integrated with NVIDIA Dynamo.
x.com
infrastructure llms twitter permalink →
Technical
Harrison Chase on X: \"Deep Agents - now on LangChain 1.0 We rewrote Deep Agents on top of LangChain 1.0, heavily util...
Harrison Chase: Deep Agents now run on LangChain 1.0 using new middleware — technical deep dive on what they are and how to use them.
x.com
agents twitter permalink →
Research
Valeriy M., PhD, MBA, CQF on X: \"🚀 Tsururu: A New Python Library for Time Series Forecasting (arXiv:2509.15843v1) Tsu...
Tsururu (arXiv 2509.15843): a Python time-series-forecasting library focused on strategies (recursive/direct/MIMO/hybrid) and preprocessing.
x.com
evaluation training twitter permalink →
News
X (link)
X post no longer available (deleted/missing page).
x.com
twitter permalink →
Technical
Dub - The Modern Link Attribution Platform
Dub: modern link-attribution platform for short links, conversion tracking, and affiliate programs.
dub.co
infrastructure permalink →
Technical
Akshay 🚀 on X: \"Finally, an open-source, enterprise-grade RAG solution! If you're building an enterprise-grade RAG sy...
Akshay highlights MindsDB Knowledge Bases: open-source enterprise RAG over 200+ data sources with embeddings, reranking, real-time sync.
x.com
agents vector-search twitter permalink →
Technical
Adalat AI - End-to-End Justice Tech Stack
Adalat AI: India's end-to-end justice tech stack — courtroom transcription, case-lifecycle management, real-time updates.
adalat.ai
agents india health-ai permalink →
News
Today on Indicator: I found that 53 TikTok videos pushing clickbaity hoaxes about Charlie Kirk's assassination got al...
Alexios Mantzarlis: 53 TikTok hoax videos about Charlie Kirk's assassination got 32M views in three days; TikTok took 48 down after disclosure.
linkedin.com
disinformation tiktok fact-checking permalink →
Data
GitHub - allenai/awesome-open-source-lms: Friends of OLMo and their links. · GitHub
Allen AI's curated list of open-source language models — links and resources from the OLMo team's NeurIPS 2024 tutorial.
github.com
llms datasets github permalink →
News
Crazy random line in the Cursor RL blog post saying they're collecting RL data from real users, updating the checkpoi...
Nathan Lambert flags Cursor's RL blog: collecting RL data from real users and updating checkpoints every 90-120 minutes — unthinkable a year ago.
linkedin.com
coding-agents training rlhf permalink →
Research
Geoffrey Litt on X: \"If you're thinking about AI-generated UIs, recommend checking out JELLY by @YiningCao3, @peiling...
Geoffrey Litt on JELLY: structured AI-generated UIs that first build a data schema users can edit, then compose UIs from premade widgets.
x.com
llms agents twitter permalink →
Technical
Strands AgentsAI-powered agents for modern workflows
Strands Agents: AI-powered agents for modern workflows (page returned empty).
strandsagents.com
agents permalink →
Technical
Advanced Context Engineering for Agents - YouTube
Dexter Horthy (Human Layer) on advanced context engineering for agents — spec-first, compaction strategies, subagents, planning workflows.
youtube.com
agents coding-agents youtube permalink →
Technical
Using LongMemEval to Improve Agent Memory - YouTube
Sam Bhagwat (Mastra) on LongMemEval: tailored templates and targeted updates yield SOTA results on agent-memory benchmarks.
youtube.com
agents evaluation youtube permalink →
Technical
Context Engineering for Engineers - YouTube
Jeff Huber (Chroma) on context engineering: filtering and compaction matter more than long-context windows for reliable agent performance.
youtube.com
agents llms youtube permalink →
Technical
Context Engineering: Lessons Learned from Scaling CoCounsel - YouTube
Jake Heller on scaling CoCounsel — context-engineering lessons from building professional-grade legal AI from GPT-4 onward.
youtube.com
agents llms youtube permalink →
Technical
Agentuity — The Full-Stack Platform for AI Agents
Agentuity: full-stack platform for AI agents — typed APIs, frontends, sandboxes, evals, OpenTelemetry observability, evals on live traffic.
agentuity.com
agents infrastructure permalink →
News
Meet Chatterbox Multilingual: An Open-Source Zero-Shot Text To Speech (TTS) Multilingual Model with Emotion Control a...
Resemble AI's Chatterbox Multilingual: production-grade open-source zero-shot TTS in 23 languages with emotion control and watermarking.
marktechpost.com
voice multi-modal permalink →
News
smol ai (follow @latentspacepod for ainews) on X: \"[5 Sept 2025] Kimi K2‑0905 and Qwen3‑Max preview: two 1T open weig...
smol AI: Kimi K2-0905 and Qwen3-Max-preview both launched as 1T-parameter open-weight models on the same day.
x.com
llms twitter permalink →
Technical
Neo - Autonomous AI Agent to build and evaluate AI models, AI Agents, LLM prompts and ML systems
Neo: autonomous ML-engineer agent that automates training, fine-tuning, RAG pipeline construction, and evaluation.
heyneo.so
agents training coding-agents permalink →
Technical
Agentic Design Patterns - Google Docs
Agentic Design Patterns Google Doc (login-gated; can't read content).
docs.google.com
agents permalink →
Technical
Do the simplest thing that could possibly work
Sean Goedecke on system design: do the simplest thing that could possibly work, in fixing bugs, maintaining systems, and architecting new ones.
seangoedecke.com
infrastructure permalink →
Data
Social Forest - #1 YouTube Data API & YouTube Scraper Alternative
Social Forest: YouTube Data API and YouTube Scraper alternative for accessing YouTube data at scale.
social-forest.com
youtube datasets permalink →
Technical
Charlie Marsh on X: \"Today, we're announcing our first hosted infrastructure product: pyx, a Python-native package re...
x.com
twitter permalink →
Technical
AngeTheGreat - YouTube
youtube.com
youtube permalink →
Technical
Overview | Embedding Atlas
apple.github.io
vector-search permalink →
Technical
How Roblox Partners With Law Enforcement | Roblox
corp.roblox.com
twitter journalism permalink →
Technical
Daytona - Secure Infrastructure for Running AI-Generated Code
daytona.io
agents permalink →
Technical
GitHub - getzep/graphiti: Build Real-Time Knowledge Graphs for AI Agents · GitHub
github.com
agents rag coding-agents github permalink →
Technical
Min Choi on X: \"@sama gpt-oss-20b running on 16GB GPU? 🤔\" / X
x.com
twitter permalink →
Data
openai/gpt-oss-120b · Hugging Face
huggingface.co
agents llms evaluation training permalink →
Technical
GPU graph rendering test - YouTube
youtube.com
youtube permalink →
Technical
LLM Evals: Everything You Need to Know – Hamel’s Blog - Hamel Husain
hamel.dev
llms evaluation training coding-agents permalink →
Technical
Speeding Up the Webcola Graph Viz Library with Rust + WebAssembly - Casey Primozic's Homepage
cprimozic.net
systems permalink →
Technical
Sid on X: \"Working on a side project with Claude called Granter that takes in info about your org, scans through avai...
x.com
twitter permalink →
Technical
GitHub - chiphuyen/sniffly: Claude Code dashboard with usage stats, error analysis, and sharable feature · GitHub
github.com
coding-agents github permalink →
Technical
User Embeddings: How TikTok Knows You Better Than You Do - YouTube
youtube.com
agents llms training vector-search permalink →
Technical
Satya Nadella on X: \"Today we’re releasing GitHub Spark — a new tool in Copilot that turns your ideas into full-stack...
x.com
coding-agents twitter permalink →
Data
Any_to_Any_RAG.ipynb · merve/smol-vision at main
huggingface.co
huggingface permalink →
Data
Qwen/Qwen2.5-Omni-7B · Hugging Face
huggingface.co
evaluation vector-search multi-modal huggingface permalink →
Technical
Cloudflare launches pay-per-crawl, a game changer for SaaS and content creators | Greg Isenberg posted on the topic |...
linkedin.com
linkedin permalink →
Technical
Ever wonder how platforms like TikTok and YouTube decide what your teen sees? We are developing a tool called Algorit...
linkedin.com
voice youtube tiktok permalink →
Technical
Why LinkedIn News Feed Is Showing Old Posts - Business Insider
businessinsider.com
bluesky facebook permalink →
Technical
The Big LLM Architecture Comparison
magazine.sebastianraschka.com
llms evaluation vector-search multi-modal permalink →
Data
Open ASR Leaderboard - a Hugging Face Space by hf-audio
huggingface.co
evaluation huggingface permalink →
Technical
secemp on X: \"I couldn't believe whisper was SOTA and then found out there is actually a better model from nvidia (WE...
x.com
twitter permalink →
Technical
Indian Tech & Infra on X: \"🚨 Perplexity Pro is offering 12 months FREE exclusively to Airtel users in India. https://...
x.com
twitter india permalink →
Technical
Amazon S3 Vectors
aws.amazon.com
agents rag vector-search permalink →
Technical
New Health AI Models for Developers: MedGemma 27B and MedSigLIP | Google Research posted on the topic | LinkedIn
linkedin.com
rag multi-modal health-ai permalink →
News
Researchers Jailbreak AI by Flooding It With Bullshit Jargon
404media.co
llms safety twitter bluesky permalink →
Technical
Beautiful themes for shadcn/ui — tweakcn | Theme Editor & Generator
tweakcn.com
design permalink →
Technical
Me: I want an LLM server for 500 users TikTok: You have an LLM server for 500'000 users at home AIBrix, courtesy of T...
linkedin.com
llms infrastructure coding-agents tiktok permalink →
Technical
Lloom: LLM-based concept induction on political-social media content
stanfordhci.github.io
evaluation permalink →
Technical
Discover Web apps | Mobbin
mobbin.com
design permalink →
Technical
DeepSeek-R1-0528: How to Run Locally | Unsloth Documentation
docs.unsloth.ai
llms evaluation training vector-search permalink →
Technical
No credit card or crazy infra needed anymore. 🦥 Just Unsloth and a Colab notebook with a T4 GPU. Fine-tuning massive,...
linkedin.com
llms training multi-modal voice permalink →
Technical
How I built a LinkedIn agent to find profiles fast | Abhijay Vuyyuru posted on the topic | LinkedIn
linkedin.com
agents youtube permalink →
Technical
GitHub - stanford-oval/storm: An LLM-powered knowledge curation system that researches a topic and generates a full-l...
github.com
llms vector-search github permalink →
Technical
WhatsApp AI Chatbot to give instant, accurate answers 24/7
joyz.ai
facebook permalink →
Technical
LangChain overview - Docs by LangChain
python.langchain.com
agents llms safety infrastructure permalink →
Technical
Joint Retrieval and Recommendation Modeling - by Janu Verma
januverma.substack.com
vector-search permalink →
Technical
Exa | Web Search API, AI Search Engine, & Website Crawler
exa.ai
llms evaluation coding-agents permalink →
Technical
User Guide
gerrit.wikimedia.org
wikimedia permalink →
Technical
How Israel uses Google Ads in its information offensive against Iran
indicator.media
privacy policy permalink →
Technical
ChatPDF AI | Chat with any PDF | Free
chatpdf.com
youtube permalink →
Technical
GitHub - HumanSignal/awesome-data-labeling: A curated list of awesome data labeling tools · GitHub
github.com
github permalink →
Technical
GitHub - AykutSarac/github-rater: 📊 Check your GitHub rating, view results and enhance your profile quality. · GitHub
github.com
github permalink →
Technical
Seedance 1.0 Lite | Text to Video | fal.ai
fal.ai
llms permalink →
Technical
ChatGPT convinced 3 people to do ketamine, fall in love with it and pushed them to domestic violence. Silicon can now...
linkedin.com
llms permalink →
Technical
Hugging Face drops support for Google's AI frameworks | Gaurav Jain posted on the topic | LinkedIn
linkedin.com
agents infrastructure huggingface permalink →
Technical
jason on X: \"3M downloads per month 11k stars 0 money raised 1.4M top line revenue for https://t.co/YrBEtDDpo9 thank...
x.com
twitter permalink →
Technical
How a Danish News Service Made a Profit with its Transcription Tool | by Clare Spencer | Generative AI in the Newsroom
generative-ai-newsroom.com
youtube journalism permalink →
Technical
Spinach Wikidata
spinach.genie.stanford.edu
knowledge-graphs permalink →
Technical
GitHub - stanford-oval/spinach: SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions · G...
github.com
llms github permalink →
Technical
DeerFlow
deerflow.tech
llms youtube permalink →
Technical
Josh Miller on X: \"You can also get unique context into AI chat from within tabs too. ex: Highlight text on a tab to...
twitter.com
twitter permalink →
Technical
GitHub - triton-inference-server/server: The Triton Inference Server provides an optimized cloud and edge inferencing...
github.com
github permalink →
Technical
Phil Eaton on X: \"Great paper on how Google runs a commercial research lab, courtesy of Ankush Menat. Would love to r...
twitter.com
twitter permalink →
Technical
Philrox on X: \"As promised, here's the update on my AI powered SEO keyword research workflow from The vibe Marketer I...
twitter.com
twitter permalink →
Technical
Denislav Jeliazkov on X: \"I design apps for a living. I've spent 6+ years using every FinTech app on the market. Here...
twitter.com
twitter permalink →
Technical
Charly Wargnier on X: \"This is crazy. Nate Herkelman turned @n8n_io into a full marketing team! 🤯 His AI agent: ↳ gen...
twitter.com
agents llms twitter permalink →
Technical
AI just killed PowerPoint. 😱 No more endless hours creating PPT. Here are 10 websites to create presentations with AI...
linkedin.com
telegram cybersecurity permalink →
Technical
bilal on X: \"r/localllama anons huffman encoding model weights and inventing a new FP format DFloat11 so they can fit...
twitter.com
twitter permalink →
Technical
How I use AI playlist - YouTube
youtube.com
coding-agents youtube disinformation permalink →
Technical
merve on X: \"Don't sleep on this! 🔥 @Meta dropped swiss army knives for vision with A2.0 license ❤️ > image/video...
twitter.com
twitter github huggingface permalink →
Technical
GitHub - kortix-ai/suna: The Autonomous Company Operating System · GitHub
github.com
agents llms coding-agents twitter permalink →
Technical
Visualizing PyTorch DTensor Sharding with JAX | Yi Wang posted on the topic | LinkedIn
linkedin.com
llms infrastructure github huggingface permalink →
Technical
Pretty WILD - SoTA open source TTS model that beats ElevenLabs/ Sesame - Dia 1.6B - Apache 2.0 licensed! 🔥 > Ultra re...
linkedin.com
agents voice huggingface permalink →
Technical
GitHub - birobirobiro/awesome-shadcn-ui: A curated list of awesome things related to shadcn/ui. · GitHub
github.com
voice github permalink →
Technical
Foundation Model for Personalized RecommendationBy Ko-Jen Hsiao, Yesu Feng and Sudarshan Lamkhede
netflixtechblog.com
llms recsys permalink →
Technical
PikTop
pik.top
evaluation tiktok permalink →
Technical
Pravda Dashboard — Tracking Russia's Pravda Network
solatrix.github.io
github privacy geopolitics permalink →
Technical
Pravda in numbers - Content and Network analysis | Amaury L.
linkedin.com
disinformation permalink →
Technical
Goodfire raises $50M Series A for AI understanding | Eric Ho posted on the topic | LinkedIn
linkedin.com
llms interpretability permalink →
Technical
Potato | Accelerate Scientific Execution
readysetpotato.com
llms permalink →
Technical
When building LLM-based applications that use RAG (Retrieval-Augmented Generation), splitting documents into small *c...
linkedin.com
llms rag permalink →
Technical
GitHub - browserbase/stagehand: The SDK For Browser Agents · GitHub
github.com
llms coding-agents github permalink →
Technical
The agents are coming and we can't catch them.
alexmreinhart.substack.com
agents youtube permalink →
Technical
Skilled Coder on X: \"Backend System Design for Rate Limiter (This is a high-level overview to help you understand how...
twitter.com
twitter permalink →
Technical
Convert FastAPI to MCP server with FastAPI-MCP | Akshay Pachaar posted on the topic | LinkedIn
linkedin.com
coding-agents github permalink →
Technical
Build an equity research agent with LlamaCloud and o3 | LlamaIndex posted on the topic | LinkedIn
linkedin.com
agents evaluation multi-modal permalink →
Technical
kwindla on X: \"We wrote down everything we've learned building voice AI agents over the past two years. Core technolo...
twitter.com
agents evaluation safety multi-modal permalink →
Technical
Clustering Documents with OpenAI embeddings, HDBSCAN and UMAP – Dylan Castillo
dylancastillo.co
llms vector-search tiktok permalink →
Technical
Manifold
manifold.markets
llms disinformation moderation privacy forecasting geopolitics open-source permalink →
Technical
DSPy | 🦜️🔗 LangChainDSPy is a fantastic framework for LLMs that introduces an automatic compiler that teaches LMs how...
python.langchain.com
llms permalink →
Data
Prompt Engineering Whitepaper (Kaggle / Google)
kaggle.com
datasets permalink →
Technical
Google Transparency Report | Zoe Darmé
linkedin.com
trust-and-safety permalink →
Technical
Compare Virtual Private Servers (VPS) by Price & Features | servers.fyi
servers.fyi
infrastructure permalink →
Technical
Note ranking algorithm
communitynotes.x.com
twitter permalink →
Technical
Social Media Behaviour
upb-ss1.github.io
facebook cib journalism permalink →
Technical
Raindrop | AI Agent Monitoring & Observability
dawnai.com
agents evaluation permalink →
Technical
Why MCP Won - Latent.Space
latent.space
infrastructure coding-agents twitter permalink →
Technical
TAO: Using test-time compute to train efficient LLMs without labeled data | Databricks Blog
databricks.com
llms evaluation training permalink →
Technical
Reve Image - AI Image Generator and Creative Tool
preview.reve.art
image-gen permalink →
Technical
Perplexity
perplexity.ai
llms twitter permalink →
Technical
Releases · aria2/aria2 · GitHub
github.com
github permalink →
Technical
🚀 Big news for anyone building AI agents - we’ve built the fastest way to deploy AI Agents! In just seconds, you can...
linkedin.com
agents llms permalink →
Technical
Briefer (YC S23) is launching its AI analyst—an intelligent agent that helps anyone on your team turn data into clear...
linkedin.com
agents platform-policy permalink →
Technical
John Horton on X: \"Gave a short, impromptu talk on working / Claude Code & LLM code generation generally 1/ https...
twitter.com
llms coding-agents twitter permalink →
Technical
What is the Model Context Protocol (MCP)? - Model Context Protocol
modelcontextprotocol.io
llms coding-agents permalink →
Technical
AI Workflow Automation Platform - n8n
n8n.io
agents permalink →
Technical
Independent Podcast & Audio Ad Measurement with Pixel-Based Attribution, Incrementality Testing & Cross-Channel Insights
podscribe.com
youtube permalink →
Technical
Gemini Embedding: Generalizable Embeddings from Gemini â Google DeepMind
deepmind.google
llms evaluation vector-search permalink →
Technical
Choose Boring Technology
boringtechnology.club
infrastructure permalink →
Technical
Here’s how I use LLMs to help me write code
simonwillison.net
llms coding-agents permalink →
Technical
LLM: A CLI utility and Python library for interacting with Large Language Models
llm.datasette.io
llms vector-search infrastructure youtube permalink →
Technical
Daryl Anselmo (@darylanselmo) • Threads, Say more
threads.net
training permalink →
Technical
Symbolic.ai - Powering Publishing with AI
symbolic.ai
fact-checking permalink →
Technical
Podscribe
app.podscribe.ai
news permalink →
Technical
LangSmith
smith.langchain.com
agents github platform-policy infrastructure privacy permalink →
Technical
Langflow | Low-code AI builder for agentic and RAG applications
langflow.org
agents llms rag vector-search permalink →
Technical
EarthKit Agent - Google Slides
docs.google.com
agents doc permalink →
Technical
Why archive.org can't prove the authenticity of their snapshots - Jett's blog
blog.jettchen.me
provenance permalink →
Technical
Everyone knows your location
timsh.org
privacy permalink →
Technical
Emergency Communication Resources - AlertMedia
pyrratech.com
disinformation cybersecurity permalink →
Technical
Mistral OCR is nice and fast but other models outperform it on document processing. We did a comprehensive benchmark...
linkedin.com
agents llms evaluation permalink →
Technical
Mistral OCR | Mistral AI
mistral.ai
llms evaluation multi-modal india permalink →
Technical
Gentelella Admin Theme (Colorlib) - DJ Unicode hackathon inspiration
colorlib.com
design permalink →
Technical
Wiz Research Uncovers Exposed DeepSeek Database Leaking Sensitive Information, Including Chat History | Wiz Blog
wiz.io
llms permalink →
Technical
How to Backdoor Large Language Models - by Shrivu Shankar
blog.sshh.io
llms permalink →
Technical
Disrupting malicious uses of AI by state-affiliated threat actors | OpenAI
openai.com
safety cybersecurity permalink →
Technical
Countering Cognitive Warfare in the Digital Age – Information Professionals Association
information-professionals.org
llms rag tiktok disinformation permalink →
Technical
How to leverage chat and support logs for RAG | Max Buckley posted on the topic | LinkedIn
linkedin.com
llms training vector-search permalink →
Technical
The public domain, digital commons, and digital public goods (DPGs): How Wikimedia projects advance a positive vision...
diff.wikimedia.org
policy wikimedia permalink →
Technical
Notion
mimansajaiswal-embedded-dbs.notion.site
llms permalink →
Technical
Evolving Outline to Power Our Providers | by Jigsaw | Jigsaw | Medium
medium.com
agents voice permalink →
Technical
The 2025 AI Agent Index
aiagentindex.mit.edu
agents evaluation safety permalink →
Technical
Welcome to LM Studio Docs! | LM Studio
lmstudio.ai
llms coding-agents huggingface permalink →
Technical
O2 unveils Daisy, the AI granny wasting scammers’ time - Virgin Media O2
news.virginmediao2.co.uk
youtube permalink →
Technical
[RFC] LLM APIs for Ray Data and Ray Serve · Issue #50639 · ray-project/ray · GitHub
github.com
llms infrastructure github permalink →
Technical
GitHub - docling-project/docling: Get your documents ready for gen AI · GitHub
github.com
agents llms github permalink →
Technical
Transformer²: Self-Adaptive LLMs
sakana.ai
llms permalink →
Technical
Pandas vs. FireDucks Performance Comparison
dailydoseofds.com
evaluation permalink →
Technical
Dan McAteer on X: \"this is an amazing way to think about prompting o1 from @benhylak https://t.co/byVj8wHmUT\" / X
twitter.com
agents twitter github permalink →
Technical
@jagolinzer.bsky.social on Bluesky
bsky.app
bluesky facebook permalink →
Technical
Countering China’s Information Manipulation: A Toolkit for Understa
iri.org
geopolitics policy permalink →
Technical
Anton Osika on X: \"we launched publicly 8 days ago, hit $1M ARR today, and only took down one cloud provider along th...
twitter.com
twitter permalink →
Technical
China's AI models surpass global counterparts in diversity and diffusion | Stanford Institute for Human-Centered Arti...
linkedin.com
regulator permalink →
Technical
Using the DSA to Study Platforms
verfassungsblog.de
platform-policy regulator permalink →
Technical
DSpace
openyls.law.yale.edu
safety moderation regulator privacy policy permalink →
Technical
Unpacking deceptive design
publicpolicy.google
twitter facebook permalink →
Technical
PCIO Platform Interventions Codebook - Google Docs
docs.google.com
measurement doc permalink →
Technical
disinfodex.org
disinfodex.org
disinformation permalink →
Technical
AI's Power Requirements Under Exponential Growth: Extrapolating AI Data Center Power Demand and Assessing Its Potenti...
rand.org
training voice facebook permalink →
Research
“Community Guidelines Make this the Best Party on the Internet”: An In-Depth Study of Online Platforms’ Content Moder...
arxiv.org
twitter tiktok moderation regulator permalink →
Research
Friction-In-Design Regulation as 21st Century Time, Place, and Manner Restriction
papers.ssrn.com
fact-checking privacy research-paper permalink →
Research
How can we combat online misinformation? A systematic overview of current interventions and their efficacy
papers.ssrn.com
fact-checking privacy research-paper permalink →
Technical
Policy Implications of DeepSeek AI’s Talent Base | Stanford HAI
hai.stanford.edu
llms permalink →
Technical
The UAE’s Trump-Era AI Strategy | Lawfare
lawfaremedia.org
voice bluesky facebook permalink →
Technical
list of policy newsletters and sources to mine using LLMs for improving our newsletter!list of policy newsletters and...
media.licdn.com
llms permalink →
Technical
CAIDP Update 7.11 - AI Policy News (March 24, 2025) | Center for AI and Digital Policy
linkedin.com
platform-policy privacy geopolitics linkedin permalink →
Technical
We Need an Interventionist Mindset | TechPolicy.Press
techpolicy.press
regulator permalink →
Technical
About | Civil Rights Table
civilrightstable.org
facebook permalink →
Technical
Reports & Documents | WaTech
watech.wa.gov
cybersecurity permalink →
Technical
policy framework for AI4Science
static.googleusercontent.com
science policy permalink →
Technical
from UC Berkeley
cltc.berkeley.edu
rl policy permalink →
Technical
Artificial Analysis State of AI: China | Artificial Analysis
linkedin.com
evaluation permalink →
Technical
International AI Safety Report | Jonas Freund
linkedin.com
safety disinformation permalink →
Technical
youtube just announced they may do away with fact-checking
files.maldita.es
youtube fact-checking permalink →
Technical
What does the public think about AI? | Harry Law | 11 comments
linkedin.com
journalism permalink →
Research
https://avikrishna.substack.com/p/eliciting-frontier-model-character?selection=2ddd1e4b-84e7-4cea-bfd3-41f1dc13f9ea&u...
open.substack.com
llms permalink →
Research
Qalb: Largest State-of-the-Art Urdu Large Language Model for 230M Speakers with Systematic Continued Pre-training | T...
taimoor.xyz
llms evaluation training infrastructure permalink →
Research
Nearly a third of social-media research has undisclosed ties to industry, preprint claims
science.org
disinformation health-ai reddit facebook privacy funding research-paper permalink →
Research
Training AI Co-Scientists Using Rubric Rewards | alphaXiv
alphaxiv.org
training huggingface fact-checking datasets health-ai facebook science rl labor-markets research-paper permalink →
Research
Artificial intelligence tools expand scientists’ impact but contract science’s focus | Nature
nature.com
llms permalink →
Research
Research into how narratives spread across social media platforms with some case studies f
scholarspace.manoa.hawaii.edu
research-paper permalink →
Research
fact checking reduces engagement with false information
papers.ssrn.com
fact-checking permalink →
Research
Do reasoning models have real “Aha!” moments—mid-chain realizations where they intrinsically self-correct? In a new p...
linkedin.com
llms evaluation training permalink →
Research
[2510.24810] COMMUNITYNOTES: A Dataset for Exploring the Helpfulness of Fact-Checking Explanations
arxiv.org
tiktok huggingface fact-checking permalink →
Research
God of Prompt on X: \"R.I.P few-shot prompting. Meta AI researchers discovered a technique that makes LLMs 94% more ac...
x.com
agents llms twitter permalink →
Research
new deepseek paper on introducing geometric constraints when training, for less instabilit
arxiv.org
llms research-paper permalink →
Research
alex zhang on X: \"Much like the switch in 2025 from language models to reasoning models, we think 2026 will be all ab...
x.com
llms twitter permalink →
Research
How LLMs result in increased rich elements and targeting by newsroomsHow LLMs result in increased rich elements and t...
papers.ssrn.com
llms journalism permalink →
Research
NATO releases research/report into cognitive warfare
sto.nato.int
info-ops geopolitics permalink →
Research
AI use in American newspapers is widespread | Peter Slattery, PhD | 11 comments
linkedin.com
llms journalism fact-checking permalink →
Research
Shizhe Diao on X: \"✨Introducing ProfBench. LLM eval shouldn’t be limited to math/code/short QA. Real work is: read pr...
x.com
agents llms evaluation training permalink →
Research
Ethan Mollick on X: \"AI can help explain complex topics easily by throwing together a simulation. As Eric says later...
x.com
twitter permalink →
Research
Alex Prompter on X: \"This paper from Harvard and MIT quietly answers the most important AI question nobody benchmarks...
x.com
llms evaluation twitter permalink →
Research
Crémieux on X: \"The Science paper: https://t.co/km05CPqfcX (now viral online, unfortunately!) A welcome correction th...
x.com
twitter permalink →
Research
Recent discoveries on the acquisition of the highest levels of human performance
science.org
fact-checking privacy research-paper permalink →
Research
apparently a super important theory ML paper by lenka zdeborova and crewapparently a super important theory ML paper...
arxiv.org
rag research-paper permalink →
Research
Locke Cai on X: \"RL for reasoning often rely on verifiers — great for math, but tricky for creative writing or open-e...
x.com
llms twitter permalink →
Research
AI Chatbot’s are getting more relationship seeking but not more useful
arxiv.org
health-ai research-paper permalink →
Research
https://andreyfradkin.com/assets/LLM_Demand_12_12_2025.pdf
andreyfradkin.com
llms permalink →
Research
Andrey Fradkin on X: \"How much does intelligence cost? How concentrated is the AI market and is it winner take all? W...
x.com
twitter permalink →
Research
The Tip of the Iceberg: How the Social Media Production-Consumption Gap Distorts Public Opinion for Citizens and Researchers
osf.io
privacy social-psychology research-paper permalink →
Research
Smartphones and Social Media Fuel Polarization Since 2008 | Jay Van Bavel, PhD posted on the topic | LinkedIn
linkedin.com
health-ai permalink →
Research
ChatGPT does not replicate human moral judgments: the importance of examining metrics beyond correlation to assess ag...
nature.com
llms permalink →
Research
Short-form video platforms drive mobile usage
osf.io
tiktok facebook privacy emotion-detection research-paper permalink →
Research
[2508.08596] How Conversational Structure and Style Shape Online Community Experiences
arxiv.org
reddit huggingface permalink →
Research
using personas doesn't make an AI better at a task
papers.ssrn.com
fact-checking privacy research-paper permalink →
Research
View of Searching for Elected Officials: Google’s Prioritization of Political Information
journalqd.org
research-paper permalink →
Research
Rethinking news framing with large language models | Scientific Reports
nature.com
llms voice disinformation fact-checking permalink →
Data
Paper page - From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence
huggingface.co
llms evaluation training multi-modal permalink →
Research
Reranking partisan animosity in algorithmic social media feeds alters affective polarization
science.org
vector-search permalink →
Research
[2510.15951] Attention to Non-Adopters
arxiv.org
llms huggingface permalink →
Research
New technical report on mixture of experts style models
storage.googleapis.com
evaluation permalink →
Research
will brown on X: \"@dejavucoder plenty about it in the paper :) https://t.co/32O2NccA3D https://t.co/GnhAY4cJwu\" / X
x.com
twitter permalink →
Research
a really important paper to understand model trends
dataprovenance.org
platform-policy permalink →
Research
How Instacart is using LLMs for better e-commerce search | Yuanzheng (Ron) Zhu posted on the topic | LinkedIn
linkedin.com
llms safety training vector-search permalink →
Research
Understanding the impact of misinformation on adolescents | Nature Human Behaviour
nature.com
disinformation permalink →
Research
Randall Balestriero on X: \"LeJEPA: a novel pretraining paradigm free of the (many) heuristics we relied on (stop-grad...
x.com
training twitter github permalink →
Research
[2510.08831] Everyone prefers human writers, including AI
arxiv.org
huggingface permalink →
Research
Language models cannot reliably distinguish belief from knowledge and fact | Nature Machine Intelligence
nature.com
llms evaluation disinformation journalism permalink →
Research
LLMs destroy signals in marketplaces that separate high skill workers from low skill worke
jesse-silbert.github.io
llms permalink →
Research
[2502.09992] Large Language Diffusion Models
arxiv.org
llms evaluation training image-gen permalink →
Research
\"Learn Text-to-SQL: Resources for Practitioners\" | Aman Chadha posted on the topic | LinkedIn
linkedin.com
agents llms evaluation permalink →
Research
\"US-China scientific leadership shift: China's rise in global science\" | James Evans posted on the topic | LinkedIn
linkedin.com
evaluation geopolitics linkedin permalink →
Research
Smartphone use in adults and linked mental health and wellbeing concernsSmartphone use in adults and linked mental he...
pnas.org
health-ai research-paper permalink →
Research
Patterns of news sharing and engagement across seven different social platforms
pnas.org
disinformation twitter bluesky facebook social-psychology research-paper permalink →
Research
[2509.00446] NEWSAGENT: Benchmarking Multimodal Agents as Journalists with Real-World Newswriting Tasks
arxiv.org
agents llms evaluation multi-modal permalink →
Research
LinkedIn
lnkd.in
safety fact-checking election-integrity permalink →
Research
[2510.20171] Collective Communication for 100k+ GPUs
arxiv.org
llms huggingface permalink →
Research
[2510.15053] The Physics of News, Rumors, and Opinions
arxiv.org
huggingface disinformation permalink →
Research
[2508.18541] Uncovering Intervention Opportunities for Suicide Prevention with Language Model Assistants
arxiv.org
llms huggingface permalink →
Research
Are these the happiest PhD students in the world?
nature.com
agents reddit bluesky facebook permalink →
Research
What makes PhD students happy? Good supervision
nature.com
reddit bluesky facebook india permalink →
Research
Research into the value of online data to platforms
pubsonline.informs.org
platform-policy privacy funding geopolitics research-paper permalink →
Research
Imaging Time-Series to Improve Classification and Imputation
arxiv.org
datasets research-paper permalink →
Research
elvis on X: \"People are sleeping on Deep Agents. Start using them now. This is a fun paper showcasing how to put toge...
x.com
twitter permalink →
Research
Discourse Graphs | A Tool for Collaborative Knowledge Synthesis
discoursegraphs.com
llms permalink →
Research
a massive community notes dataset
arxiv.org
twitter tiktok moderation permalink →
Research
[2508.06445] Echoes of Automation: The Increasing Use of LLMs in Newsmaking
arxiv.org
llms huggingface permalink →
Research
[2510.12323] RAG-Anything: All-in-One RAG Framework
share.google
llms rag evaluation multi-modal permalink →
Research
[2510.09263] SynthID-Image: Image watermarking at internet scale
arxiv.org
evaluation huggingface permalink →
Research
the economics and geography of data centers
pubsonline.informs.org
fact-checking privacy research-paper permalink →
Research
Aparna Dhinakaran on X: \"We improved @cline, a popular open-source coding agent, by +15% accuracy on SWE-Bench — with...
x.com
llms evaluation coding-agents twitter permalink →
Research
Ideological fragmentation of the social media ecosystem: From echo chambers to echo platforms
academic.oup.com
research-paper permalink →
Research
Simulating Social Networks with Hybrid Methodology | Lynnette Ng posted on the topic | LinkedIn
linkedin.com
llms twitter disinformation permalink →
Research
Less is More: Recursive Reasoning with Tiny Networks | alphaXiv
alphaxiv.org
labor-markets research-paper permalink →
Research
𝚐𝔪𝟾𝚡𝚡𝟾 on X: \"MODEL: https://t.co/hgHmWfu9b1 RELEASE: https://t.co/i0a5UL8r5C\" / X
x.com
twitter huggingface permalink →
Research
Rohan Paul on X: \"This paper introduces a new method called Agentic Context Engineering (ACE). It helps language mode...
x.com
agents llms coding-agents twitter permalink →
Research
Rohan Paul on X: \"A 7B model, tuned for forms and docs, beats giant models at pulling structured data. Beats GPT-4.1...
x.com
llms twitter permalink →
Research
Rohan Paul on X: \"A beautiful paper from MIT+Harvard+ @GoogleDeepMind 👏 Explains why Transformers miss multi digit mu...
x.com
llms training twitter permalink →
Research
Excited to share our latest paper from Meta Superintelligence Lab examining the factors that drive reasoning performa...
linkedin.com
llms evaluation permalink →
Research
Sycophantic AI increases attitude extremity and overconfidence
osf.io
research-paper permalink →
Research
synthesizing comments with LLMs to aid community notes
arxiv.org
llms disinformation fact-checking permalink →
Research
The complexity of misinformation extends beyond virus and warfare analogies | npj Complexity
nature.com
disinformation permalink →
Research
Pricing | Pangram Labs
pangram.com
interpretability twitter moderation regulator permalink →
Research
[2402.14873] Technical Report on the Pangram AI-Generated Text Classifier
arxiv.org
llms evaluation huggingface permalink →
Research
Rohan Paul on X: \"BIG claim. Giving an LLM just 78 carefully chosen, full workflow examples makes it perform better a...
x.com
llms training twitter permalink →
Research
LoRA Without Regret - Thinking Machines Lab
thinkingmachines.ai
llms training infrastructure permalink →
Research
Sarah Cen on X: \"We ran a longitudinal study of LLMs during the 2024 US election 🗳️ We queried 12 models on a survey...
x.com
llms twitter permalink →
Data
openai/gdpval · Datasets at Hugging Face
huggingface.co
evaluation huggingface permalink →
Research
elvis on X: \"Federation of Agents This is a neat concept to convert static multi-agent coordination into dynamic capa...
x.com
agents twitter permalink →
Research
Ivan Zhou on X: \"Automated prompt optimization (GEPA) can push open-source models beyond frontier performance on ente...
x.com
evaluation training twitter permalink →
Research
Gabriele Berton on X: \"[paper release!] Did you know that you can - speed up any LLM by 4x - and reduce its memory fo...
x.com
llms twitter permalink →
Research
Community Notes help reduce the virality of false information on X, study finds – UW News
washington.edu
twitter youtube moderation disinformation permalink →
Research
Current Real-World Use of Large Language Models for Mental Health
osf.io
research-paper permalink →
Research
What remains after LLMs: technical knowledge moves from hubs to niches
papers.ssrn.com
llms permalink →
Research
Jackson Atkins on X: \"MIT and Microsoft just made AI 64x better at planning, achieving 94% accuracy. 💥 Their PDDL-INS...
x.com
llms twitter permalink →
Research
[2502.16487] All That Glitters is Not Novel: Plagiarism in AI Generated Research
arxiv.org
llms huggingface permalink →
Research
Chao Huang on X: \"Our team's AI-Researcher has been accepted by NeurIPS 2025 and selected as a Spotlight! 🌟 The proje...
x.com
llms twitter github permalink →
Research
X (link)
x.com
llms evaluation twitter permalink →
Research
Rohan Paul on X: \"LLM for financial trading/decision making. A 4B model financial-domain model, Trading-R1, that writ...
x.com
llms training twitter permalink →
Research
ingroup positivity drives engagement during crisis events
pnas.org
social-psychology research-paper permalink →
Research
The Digital Ethnography Collective Reading List - Google Docs
docs.google.com
doc permalink →
Research
should ai nudge you? how people pay attention to AI signals
papers.ssrn.com
research-paper permalink →
Research
communicating uncertainty can increase AI adoption
papers.ssrn.com
research-paper permalink →
Research
how to improve knowledge accumulation in the social sciences
federicaizzo.com
research-paper permalink →
Research
How to destroy your reputation??? By transparently disclosing your usage of AI… Whereas shaky studies without peer re...
linkedin.com
linkedin permalink →
Research
[2509.11391v1] \"My Boyfriend is AI\": A Computational Analysis of Human-AI Companionship in Reddit's AI Community
arxiv.org
llms reddit huggingface permalink →
Research
Brian Armstrong on X: \"x402 + @Google just unlocked a new level for AI agents. Agents can actually pay each other now...
x.com
agents twitter permalink →
Research
DeepDive: Advancing Long-Horizon Search Agents with Knowledge Graphs and Multi-Turn Reinforcement Learning
arxiv.org
llms evaluation github permalink →
Research
the cost of selling tiktok on the ads market
pnas.org
tiktok permalink →
Research
How People Use ChatGPT | NBER
nber.org
llms bluesky facebook permalink →
Research
[2506.11727] Forgetful by Design? A Critical Audit of YouTube's Search API for Academic Research
arxiv.org
youtube huggingface permalink →
Research
Turing Post on X: \"One of the most comprehensive Surveys of Reinforcement Learning for LRMs Covers: - LLMs ➝ LRMs via...
x.com
llms training multi-modal twitter permalink →
Research
Misha Teplitskiy | Science of Science on X: \"One of the craziest soc sci papers of all time: an email nudge generated...
x.com
twitter permalink →
Research
Arvindh Arun on X: \"Why does horizon length grow exponentially as shown in the METR plot? Our new paper investigates...
x.com
llms evaluation twitter permalink →
Research
the impact of LLM Adoption on online user behavior
papers.ssrn.com
llms permalink →
Research
interesting study to see the effects of criticism and pushback against public health narra
pnas.org
health-ai research-paper permalink →
Research
REAL Evals - Realistic Evaluations for Agents Leaderboard
realevals.xyz
agents llms evaluation github permalink →
Data
Paper page - NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings
huggingface.co
llms evaluation vector-search multi-modal permalink →
Research
Ravid Shwartz Ziv on X: \"The new OpenAI paper “Why Language Models Hallucinate” is more like PR than research. The cl...
x.com
llms twitter permalink →
Research
LLM hallucinations are compression artefacts, not bugs. We can predict them with EDFL. | Leon Chlon, PhD posted on th...
linkedin.com
llms permalink →
Research
Why language models hallucinate | OpenAI
openai.com
llms evaluation permalink →
Research
Domenico Ferraro on X: \"As more data come in, the contractionary impact of tariffs is becoming increasingly clear. Ou...
x.com
twitter permalink →
Research
[2507.10599] Emergence of Hierarchical Emotion Organization in Large Language Models
arxiv.org
llms huggingface permalink →
Research
How can you more effectively talk with your hands? (Research just conditionally accepted at JMR) People often move th...
linkedin.com
llms multi-modal permalink →
Research
[2508.08285] The Illusion of Progress: Re-evaluating Hallucination Detection in LLMs
arxiv.org
llms evaluation huggingface permalink →
Research
Social opinions prediction utilizes fusing dynamics equation with LLM-based agents | Scientific Reports
nature.com
agents llms permalink →
Research
How To Become A Mechanistic Interpretability Researcher — AI Alignment Forum
alignmentforum.org
llms interpretability permalink →
Research
[2507.00926] HyperFusion: Hierarchical Multimodal Ensemble Learning for Social Media Popularity Prediction
arxiv.org
vector-search multi-modal huggingface permalink →
Research
criticism of using AI simulations to infer causation in social network settingscriticism of using AI simulations to i...
science.org
research-paper permalink →
Research
Another paper about the effects of GenAI on reduction in hiring early stage folksAnother paper about the effects of G...
papers.ssrn.com
labor-markets research-paper permalink →
Research
This new DeepMind research shows just how broken vector search is. Turns out some docs in your index are theoreticall...
linkedin.com
vector-search permalink →
Research
Canaries in the Coal Mine? Six Facts about the Recent Employment Effects of Artificial Intelligence - Stanford Digita...
digitaleconomy.stanford.edu
india privacy labor-markets permalink →
Research
[2504.13279] Just Another Hour on TikTok: ID sampling to obtain a complete slice of TikTok
arxiv.org
tiktok huggingface permalink →
Research
Detecting Child Objectification on Social Media: Challenges in Language Modeling - ACL Anthology
aclanthology.org
child-safety research-paper permalink →
Research
ACM · 3706598.3713362
dl.acm.org
research-paper permalink →
Research
Muyu He on X: \"Our EMNLP main paper presents a fun but very challenging benchmark for LLMs: to solve over 300+ Ace At...
x.com
llms evaluation training twitter permalink →
Research
A Novel Multi-Document Retrieval Benchmark: Journalist Source-Selection in Newswriting - ACL Anthology
aclanthology.org
llms evaluation permalink →
Research
🚨🚨New paper, now out in PNAS! We know that outrage and negativity go viral online, but is this *always* the case? No:...
linkedin.com
facebook permalink →
Research
Towards Interactive Evaluations for Interaction Harms in Human-AI Systems | Knight First Amendment Institute
knightcolumbia.org
agents llms regulator cybersecurity permalink →
Research
[2507.19373] Changes to the Facebook Algorithm Decreased News Visibility Between 2021-2024
arxiv.org
facebook huggingface permalink →
Research
racial discrimination in the follow-back rates to phd student twitter accounts by academic
docs.iza.org
twitter permalink →
Research
Proceedings of the ICWSM Workshops
workshop-proceedings.icwsm.org
agents llms disinformation permalink →
Research
India's Cash Transfer Experiment: Boosting Nutrition and Development | Karthik Muralidharan posted on the topic | Lin...
linkedin.com
india permalink →
Research
[2504.18041] RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models
arxiv.org
llms rag safety training permalink →
Research
Jiayuan Zhu on X: \"🎉Happy to share that our paper \"Ask Patients with Patience (APP): Enabling LLMs for Human-Centric...
x.com
llms twitter health-ai permalink →
Research
[2505.16023] Prototypical Human-AI Collaboration Behaviors from LLM-Assisted Writing in the Wild
arxiv.org
llms coding-agents huggingface permalink →
Research
Data is infrastructure
degruyterbrill.com
rl permalink →
Research
[2311.09730] Sociodemographic Prompting is Not Yet an Effective Approach for Simulating Subjective Judgments with LLMs
arxiv.org
llms huggingface permalink →
Research
VerbaAI
verbaai.org
llms permalink →
Research
[2508.15763] Intern-S1: A Scientific Multimodal Foundation Model
arxiv.org
multi-modal permalink →
Research
[2506.08292] From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium
arxiv.org
agents llms evaluation huggingface permalink →
Research
Arnav Arora on X: \"🚨New pre-print 🚨 News often conveys different things in text vs. image. Recent work in comp. frami...
x.com
llms multi-modal twitter permalink →
Research
What we can learn from TikTok through its Research API
gdfm.me
tiktok permalink →
Research
Jiashuo Liu on X: \"We built FutureX, the world’s first live benchmark for real future prediction — politics, economy,...
x.com
agents llms evaluation twitter permalink →
Research
I've shared this earlier but once upon a time I built a reddit content classifier based on
ora.ox.ac.uk
reddit permalink →
Research
[2508.09809] A Comprehensive Review of Datasets for Clinical Mental Health AI Systems
arxiv.org
health-ai permalink →
Research
ml model \"genetics\" and a \"family tree\"
arxiv.org
vector-search huggingface permalink →
Research
[2503.17684] Can LLMs Automate Fact-Checking Article Writing?
arxiv.org
agents llms huggingface fact-checking permalink →
Research
👋 Jan on X: \"Introducing Jan-v1: 4B model for web search, an open-source alternative to Perplexity Pro. In our evals,...
x.com
llms evaluation infrastructure twitter permalink →
Research
[2506.06299] How malicious AI swarms can threaten democracy: The fusion of agentic AI and LLMs marks a new frontier i...
arxiv.org
agents llms disinformation permalink →
Research
[2507.02197] Do Role-Playing Agents Practice What They Preach? Belief-Behavior Consistency in LLM-Based Simulations o...
arxiv.org
llms permalink →
Research
liberals and conservatives share information differently on social medialiberals and conservatives share information...
academic.oup.com
research-paper permalink →
Research
An Vo on X: \"🚨 Our latest work shows that SOTA VLMs (o3, o4-mini, Sonnet, Gemini Pro) fail at counting legs due to bi...
x.com
llms twitter permalink →
Research
elvis on X: \"Tool-Augmented Unified Retrieval Agent for AI Search Nice paper showing how to effectively extend RAG to...
x.com
agents twitter permalink →
Research
Beyond Binary Rewards: RL for Calibrated LMs
rl-calibration.github.io
llms interpretability twitter permalink →
Research
estimation of emotion in the sharing of content on social media platformsestimation of emotion in the sharing of cont...
psycnet.apa.org
emotion-detection research-paper permalink →
Research
[2505.20201] Reasoning Is Not All You Need: Examining LLMs for Multi-Turn Mental Health Conversations
arxiv.org
llms huggingface permalink →
Research
Brendan Jowett on X: \"AI agents are taking off. But we may be building them the wrong way. A new paper from NVIDIA ar...
x.com
agents llms twitter youtube permalink →
Research
Labeled Dataset for sensitive topics (conflictual language, profanity, sexually explicit m
arxiv.org
evaluation datasets research-paper permalink →
Research
[2506.06347] Unified Game Moderation: Soft-Prompting and LLM-Assisted Label Transfer for Resource-Efficient Toxicity...
arxiv.org
llms evaluation huggingface permalink →
Research
[2312.12651] Toxic Bias: Perspective API Misreads German as More Toxic
arxiv.org
twitter huggingface moderation permalink →
Research
[2507.17636] Who Attacks, and Why? Using LLMs to Identify Negative Campaigning in 18M Tweets across 19 Countries
arxiv.org
llms evaluation twitter huggingface permalink →
Research
case studies for arbiter to be a social sensemaking tool
journals.sagepub.com
research-paper permalink →
Research
[2507.21206] Agentic Web: Weaving the Next Web with AI Agents
arxiv.org
agents llms huggingface permalink →
Research
[2507.06268v1] A Collectivist, Economic Perspective on AI
arxiv.org
huggingface permalink →
Research
Jackson Atkins on X: \"LLMs can now self-optimize. A new method allows an AI to rewrite its own prompts to achieve up...
x.com
llms training twitter permalink →
Research
[2506.21734] Hierarchical Reasoning Model
arxiv.org
llms evaluation huggingface permalink →
Research
Persona vectors: Monitoring and controlling character traits in language models \\ Anthropic
anthropic.com
llms interpretability permalink →
Research
LinkedIn
lnkd.in
safety permalink →
Research
LinkedIn
lnkd.in
safety permalink →
Research
EuroCon: Benchmarking Parliament Deliberation for Political Consens
zowiezhang.github.io
evaluation permalink →
Research
HateDay: Insights from a Global Hate Speech Dataset Representative of a Day on Twitter - ACL Anthology
aclanthology.org
twitter permalink →
Research
[2507.16045] Chameleon Channels: Measuring YouTube Accounts Repurposed for Deception and Profit
arxiv.org
youtube cib permalink →
Research
A Social Dynamical System for Twitter Analysis
arxiv.org
evaluation datasets twitter research-paper permalink →
Research
Huge congratulations to the brilliant minds behind this groundbreaking work which won ACL outstanding paper award! Ca...
linkedin.com
twitter india permalink →
Research
Introducing GSPO: A New RL Algorithm for LLMs | Alex Shan posted on the topic | LinkedIn
linkedin.com
llms evaluation training infrastructure permalink →
Research
what happens to academics post tenure, interesting work
pnas.org
research-paper permalink →
Research
neat recent work on improving the generation of research reports by Googleneat recent work on improving the generatio...
arxiv.org
llms agents research-paper permalink →
Research
[2407.12034] Understanding Transformers via N-gram Statistics
arxiv.org
llms huggingface permalink →
Research
When Claims Evolve: Evaluating and Enhancing the Robustness of Embedding Models Against Misinformation Edits - ACL An...
aclanthology.org
llms vector-search disinformation fact-checking permalink →
Research
What Does Consulting Do? | NBER
nber.org
bluesky facebook permalink →
Research
AlphaGo Moment for Model Architecture Discovery | alphaXiv
alphaxiv.org
labor-markets research-paper permalink →
Research
Rohan Paul on X: \"The paper builds a small simulated economy with 100 language‑model “workers” and one language‑model...
x.com
agents llms twitter mechanism-design permalink →
Research
#research #causal #ml | Ciarán M. Gilligan-Lee
linkedin.com
safety permalink →
Research
Michael R. Bock on X: \"1/ Can AI file your taxes? Not yet. We tested the latest frontier models and the results were...
x.com
twitter permalink →
Research
[2507.07931] Meek Models Shall Inherit the Earth
arxiv.org
evaluation huggingface permalink →
Research
Feature-based reward learning shapes human social learning strategies | Nature Human Behaviour
nature.com
agents research-paper permalink →
Research
Karan Singhal on X: \"📣 Excited to share our real-world study of an LLM clinical copilot, a collab between @OpenAI and...
x.com
llms coding-agents twitter health-ai permalink →
Research
[2507.13919] The Levers of Political Persuasion with Conversational AI
arxiv.org
llms training voice huggingface permalink →
Research
[2505.11711] Reinforcement Learning Finetunes Small Subnetworks in Large Language Models
arxiv.org
llms huggingface permalink →
Research
UniverseTBD on X: \"📢 New dataset out! We introduce HypoGen💥, a dataset of ~5.5K structured problem–hypothesis pairs (...
x.com
llms twitter permalink →
Research
[2410.02724] Large Language Models as Markov Chains
arxiv.org
llms huggingface permalink →
Research
[2504.11169] MuSeD: A Multimodal Spanish Dataset for Sexism Detection in Social Media Videos
arxiv.org
llms multi-modal tiktok permalink →
Research
Attributing news creation to AI doesn’t really reduce the perceived value and ideological
journals.sagepub.com
research-paper permalink →
Research
Zhu Jian-Qiao on X: \"Centaur may have learned a shortcut that explains away psychological tasks. @PsychBoyH Link to p...
x.com
llms twitter permalink →
Research
Alex Imas on X: \"🚨New paper (link in reply)🚨 Are we underestimating AI use in self-report surveys? YES, by as much as...
x.com
twitter permalink →
Research
Frontiers: Generative AI and Personalized Video Advertisements
pubsonline.informs.org
research-paper permalink →
Research
more research about how using LLMs can harm learning among students at PNAS this timemore research about how using LL...
pnas.org
llms permalink →
Research
New prosocial design interventions paper is out
doi.org
social-psychology research-paper permalink →
Research
JSTOR paper 2118400
jstor.org
research-paper permalink →
Research
Sukjun (June) Hwang on X: \"Tokenization has been the final barrier to truly end-to-end language models. We developed...
x.com
llms twitter permalink →
Research
A large-scale replication of scenario-based experiments in psychology and management using large language models | Na...
nature.com
llms permalink →
Research
the governance and behavioral challenges from personalizable AI
doi.org
research-paper permalink →
Research
[2507.03041] Optimas: Optimizing Compound AI Systems with Globally Aligned Local Rewards
arxiv.org
llms huggingface permalink →
Research
[2507.04545] Measuring Social Media Network Effects
arxiv.org
facebook huggingface permalink →
Research
Interesting work by Anthropic on self labeling by LLMs that we should read for the benchma
arxiv.org
llms permalink →
Research
Marcel Binz on X: \"Excited to see our Centaur project out in @Nature. TL;DR: Centaur is a computational model that pr...
x.com
llms twitter permalink →
Research
[2403.03744] MedSafetyBench: Evaluating and Improving the Medical Safety of Large Language Models
arxiv.org
llms evaluation training huggingface permalink →
Research
View of Changes in YouTube's Content Moderation Policy Had Little Detectable Impact on Election Denial Content
ojs.aaai.org
youtube moderation permalink →
Research
Beyond Semantics: Unreasonable Effectiveness of Reasonless Intermediate Tokens | Hacker News
news.ycombinator.com
llms facebook github permalink →
Research
[2502.05967] $μ$nit Scaling: Simple and Scalable FP8 LLM Training
arxiv.org
llms huggingface permalink →
Research
Tracing the thoughts of a large language model \\ Anthropic
anthropic.com
llms interpretability permalink →
Research
The Youth Vote in 2024 | CIRCLE
circle.tufts.edu
election-integrity permalink →
Research
Chain-of-Thought Is Not Explainability | alphaXiv
alphaxiv.org
llms permalink →
Research
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning | Nathan Lambert | 14 comments
linkedin.com
llms training permalink →
Research
[2506.21718] Performance Prediction for Large Systems via Text-to-Text Regression
arxiv.org
huggingface permalink →
Research
interesting piece on why privacy regulation can solve the disinformation probleminteresting piece on why privacy regu...
papers.ssrn.com
disinformation permalink →
Research
Clint Jarvis on X: \"Stanford paid 35,000 people to quit social media. This was the largest study on emotional health...
x.com
twitter permalink →
Research
[2506.17729] Efficient Difference-in-Differences and Event Study Estimators
arxiv.org
huggingface permalink →
Research
[2506.18167] Understanding Reasoning in Thinking Language Models via Steering Vectors
arxiv.org
llms huggingface permalink →
Research
Following news on social media boosts knowledge, belief accuracy and trust | Nature Human Behaviour
nature.com
disinformation fact-checking permalink →
Research
GitHub - google-deepmind/videoprism: Official repository for \"VideoPrism: A Foundational Visual Encoder for Video Und...
github.com
evaluation training github huggingface permalink →
Research
conway on X: \"latest moondream model is actually beating gpt-4o in several cases I've tested https://t.co/L09T015fvv\"...
x.com
twitter permalink →
Research
Updesh
aikosh.indiaai.gov.in
llms training coding-agents india permalink →
Research
What is LLooM? | LLooM
stanfordhci.github.io
github moderation permalink →
Research
Ryan Marten on X: \"Announcing OpenThinker3-7B, the new SOTA open-data 7B reasoning model: improving over DeepSeek-R1-...
x.com
evaluation twitter permalink →
Research
Modulate | Frontier voice AI company
modulate.ai
agents llms evaluation safety permalink →
Research
[2506.07667] Silencing Empowerment, Allowing Bigotry: Auditing the Moderation of Hate Speech on Twitch
arxiv.org
huggingface moderation permalink →
Research
Brian Christian on X: \"CREDITS: This work was done with @hannahrosekirk, @tsonj, @summerfieldlab, and Tsvetomira Dumb...
x.com
twitter permalink →
Research
BART: A Standard Tool for Data Science | Richard Hahn posted on the topic | LinkedIn
linkedin.com
causal-inference linkedin permalink →
Research
[2506.12349] Information Suppression in Large Language Models: Auditing, Quantifying, and Characterizing Censorship i...
arxiv.org
llms huggingface moderation permalink →
Research
addressing bias in financial decisionmaking by LLMs through representation engineeringaddressing bias in financial de...
papers.ssrn.com
llms permalink →
Research
How we built our multi-agent research system \\ Anthropic
anthropic.com
agents llms permalink →
Research
[2506.14295] The Impact of Generative AI on Social Media: An Experimental Study
arxiv.org
huggingface permalink →
News
The Impact of Generative AI on Critical Thinking: Self-Reported Reductions in Cognitive Effort and Confidence Effects...
microsoft.com
coding-agents youtube reddit facebook permalink →
Research
[2506.08872] Your Brain on ChatGPT: Accumulation of Cognitive Debt when Using an AI Assistant for Essay Writing Task
arxiv.org
llms permalink →
Research
Primer: What We Know About Effective Misinformation Interventions
prosocialdesign.org
twitter facebook disinformation permalink →
Research
worth reading this paper about \"how many days did you practice music\" as a treatment affec
rss.onlinelibrary.wiley.com
research-paper permalink →
Research
GitHub - MCKnaus/dmlmt: Double Machine Learning for Multiple Treatments · GitHub
github.com
github permalink →
Research
OII | New study finds Republicans flagged for posting misleading tweets twice as often as Democrats on X/Twitter’s Co...
oii.ox.ac.uk
twitter disinformation fact-checking permalink →
Research
Real People Don’t Use UTM Codes. UTM codes are a great way to track the… | by Felipe Hoffa | The Startup | Medium
medium.com
twitter reddit facebook permalink →
Research
Toward Informal Language Processing: Knowledge of Slang in Large Language Models
arxiv.org
llms evaluation github permalink →
Research
new programming benchmark showcasing that LLMs do really poorly on programming benchmarksnew programming benchmark sh...
arxiv.org
llms evaluation permalink →
Research
This is wild. 🤯 Apple drops a paper saying AI \"reasoning\" is just fancy pattern-matching—models flop on stuff like To...
linkedin.com
llms permalink →
Research
AIM2025
sites.google.com
agents llms coding-agents permalink →
Research
[2504.06435] Human Trust in AI Search: A Large-Scale Experiment
arxiv.org
llms huggingface permalink →
Research
[2505.23802] MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks
arxiv.org
llms evaluation health-ai permalink →
Research
The “multiple exposure effect” (MEE): How multiple exposures to similarly biased online content can cause increasingl...
journals.plos.org
twitter permalink →
Research
Voyager: An Open-Ended Embodied Agent with Large Language Models
arxiv.org
llms agents training fact-checking research-paper permalink →
Research
TabM: Advancing Tabular Deep Learning with Parameter-Efficient Ensembling
arxiv.org
rag evaluation research-paper permalink →
Research
LLM Evaluation research (look at the references)
arxiv.org
llms permalink →
Research
[2506.08945] Who is using AI to code? Global diffusion and impact of generative AI
arxiv.org
huggingface permalink →
Research
The Directory for Liquid ContentA scalable and modular taxonomy designed to map, describe, and standardize how digita...
liquidcontent.xyz
journalism permalink →
Research
How to inoculate AI models against misinformation | Sander van der Linden posted on the topic | LinkedIn
linkedin.com
agents llms training multi-modal permalink →
Research
Beyond Semantics: The Unreasonable Effectiveness of Reasonless Intermediate Tokens | Dagmar Monett | 121 comments
linkedin.com
linkedin permalink →
Research
World of Labour - Machine learning for causal inference in economics
wol.iza.org
privacy causal-inference permalink →
Research
[2504.02234] LLM Social Simulations Are a Promising Research Method
arxiv.org
llms training huggingface permalink →
Research
ai models for scientific reasoning and potentially discovery
storage.googleapis.com
research-paper permalink →
Research
How Malicious AI Swarms Can Threaten Democracy
osf.io
llms permalink →
Research
Nikhil Garg on X: \"Interesting new paper: https://t.co/jbDdacS6q1\" / X
x.com
twitter bluesky permalink →
Research
the future of machine learning will come from new age RL methods using environmental feedb
storage.googleapis.com
rl research-paper permalink →
Research
Sahar Abdelnabi 🕊 on X: \"Hawthorne effect describes how study participants modify their behavior if they know they ar...
x.com
llms safety twitter permalink →
Research
[2505.14617] The Hawthorne Effect in Reasoning Models: Evaluating and Steering Test Awareness
arxiv.org
llms huggingface permalink →
Research
Zochi Publishes A* Paper
intology.ai
llms safety permalink →
Research
Anne Ouyang on X: \"✨ New blog post 👀: We have some very fast AI-generated kernels generated with a simple test-time o...
x.com
twitter permalink →
Research
@here why isn't a bigger deal being made about this research does anyone know?@here why isn't a bigger deal being mad...
arxiv.org
llms permalink →
Research
ValuesML: A new Multilingual Dataset for Values Detection in News and Political Manifestos
osf.io
research-paper permalink →
Research
Algorithms for reliable decision-making need causal reasoning | Nature Computational Science
nature.com
evaluation causal-inference research-paper permalink →
Research
Paper2Poster
paper2poster.github.io
agents llms evaluation multi-modal permalink →
Research
Academic Library | Indicator
indicator.media
privacy permalink →
Research
research into modeling the half-life of a tweet based on some select empirical data and co
pnas.org
twitter permalink →
Research
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures | alphaXiv
alphaxiv.org
llms labor-markets research-paper permalink →
Research
Static network structure cannot stabilize cooperation among large language model agents | PLOS One
journals.plos.org
llms permalink →
Research
d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning | alphaXiv
alphaxiv.org
llms permalink →
Research
[2505.13775] Beyond Semantics: The Unreasonable Effectiveness of Reasonless Intermediate Tokens
arxiv.org
llms training permalink →
Research
[2505.13995] ELEPHANT: Measuring and understanding social sycophancy in LLMs
arxiv.org
llms evaluation reddit permalink →
Research
Rishi Jha on X: \"I’m stoked to share our new paper: “Harnessing the Universal Geometry of Embeddings” with @jxmnop, C...
x.com
vector-search twitter permalink →
Research
Tanishq Mathew Abraham, Ph.D. on X: \"Model Merging in Pre-training of Large Language Models \"We present the Pre-train...
x.com
llms twitter permalink →
Research
[2505.12546] Extracting memorized pieces of (copyrighted) books from open-weight language models
arxiv.org
llms permalink →
Research
Robert W Malone, MD on X: \"This new peer-reviewed study shows that living close to a golf course significantly increa...
twitter.com
twitter permalink →
Research
[2402.04607] Google Scholar is manipulatable
arxiv.org
huggingface permalink →
Research
[2411.13187] Engagement-Driven Content Generation with Large Language Models
arxiv.org
llms huggingface permalink →
Research
Very very very fast counting within a certain accuracy that powers a lot of industry infra
algo.inria.fr
algorithms research-paper permalink →
Research
Mapping the Institutional Pipeline for Global AI Talent | NBER
nber.org
bluesky facebook permalink →
Research
[2503.16527] LLM Generated Persona is a Promise with a Catch
arxiv.org
llms huggingface permalink →
Research
Twitter (link)
twitter.com
twitter permalink →
Research
AI in Software Engineering at Facebook | IEEE Journals & Magazine | IEEE Xplore
ieeexplore.ieee.org
facebook permalink →
Research
[2503.16586] Big Help or Big Brother? Auditing Tracking, Profiling, and Personalization in Generative AI Assistants
arxiv.org
agents permalink →
Research
ModelSlant.com
modelslant.com
llms permalink →
Research
MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control
arxiv.org
agents llms evaluation github permalink →
Research
Neel Nanda on X: \"After supervising 20+ papers, I have highly opinionated views on writing great ML papers. When I en...
twitter.com
interpretability twitter permalink →
Research
Coordinated link sharing on Facebook | Scientific Reports
doi.org
twitter facebook disinformation permalink →
Research
[2504.17004] (Im)possibility of Automated Hallucination Detection in Large Language Models
arxiv.org
llms training huggingface permalink →
Research
Detecting Synthetic, Doubting Authentic: AI Attribution Bias for Political Imagery
osf.io
research-paper permalink →
Research
The Effect of Deactivating Facebook and Instagram on Users’ Emotional State | NBER
nber.org
bluesky facebook permalink →
Research
[2504.20879] The Leaderboard Illusion
arxiv.org
llms evaluation permalink →
Research
[2502.02943] Behavioral Homophily in Social Media via Inverse Reinforcement Learning: A Reddit Case Study
arxiv.org
reddit huggingface permalink →
Research
[2502.07266] When More is Less: Understanding Chain-of-Thought Length in LLMs
arxiv.org
llms huggingface permalink →
Research
content labeling and community notes research
papers.ssrn.com
fact-checking datasets research-paper permalink →
Research
Could an AI Agent Become One of Your Coworkers? | by MIT IDE | MIT Initiative on the Digital Economy | Medium
medium.com
agents multi-modal permalink →
Research
Propensity Score Matching: A Guide to Causal Inference | Built In
builtin.com
causal-inference tutorial permalink →
Research
multiple period of treatment in DiD approaches
sciencedirect.com
research-paper permalink →
Research
#misinformation #research #politicalcommunication #datascience #digitalethics #eupolicy #digitalservicesact | Anton G...
linkedin.com
llms twitter tiktok disinformation permalink →
Research
[2504.13859] DoYouTrustAI: A Tool to Teach Students About AI Misinformation and Prompt Engineering
arxiv.org
llms disinformation permalink →
Research
Collaborating with AI Agents | Bugge Holm Hansen | 21 comments
linkedin.com
agents permalink →
Research
World of Labour - Does increasing the minimum wage reduce poverty in developing countries?
wol.iza.org
safety privacy policy emotion-detection labor-markets permalink →
Research
Andrew Gordon Wilson on X: \"Really excited about our new paper! It derives a generalization bound that predictably ge...
twitter.com
llms twitter permalink →
Research
Mind the (Language) Gap: Mapping the Challenges of LLM Development in Low-Resource Language Contexts | Stanford HAI
hai.stanford.edu
llms global-south permalink →
Research
https://andreyfradkin.com/assets/demandforllm.pdf
andreyfradkin.com
llms permalink →
Research
\"Mi Abogado: A Study on Foster Care and Legal Aid\" | Experimental posted on the topic | LinkedIn
linkedin.com
evaluation policy child-safety causal-inference linkedin permalink →
Research
Making AI-generated code more accurate in any language | MIT News | Massachusetts Institute of Technology
news.mit.edu
llms voice permalink →
Research
elvis on X: \"AgentA/B is a fully automated A/B testing framework that replaces live human traffic with large-scale LL...
twitter.com
llms twitter permalink →
Research
[2504.10157] SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-Worl...
arxiv.org
llms huggingface permalink →
Research
genAI spurs passive engagement not active ones
papers.ssrn.com
research-paper permalink →
Research
9 Yee Whye Teh - YouTube
youtube.com
youtube permalink →
Research
[2503.24322] NoProp: Training Neural Networks without Full Back-propagation or Full Forward-propagation
arxiv.org
evaluation permalink →
Research
Genglin Liu on X: \"Excited to share my first project at UCLA! We built MOSAIC — a social network simulator where LLM-...
twitter.com
llms twitter moderation permalink →
Research
LLMs can label data better than humans it turns out — for specific use casesLLMs can label data better than humans it...
journals.sagepub.com
llms permalink →
Research
Digital media – A threat to democracy? | Max-Planck-Gesellschaft
mpg.de
disinformation permalink →
Research
[2406.04236] Understanding Information Storage and Transfer in Multi-modal Large Language Models
arxiv.org
llms multi-modal permalink →
Research
Invisible Labor: The Backbone of Open Source Software
arxiv.org
open-source labor-markets research-paper permalink →
Research
Kiran Garimella on X: \"Community-based fact-checking effectiveness relies heavily on sourcing. This paper shows that...
twitter.com
twitter fact-checking permalink →
Research
Alexander Doria on X: \"A contrarian result I like a lot: smaller language models perform better on knowledge graphs t...
twitter.com
llms twitter permalink →
Research
[2501.19393] s1: Simple test-time scaling
arxiv.org
llms huggingface permalink →
Research
How AI outperforms humans in decision-making | Abel Sanchez posted on the topic | LinkedIn
linkedin.com
labor-markets linkedin permalink →
Research
Tanishq Mathew Abraham, Ph.D. on X: \"Perception Encoder: The best visual embeddings are not at the output of the netw...
twitter.com
vector-search multi-modal twitter permalink →
Research
Jillian Fisher on X: \"How do biased AI models effect human decision-making? 🤔 Our latest paper, “Biased AI can Influe...
twitter.com
twitter permalink →
Research
Abeer Aldayel (@Aldayelabeer@sciencemastodon.com) on X: \"📢**Persuasion takes different modes!** Instead of just askin...
twitter.com
llms twitter permalink →
Research
Nick Byrd, Ph.D. on X: \"Is #socialMedia bad for society? In two countries, following a couple mainstream #news accoun...
twitter.com
twitter youtube permalink →
Research
Social Media, Ethics, and Automation — Social Media, Ethics, and Automation
social-media-ethics-automation.github.io
twitter reddit bluesky disinformation permalink →
Research
GitHub - yuxiaw/OpenFactCheck · GitHub
github.com
llms evaluation github fact-checking permalink →
Research
re-sharing to confirm whether read this? truth social and news shari
tandfonline.com
research-paper permalink →
Research
Bots of a Feather: Mixing Biases in LLMs’ Opinion Dynamics | Springer Nature Link
link.springer.com
llms vector-search github permalink →
Research
Navigating the uncertainty: the impact of a student-centered final year project allocation mechanism on student perfo...
nature.com
research-paper permalink →
Research
Large Language Models: A Survey with Applications in Political Science
osf.io
research-paper permalink →
Research
[2503.02080] Linear Representations of Political Perspective Emerge in Large Language Models
arxiv.org
llms interpretability permalink →
Research
TAIS RFP: Research Areas | Coefficient Giving
openphilanthropy.org
llms safety permalink →
Research
Sign In | alphaXiv
alphaxiv.org
labor-markets research-paper permalink →
Research
[2504.03767] MCP Safety Audit: LLMs with the Model Context Protocol Allow Major Security Exploits
arxiv.org
agents llms permalink →
Research
TelegramScrap: A comprehensive tool for scraping Telegram data
arxiv.org
telegram huggingface disinformation permalink →
Research
Participants listen to AI advice
sciencedirect.com
research-paper permalink →
Research
mdpi.com
mdpi.com
research-paper permalink →
Research
Wild stuff >20sec videos… up to 1 min with a 5B model????
arxiv.org
evaluation datasets research-paper permalink →
Research
Now we have a misinformation test for people!
sciencedirect.com
disinformation permalink →
Research
How OpenAI's GPT-4 generates images with Transfusion | Max Buckley posted on the topic | LinkedIn
linkedin.com
llms multi-modal image-gen permalink →
Research
Better Feeds: Algorithms That Put People First – Knight-Georgetown Institute
kgi.georgetown.edu
safety regulator policy permalink →
Research
[1908.08313] Auditing Radicalization Pathways on YouTube
arxiv.org
youtube huggingface permalink →
Research
Elicit: AI for scientific research
elicit.com
health-ai permalink →
Research
Perplexity
perplexity.ai
cib permalink →
Research
A global comparison of social media bot and human characteristics | Scientific Reports
nature.com
twitter disinformation cybersecurity permalink →
Research
very much worth reading the way they manually research influence operations as we are seek
jns.scholar.princeton.edu
cib permalink →
Research
[2502.16280] Human Preferences in Large Language Model Latent Space: A Technical Analysis on the Reliability of Synth...
arxiv.org
llms huggingface permalink →
Research
[2503.21934] Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad
arxiv.org
llms evaluation huggingface permalink →
Research
How a PhD student’s lab size affects their chance of future academic success
nature.com
agents reddit bluesky facebook permalink →
Research
Introducing Ai2 Paper Finder | Ai2
allenai.org
agents llms permalink →
Research
foreign interference tactics specifically from a media and comms standpoint that would be
eeas.europa.eu
rl geopolitics permalink →
Research
dr. jack morris on X: \"# A new type of information theory this paper is not super well-known but has changed my opini...
twitter.com
llms training vector-search twitter permalink →
Research
AgentRxiv
agentrxiv.github.io
llms evaluation permalink →
Research
AI Agents, Digital Twins, and the New Way to Manage Operations
business.columbia.edu
agents permalink →
Research
The Cybernetic Teammate - by Ethan Mollick
oneusefulthing.org
llms causal-inference permalink →
Research
Chatbots as social companions
academic.oup.com
research-paper permalink →
Research
Our paper we can learn a lot about effective framing from what they’ve done in their workOur paper we can learn a lot...
dl.acm.org
research-paper permalink →
Research
[2503.05336] Toward an Evaluation Science for Generative AI Systems
arxiv.org
evaluation huggingface permalink →
Research
GitHub - internetarchive/newsum: Daily TV News Summary using GPT · GitHub
github.com
github permalink →
Research
Inductive reasoning in minds and machines - PubMedInduction-the ability to generalize from existing knowledge-is the...
pubmed.ncbi.nlm.nih.gov
research-paper permalink →
Research
[2503.02886] Exploring Political Ads on News and Media Websites During the 2024 U.S. Elections
arxiv.org
huggingface permalink →
Research
When Incentives Backfire, Data Stops Being Human
arxiv.org
research-paper permalink →
Research
arXiv · 2501.11433
arxiv.org
rl research-paper permalink →
Research
Culturally Yours | Understanding cultural references in text
mbzuai.ac.ae
llms youtube permalink →
Research
[2501.09102] Tracking the Takes and Trajectories of English-Language News Narratives across Trustworthy and Worrisome...
arxiv.org
llms huggingface disinformation fact-checking permalink →
Research
[2503.01532] Unmasking Implicit Bias: Evaluating Persona-Prompted LLM Responses in Power-Disparate Social Scenarios
arxiv.org
llms huggingface permalink →
Research
Detecting misbehavior in frontier reasoning models | OpenAI
openai.com
agents llms permalink →
Research
Decentralized Society: Finding Web3's Soul by Puja Ohlhaver, E. Glen Weyl, Vitalik Buterin
papers.ssrn.com
research-paper permalink →
Research
Red teaming ChatGPT in medicine to yield real-world insights on model behavior | npj Digital Medicine
nature.com
safety permalink →
Research
[2503.02250] AI Automatons: AI Systems Intended to Imitate Humans
arxiv.org
huggingface permalink →
Research
Optimizing language models for human preferences should be viewed as a causal problemOptimizing language models for h...
arxiv.org
llms permalink →
Research
UCSD SMS Analytics Research Project
sms-analytics.sysnet.ucsd.edu
llms trust-and-safety permalink →
Research
Having an advisor with a strong publication record + past students who have been successfu
nber.org
research-paper permalink →
Research
fbarchive.org
fbarchive.org
osint permalink →
Research
Why Economists Should Conduct Field Experiments and 14 Tips for Pulling One Off
ideas.repec.org
privacy permalink →
Research
[2410.23506] The Belief State Transformer
arxiv.org
huggingface permalink →
Research
[2411.10109] LLM Agents Grounded in Self-Reports Enable General-Purpose Simulation of Individuals
arxiv.org
llms huggingface permalink →
Research
well structured paper conveying how to write clearly
questromworld.bu.edu
research-paper permalink →
Research
Data & Society — Data Voids
datasociety.net
llms disinformation permalink →
Research
Chain of Agents: Large language models collaborating on long-context tasks
research.google
agents llms rag training permalink →
Research
[2502.11264] Strategic Wealth Accumulation Under Transformative AI Expectations
arxiv.org
huggingface permalink →
Research
The Economist: Generative AI and inequality | Rem Koning posted on the topic | LinkedIn
linkedin.com
youtube permalink →
Research
[2502.14143] Multi-Agent Risks from Advanced AI
arxiv.org
agents llms permalink →
Research
factors that cause people to believe in misinformation
pnas.org
disinformation permalink →
Research
Breakdown of the foundations of GenAI, applications, and lessons to learn about governance
papers.ssrn.com
research-paper permalink →
Research
Engagement-based algorithms disrupt human social norm learning
osf.io
research-paper permalink →
Research
GitHub - om-ai-lab/VLM-R1: Solve Visual Understanding with Reinforced VLMs · GitHub
github.com
llms evaluation training infrastructure permalink →
Research
FUTURE-AI: international consensus guideline for trustworthy and de
bmj.com
research-paper permalink →
Research
Safer Internet Day 2025: Staying Ahead and keeping the ecosystem safe
blog.google
twitter youtube facebook trust-and-safety permalink →
Research
arXiv · 2502.06807
arxiv.org
rl research-paper permalink →
Research
[2412.17847] Bridging the Data Provenance Gap Across Text, Speech and Video
arxiv.org
multi-modal youtube permalink →
Research
Osf (link)
osf.io
moderation permalink →
Research
tracker?hashtag=%23YCPAntham WhatsApp trend tracker by Princeton’s digital wellness labtracker?hashtag=%23YCPAntham W...
digitalwitnesslab.org
facebook permalink →
Research
[2502.00873] Language Models Use Trigonometry to Do Addition
arxiv.org
llms huggingface permalink →
Research
[2501.18649] Fake News Detection After LLM Laundering: Measurement and Explanation
arxiv.org
llms huggingface disinformation permalink →
Research
[2501.18438] o3-mini vs DeepSeek-R1: Which One is Safer?
arxiv.org
llms huggingface permalink →
Research
Been Kim on X: \"@karpathy We taught some superhuman chess moves from AlphaZero to Grandmasters some time ago (https:/...
twitter.com
twitter permalink →
Research
#mbzuai #llm #llm360 #ai | MBZUAI (Mohamed bin Zayed University of Artificial Intelligence)
linkedin.com
llms evaluation multi-modal huggingface permalink →
Research
Jiayi Pan on X: \"We reproduced DeepSeek R1-Zero in the CountDown game, and it just works Through RL, the 3B base LM d...
twitter.com
twitter github permalink →
Research
🤖 How do chatbots respond to political questions? Large language models (LLMs) are reshaping our information environm...
linkedin.com
llms huggingface permalink →
Research
Junxian He on X: \"We replicated the DeepSeek-R1-Zero and DeepSeek-R1 training on 7B model with only 8K examples, the...
twitter.com
twitter github permalink →
Research
A lot of TikTok influencers are shifting to xiaohongshu or RED the Chinese platform in an
tandfonline.com
tiktok permalink →
Research
Introducing “DomainDemo: a dataset of domain-sharing activities among different demographic groups on Twitter.” Today...
linkedin.com
twitter permalink →
Research
4.5 Million (Suspected) Fake \\faStar Stars in GitHub: A Growing Spiral of Popularity Contests, Scams, and Malware
arxiv.org
github disinformation permalink →
Research
Artificial Societies
societies.io
evaluation simulation permalink →
Research
We made an AI simulation of 1000 real VC investors and how they interact with each other on LinkedIn. Why simulate a...
linkedin.com
simulation funding linkedin permalink →
Research
joyojeet pal on X: \"Our latest work on politicians engaging YouTubers for outreach Summary: 1. Influencers routinely...
twitter.com
twitter youtube permalink →
Research
[1004.4704] Homophily and Contagion Are Generically Confounded in Observational Social Network Studies
arxiv.org
huggingface permalink →
Research
GenAI can harm learning
papers.ssrn.com
research-paper permalink →
Research
someone built nice multiagent simulators
oasis.camel-ai.org
agents permalink →
Research
[2407.00215] LLM Critics Help Catch LLM Bugs
arxiv.org
llms training huggingface permalink →
Research
Human study on AI spear phishing campaigns — LessWrong
lesswrong.com
agents llms safety osint permalink →
Research
Evaluating the effect of viral posts on social media engagement | Scientific Reports
nature.com
evaluation youtube facebook disinformation permalink →
Data
In collaboration with Nature, I investigated the impact of the Trump administration on US science one year after its...
linkedin.com
platform-policy health-ai privacy simulation funding linkedin permalink →
Data
Datawrapper: Create charts, maps, and tables
datawrapper.de
data-viz datasets permalink →
Research
US science after a year of Trump: what has been lost and what remains
nature.com
twitter facebook disinformation health-ai permalink →
Data
Show HN: Self-host Reddit – 2.38B posts, works offline, yours forever | Hacker News
news.ycombinator.com
llms coding-agents reddit github permalink →
Data
distil-labs/distil-qwen3-4b-text2sql · Hugging Face
huggingface.co
llms training infrastructure huggingface permalink →
Data
facebook/research-plan-gen · Datasets at Hugging Face
huggingface.co
facebook huggingface permalink →
Data
World News API: Pricing
worldnewsapi.com
journalism permalink →
Data
Searchable.City
searchable.city
osint permalink →
Data
Exa on X: \"Introducing state-of-the-art People Search: You can now semantically search over 1 billion people using a...
x.com
evaluation vector-search twitter permalink →
Data
Data Types - Platform Data Guide | Show Me The Data
show-me-the-data.com
privacy permalink →
Data
Introduction - Platform Data Guide | Show Me The Data
show-me-the-data.com
twitter youtube tiktok facebook permalink →
Data
Archive on X: \"Bot made $900 -> $208k in 3 months on polymarket one of the most talked about bots on polymarket ri...
x.com
twitter permalink →
Research
Mapping the online manipulation economy
science.org
research-paper permalink →
Research
I would look at the plots in this supplementary file for the science paper mapping online
science.org
research-paper permalink →
Data
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool
paperzilla.ai
research-paper permalink →
Data
Media Cloud
mediacloud.org
voice reddit github permalink →
Research
[2510.23645] Global YouTube Trending Dataset (2022-2025): Three Years of Platform-Curated, Cross-National Trends in D...
arxiv.org
youtube huggingface permalink →
Data
Donating Your Social Media - UCSF Library
library.ucsf.edu
twitter facebook permalink →
Research
lots of fact-check datasets here
dl.acm.org
fact-checking permalink →
Research
[2404.07340] RIP Twitter API: A eulogy to its vast research contributions
arxiv.org
twitter disinformation permalink →
Data
AVAILABLE DATASETS – Mobilize Center
mobilize.stanford.edu
evaluation health-ai permalink →
Data
Postscope - Twitter/X Visualization Tool
postscope.pages.dev
twitter permalink →
Data
Webchiver: Build Your Own Personal Web Archive
webchiver.com
twitter permalink →
Data
GitHub - sherlock-project/sherlock: Hunt down social media accounts by username across social networks · GitHub
github.com
github permalink →
Data
SnapStream ✂️ on X: \"SnapStream lets you see which networks are taking events like the President speaking live &...
x.com
twitter permalink →
Research
a good paper showing evolution of sharing from one subreddit to another, across multiple d
arxiv.org
reddit permalink →
Data
Talk to the City
talktothe.city
policy permalink →
Data
DSA: Risk Assessment & Audit Database | Alexander Hohlfeld
linkedin.com
platform-policy permalink →
Data
Oreocide on X: \"@captgouda24 This page may be of interest to you https://t.co/qV6Ftnojqj\" / X
x.com
twitter github datasets permalink →
Data
Nicholas Decker on X: \"I’ve started an ongoing project to collect all the datasets which economists can use, all in o...
x.com
twitter permalink →
Data
Releases · ArthurHeitmann/arctic_shift · GitHub
github.com
reddit github permalink →
Data
Reddit subreddits metadata, rules and wikis 2025-01 - Academic Torrents
academictorrents.com
reddit github permalink →
Data
Reddit - Please wait for verification
reddit.com
reddit permalink →
Data
Using the LessWrong API to query for events — LessWrong
lesswrong.com
vector-search permalink →
Data
GitHub - HackerNews/API: Documentation and Samples for the Official HN API · GitHub
github.com
github permalink →
Data
Live Trade BenchLive evaluation of trading agents
trade-bench.live
evaluation permalink →
Data
BAAI on X: \"We're releasing InfoSeek, a dataset that trained a 3B model to rival Gemini/Sonnet 4.0 on deep research t...
x.com
twitter github huggingface permalink →
Data
gnews · PyPI
pypi.org
github india permalink →
Data
GitHub - kharrigian/mental-health-datasets: An evolving list of electronic media data sets used to model mental-healt...
github.com
multi-modal twitter reddit github permalink →
Data
An Emerging Lobby: An Analysis of Campaign Contributions from Indian-Americans, 1998-2022 – Joyojeet Pal
joyojeet.people.si.umich.edu
india permalink →
Data
REALLY COOL DATASET that we should immediately integrate into arbiter for the agent to sea
dataverse.harvard.edu
agents datasets permalink →
Data
OpenDataLab å¼é¢AIå¤§æ¨¡åæ¶ä»£çå¼æ¾æ°æ®å¹³å°
opendatalab.com
datasets permalink →
Data
Pratyush Maini on X: \"1/Pretraining is hitting a data wall; scaling raw web data alone leads to diminishing returns....
x.com
llms training twitter permalink →
Data
Can we check if this data comprising 117 million posts is publicly accessible? It would be
academic.oup.com
research-paper permalink →
Data
Substack API repo reaches 100 GitHub stars, thanks to community feedback | Nick Hagar posted on the topic | LinkedIn
linkedin.com
github platform-policy privacy linkedin permalink →
Data
Introducing FineWeb2: A 20TB multilingual dataset | Thomas Wolf posted on the topic | LinkedIn
linkedin.com
llms training huggingface permalink →
Data
GitHub - iptv-org/iptv: Collection of publicly available IPTV channels from all over the world · GitHub
github.com
github permalink →
Data
Multi-Token Attention | Research - AI at Meta
share.google
research-paper permalink →
Data
share.google
share.google
research-paper permalink →
Research
[2404.11988] The Emerging Generative Artificial Intelligence Divide in the United States
arxiv.org
huggingface permalink →
Research
[2504.06318] The Schwurbelarchiv: a German Language Telegram dataset for the Study of Conspiracy Theories
arxiv.org
multi-modal telegram disinformation permalink →
Data
Discord Fetch – discord_fetch
hamelsmu.github.io
llms evaluation github permalink →
Data
Launch YC: Clado: Deep Research for People | Y Combinator
ycombinator.com
agents evaluation permalink →
Data
Reducto: AI document parsing & extraction software
reducto.ai
agents llms vector-search multi-modal permalink →
Data
GitHub - stanford-futuredata/ColBERT: ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'2...
github.com
llms evaluation vector-search infrastructure permalink →
Data
A great dataset: 560 podcast episodes, 300+ hours of content, and full transcripts from a covert Russian influence op...
linkedin.com
platform-policy datasets privacy linkedin permalink →
Data
Zack Kanter on X: \"It unfortunately seems that 37signals spent the last two and a half years on a Manhattan project t...
twitter.com
twitter permalink →
Data
Distill — Latest articles about machine learning
distill.pub
interpretability multi-modal journalism permalink →
Data
On the Biology of a Large Language Model
transformer-circuits.pub
llms interpretability permalink →
Data
Can Large Language Models Explain Their Internal Mechanisms?
pair.withgoogle.com
llms interpretability permalink →
Data
Here Are All The ‘Bro’ Podcast Episodes With Trump
forbes.com
youtube permalink →
Data
Instagram Statistics Marketers Should Know in 2025 [Updated] | Sprout Social
sproutsocial.com
evaluation facebook india permalink →
Data
Digital 2025: Global Overview Report — DataReportal – Global Digital Insights
datareportal.com
tiktok permalink →
Data
platformabuse.org
platformabuse.org
osint permalink →
Data
Cognition on X: \"Project DeepWiki Up-to-date documentation you can talk to, for every repo in the world. Think Deep R...
twitter.com
twitter permalink →
Data
Airtable - Social Media Monitoring Products Repository
airtable.com
datasets permalink →
Data
About - Sourcebase
sourcebase.ai
evaluation fact-checking permalink →
Data
OSINT +500 Tools - Start.me
start.me
osint permalink →
Data
GitHub - cassidoo/scrapers: A list of scrapers from around the web. · GitHub
github.com
github datasets permalink →
Data
Google Sheets now has AI for formulas | Simon Taylor posted on the topic | LinkedIn
linkedin.com
llms fact-checking linkedin permalink →
Data
Yushe on X: \"4chan just got hacked hard. The person who hacked them claimed they dumped the entire database. https://...
twitter.com
twitter permalink →
Data
I’ve built the Perplexity of the DarkWeb! Let me explain 👇 First, if you've been living in a cave, Perplexity is a se...
linkedin.com
agents llms permalink →
Data
Pitch Decks That Helped Hot Startups Raise Millions - Business Insider
businessinsider.com
bluesky facebook permalink →
Data
[Interview] Mark Ledwich - Algorithmic Extremism: Examining YouTube's Rabbit Hole of Radicalization - YouTube
youtube.com
youtube permalink →
Data
Recfluence
recfluence.net
youtube permalink →
Data
John B. Holbein on X: \"Wow! This project looks amazing. In it, three scientists at Columbia, Michigan, and Maryland i...
twitter.com
twitter permalink →
Data
Home - Nielsen Kilts Datasets - Research Guides at New York University
guides.nyu.edu
datasets permalink →
Data
TVNewser
adweek.com
journalism permalink →
Data
Worldwide â X (Twitter) trending topics and hashtags today | trends24.in
trends24.in
twitter permalink →
Data
Ad Library
facebook.com
facebook permalink →
Data
Platform
openmeasures.io
evaluation permalink →
Data
Trending narratives on social: ‘Deport all Muslims,’ Tesla fires are “terrorism,” Biden stranded the astronauts, Step...
mailchi.mp
twitter permalink →
Data
really love the data viz
thcostello.com
datasets permalink →
Research
[2303.05345] TGDataset: Collecting and Exploring the Largest Telegram Channels Dataset
arxiv.org
telegram huggingface health-ai permalink →
Data
Discord
discord.com
discord permalink →
Data
The Top 100 Gen AI Consumer Apps - 4th Edition | Andreessen Horowitz
a16z.com
llms evaluation coding-agents twitter permalink →
Data
javascript - Modifying the positions of streams in the D3 stream graph - Stack Overflow
stackoverflow.com
data-viz infrastructure permalink →
Data
Dataset - Democratic Erosion
democratic-erosion.org
coding-agents permalink →
Data
The GDELT Project
gdeltproject.org
twitter permalink →
Data
Reports – National AI Opinion Monitor
naiom.net
disinformation permalink →
Data
DeepSeek R1 Distill Llama 70B offers a more accessible and fast way of accessing reasoning capabilities than the full...
linkedin.com
agents evaluation permalink →
Data
Cloudflare
workers.cloudflare.com
agents permalink →
Data
GitHub - BloombergGraphics/2025-youtube-podcast-men-for-trump: Data from the Bloomberg News analysis on streamers and...
github.com
youtube github permalink →

All tags

Browse by topic. Stack a few to narrow.

llms 289 twitter 187 agents 123 evaluation 103 huggingface 96 research-paper 85 github 62 training 58 infrastructure 55 disinformation 49 youtube 43 facebook 39 vector-search 39 multi-modal 34 coding-agents 33 fact-checking 28 privacy 25 safety 24 datasets 23 reddit 20 health-ai 19 tiktok 18 moderation 15 voice 15 bluesky 15 platform-policy 14 india 13 linkedin 13 journalism 12 interpretability 12 rag 11 policy 10 labor-markets 10 regulator 9 cib 9 cybersecurity 9 osint 8 geopolitics 8 rl 7 causal-inference 6 trust-and-safety 5 telegram 5 funding 4 social-psychology 4 global-south 3 election-integrity 3 design 3 image-gen 3 doc 3 emotion-detection 3 simulation 3 wikimedia 2 open-source 2 science 2 child-safety 2 data-viz 2 metaverse 1 social-networks 1 rlhf 1 systems 1 knowledge-graphs 1 recsys 1 forecasting 1 news 1 provenance 1 measurement 1 info-ops 1 mechanism-design 1 algorithms 1 tutorial 1 discord 1

CALL FOR READINGS

Have a paper or report we should add?

The library is curated by the team but suggestions are welcome. Send a one-line note via the contact form or DM us on LinkedIn and Twitter.