Readings the team comes back to.
Every link we've shared in our monthly newsletter, in one place. Policy briefs, datasets, technical write-ups, news features, tutorials, papers. Use the search box, filter by category, or click any tag to narrow further.
- Policy
A Framework for Digital Safety: Designing Social Media Interventions
Action-oriented taxonomy of digital-safety interventions across focus, scope, driver, user journey.
papers.ssrn.comTPRC 2025 - Research
Bridging Nodes and Narrative Flows on Telegram
Cross-community bridging metric on Telegram disinformation networks. SimPPL paper.
arxiv.orgarXiv preprint - News
MIT Delta V Demo Day — Sakhi spinout
MIT News write-up of Sakhi at Delta V Demo Day.
news.mit.eduMIT News, Sep 2024 - Policy
Mozilla Responsible Computing Challenge — first awardees in India
Inaugural India RCC cohort announcement; SimPPL was the first awardee.
mozillafoundation.orgJune 2024 - News
Rest of World — SimPPL on automation against misinformation
Three-minute interview ahead of Meta's Bangladesh CIB takedown.
restofworld.orgRest of World, 2024 - Research
How Facebook Has Become a Political Battleground in Bangladesh
Tech Global Institute / SimPPL Info Lab report on Bangladesh election Pages and Groups.
infolab.techglobalinstitute.comTech Global Institute Info Lab - Research
Multiagent Simulators for Social Networks
SimPPL position paper. Multiagent + LLM-driven simulation as the public-research bridge.
arxiv.orgICML 2023 workshop - Technical
Pro-Russian bot networks — active vs deleted user network graphs
Part 2: how information disseminates through coordinated structures.
jhagrutlalwani.vercel.appSimPPL blog - Technical
Pro-Russian bot networks on Twitter — article-sharing analysis
Part 1 of Jhagrut Lalwani's two-part SimPPL blog on coordinated behaviour.
jhagrutlalwani.vercel.appSimPPL blog - Research
ICML 2022 — Estimating the Impact of Coordinated Inauthentic Behavior
Multiagent Reddit simulation quantifying CIB's effect on recommender outputs.
ora.ox.ac.ukICML AI4ABM Workshop - News
Marquise Mason on X: \"#UK High Learn Ltd We attacked https://t.co/OEKetEWsFr 01.02.25 We have received numerous data...
Ransomware crew '8BASE' announcing a UK target on X, illustrating how threat actors broadcast on X for OSINT monitors to pick up.
x.com - Data
deepdarkCTI/twitter_threat_actors.md at main · fastfire/deepdarkCTI · GitHub
Curated index of X accounts run by ransomware groups and cybercriminals — a seed list for OSINT cyber-threat monitoring.
github.com - Research
Ethan Mollick on X: \"Everyone is starting to sound like AI, even in spoken language Analysis of 280,000 transcripts o...
Mollick highlights a study on 280K academic talk transcripts showing speakers increasingly use ChatGPT-favored words — model collapse for humans.
x.com - Technical
Akshay 🚀 on X: \"Big moment for Postgres! Search has always been Postgres' weak spot, and everyone just accepted it. I...
TigerData open-sourced pg_textsearch, bringing native BM25 ranking to Postgres so teams can drop their separate Elasticsearch clusters.
x.com - Technical
Reciprocal rank fusion | Elasticsearch Reference
Elasticsearch reference on Reciprocal Rank Fusion — a tuning-free method for combining multiple retrievers into one ranked result set.
elastic.co - Technical
Origon — The Intelligence Infrastructure
Origon's enterprise-AI-agent platform: agents trained on customer data and deployed to private cloud or on-prem for regulated industries.
origon.ai - Technical
Mixedbread on X: \"We build the first production ready multi-vector and multimodal search. Now we are serving over 1 b...
Mixedbread serves 1B+ docs with multi-vector and multimodal search at sub-50ms p50, with a production engineering write-up forthcoming.
x.com - Technical
GitHub - aiming-lab/SimpleMem: SimpleMem: Efficient Lifelong Memory for LLM Agents — Text & Multimodal · GitHub
SimpleMem: efficient open-source lifelong-memory implementation for LLM agents across text and multimodal inputs.
github.com - Technical
Sprites - Stateful sandboxes
Sprites: hardware-isolated persistent Linux sandboxes for running AI-agent code, with checkpoint and restore.
sprites.dev - News
Many Small Steps for Robots, One Giant Leap for Mankind
Packy McCormick co-essay with Standard Bots' Evan Beard on cumulative small advances in robotics adding up to a giant leap.
notboring.co - Technical
GitHub - braedonsaunders/codeflow: Paste any GitHub URL → interactive architecture map. See how files connect, find w...
Paste any GitHub URL into CodeFlow and get an interactive architecture map showing how files connect — browser-only, no install.
github.com - Technical
json-render | The Generative UI Framework
json-render: a generative-UI framework that turns LLM JSON output into safe predefined components and actions.
json-render.dev - News
Deedy (@deedydas) on XClaude Code did a side project that took me ~2 weeks in 2025 in 30mins.
Deedy Das post claiming Claude Code completed in 30 minutes a side project that took him two weeks in 2025.
share.google - Technical
GitHub - mggrim/scholarly-ideas: Research puzzle development tool - helps academics develop rigorous research puzzles...
Scholarly-ideas: an LLM-driven research-puzzle development tool that grounds academic puzzles in real empirical anomalies.
github.com - Technical
Tweet Topic Clusters â @deedydas
Embeddings + k-means + GPT-4o labeling applied to 6,912 of @deedydas's tweets to surface 25 thematic clusters.
debarghyadas.com - News
Meet Qualia - YouTube
Quadrillion Labs unveils Qualia, an agentic data-scientist product, in a teaser for its private beta.
youtube.com - Technical
E2B | The Enterprise AI Agent Cloud
E2B's enterprise AI-agent cloud, used by Perplexity, Hugging Face, Manus, Groq, and Lindy for sandboxed agent execution.
e2b.dev - Technical
GitHub - robert-mcdermott/ai-knowledge-graph: AI Powered Knowledge Graph Generator · GitHub
Open-source pipeline that turns unstructured text documents into LLM-generated knowledge graphs.
github.com - News
You are being manipulated: RageCheck is a free tool that analyzes online content for manipulative framings—language d...
Jay Van Bavel promotes RageCheck, a free tool that flags emotionally manipulative framing patterns in online content.
linkedin.com - Tutorials
Loss Landscapes: Saddles, Minima & Generalization | TensorTonic
Interactive walk-through of neural-network loss landscape geometry: convexity, saddle points, sharp vs flat minima.
tensortonic.com - Technical
Code Execution with MCP: Fix Tool Token Bloat (Adam Jones, Anthropic) - YouTube
Anthropic engineer Adam Jones on cutting MCP tool-token bloat by having agents write code to call tools rather than register every tool.
youtube.com - Technical
GitHub - huseyinbabal/taws: Terminal UI for AWS (taws) - A terminal-based AWS resource viewer and manager · GitHub
taws: a Rust-built terminal UI for AWS — navigate, observe, and manage resources without the console.
github.com - Technical
AI SDK 6 - Vercel
AI SDK 6 from Vercel ships tool-execution approval, DevTools, native MCP, reranking, and image editing for TypeScript AI apps.
vercel.com - Technical
Code execution with MCP: building more efficient AI agents \\ Anthropic
Anthropic engineering case for letting agents call MCP servers via code execution, so tool definitions stay out of the context window.
anthropic.com - Technical
Merkle Mountain Ranges - Grin Documentation
Grin's docs on Merkle Mountain Ranges — an append-only alternative to Merkle trees used to store blockchain kernels and proofs.
docs.grin.mw - Technical
Databases in 2025: A Year in Review // Blog // Andy Pavlo - Carnegie Mellon University
Andy Pavlo's 2025-in-review on databases: Postgres momentum, MCP everywhere, MongoDB v FerretDB, file formats, and Turbopuffer's rise.
cs.cmu.edu - News
swyx 🌉 on X: \"the way that @turbopuffer started late but overtook Pinecone and ripped out 4-5m ARR contracts needs to...
swyx on Turbopuffer overtaking Pinecone with $4-5M ARR contracts despite a late start, citing Pavlo's 2025 databases retrospective.
x.com - Technical
Devi Parikh on X: \"Building agents that skim the web is easy. Building agents that go deep, go broad, stay on-point,...
Devi Parikh: Yutori's blog on lessons building deep, broad, on-point, 24/7 agents that power Scouts — useful for anyone building agents.
x.com - Technical
ChapterPal — AI-assisted reading and note-taking
ChapterPal: AI-assisted reading and note-taking app, marketing landing page.
chapterpal.com - Technical
Boris Cherny on X: \"I'm Boris and I created Claude Code. Lots of people have asked how I use Claude Code, so I wanted...
Boris Cherny (creator of Claude Code) shares his vanilla setup and notes there's no one correct way to use the tool.
x.com - Research
Randall Balestriero on X: \"We start having provable measures of alignment between pretraining setups and eval perfs:...
Balestriero highlights provable measures of alignment between pretraining setups and eval performance — early but promising work.
x.com - Research
Seunghyun Seo on X: \"Inspired by this thread, I'd like to share my slides on training horizon scaling. Lately, lots o...
Seunghyun Seo's slides on training-horizon scaling, focusing on the role of weight decay (not just learning rate) when scaling.
x.com - Research
HumanPlane/LACUNA · Hugging Face
LACUNA: a PPO RL agent trained to trade Polymarket 15-minute crypto markets by fusing Binance order flow with Polymarket orderbook data.
huggingface.co - Technical
Jaya Gupta on X: \"AI’s trillion-dollar opportunity: Context graphs\" / X
Jaya Gupta argues 'context graphs' that capture decision traces (not just data) are AI's next trillion-dollar opportunity.
x.com - Technical
Animesh Koratana on X: \"How to build a context graph\" / X
Animesh Koratana on how to actually build a context graph — modeling decision traces is structurally hard, not 'add memory to your agent'.
x.com - Tutorials
Manthan Gupta on X: \"How to Use LLM as a Judge (Without Getting Burned)\" / X
Practical guide: when to use LLM-as-judge, with five rules (reference-based, debiasing, ensembling, reasoning before scoring, calibration).
x.com - Technical
discovering the postrat canon in the community archive | lab notes #4
Lab notes from Epistemic Garden on deriving narrative strands from quote-tweets + semantic search across the postrat community archive.
xiqo.substack.com - News
AI Agents on WhatsApp: Scalable Support with ElevenLabs - YouTube
ElevenLabs becomes an official WhatsApp technology provider, deploying human-like voice agents inside WhatsApp business chats.
youtube.com - Tutorials
Miguel Ángel Pastor on X: \"A hardware-aware guide to data structures for system software engineers. https://t.co/cW77...
A hardware-aware guide to data structures for systems engineers, shared by Miguel Pastor.
x.com - Tutorials
Karan Lokchandani on X: \"@techNmak https://t.co/cexRv5mueF\" / X
Karan Lokchandani shares an open LLM-interview-questions PDF — handy prep for ML hiring loops.
x.com - Technical
Rate limits | Gemini API | Google AI for Developers
Google Gemini API rate-limit reference: RPM, TPM, RPD across tiers, model-specific caps, and project-level scoping.
ai.google.dev - Technical
Google AI Studio’s Interactions API for Gemini models and agents
Google AI Studio launches the Interactions API, a unified foundation for building Gemini-based models and agents in public beta.
blog.google - News
Perplexity
Perplexity Page on a malicious VPN extension allegedly stealing ChatGPT credentials (paywalled or login-gated).
perplexity.ai - Tutorials
LangChain on X: \"🚀☁️ Deploying LangChain Agents on AWS Serverless Made by the LangChain Community Thomas Taylor deplo...
LangChain shares Thomas Taylor's tutorial on deploying stateful LangGraph agents on AWS Lambda with DynamoDB checkpointing and CDK.
x.com - Tutorials
.NET Concurrency: Async/Await Explained | Bharat Biyani posted on the topic | LinkedIn
Bharat Biyani's visual guide to .NET concurrency: state machine, stack vs heap, async vs parallel, the .Result deadlock trap.
linkedin.com - Tutorials
Design Patterns: Solutions to Common Software Problems | Bharat Biyani posted on the topic | LinkedIn
Bharat Biyani's design-patterns explainer: why they exist, what problems they solve, when (and when not) to use them, interview framing.
linkedin.com - Research
Belinda on X: \"What if you could watch an AI Scientist think? We built an interface to make @SakanaAILabs’s AI Scient...
Belinda's interpretable-interface project for Sakana's AI Scientist-v2 lets you watch every hypothesis, failed experiment, and 'aha'.
x.com - Research
What 2026 looks like — LessWrong
Daniel Kokotajlo's 2021 LessWrong post extrapolating AI futures year-by-year through 2026 — frequently cited AI-timeline vignette.
lesswrong.com - Research
AI 2027
AI 2027: a detailed scenario forecasting superhuman AI's next-decade impact, with slowdown vs race endings and quantitative trend extrapolations.
ai-2027.com - Technical
xjdr on X: \"# Why Training MoEs is So Hard recently, i have found myself wanting a small, research focused training r...
xjdr on why training MoEs under 20B params is hard: flop efficiency, load-balancing/router stability, and data quality/quantity.
x.com - Tutorials
Archie Sengupta on X: \"distributed_gpu_training\" / X
Archie Sengupta's distributed-GPU-training explainer: from a single GPU's streaming multiprocessors to slicing models across racks.
x.com - Research
c++ design patterns for high frequency trading
arXiv preprint on C++ design patterns for high-frequency trading systems.
arxiv.org - Tutorials
eric zakariasson on X: \"the internal guide that technically lite team members go through when joining cursor https://...
Eric Zakariasson shares Cursor's internal onboarding guide for non-technical team members joining the company.
x.com - Technical
Ax - Build Reliable AI Apps in TypeScript
Ax: a TypeScript DSPy port — declare signatures instead of prompts, with auto-prompt-tuning, GEPA optimizer, and 15+ LLM providers.
axllm.dev - Technical
Preserve progress despite interruptions - AWS Lambda durable functions - AWS
AWS Lambda durable functions: automatic checkpointing, suspend execution up to a year, recover from failures, no infra to manage.
share.google - Technical
Clerk Billing
Clerk Billing: drop-in React components for B2C and B2B subscription billing without writing payment-integration or UI code.
clerk.com - Technical
Polar — A billing platform for the intelligence era | Polar
Polar: usage-based billing platform for the AI era — meter tokens, API calls, compute, GPU workloads.
polar.sh - Technical
K-Dense on X: \"Go from a dataset to a ready to submit manuscript in 1 day with our Claude Scientific Skills and Claud...
K-Dense ships free Claude Scientific Skills + Writer that take a dataset to a submission-ready manuscript in one day.
x.com - Technical
Thariq on X: \"We built a Deep Research demo for the Claude Agent SDK! It's one our most requested use cases: spawn mu...
Thariq launches a Deep Research demo for the Claude Agent SDK: parallel agents researching a topic and synthesizing into a report.
x.com - News
refine | Pedro Sant'Anna
Pedro Sant'Anna recommends refine.ink: AI tool that proof-checks academic papers for internal consistency, typos, and equation proofs.
linkedin.com - Tutorials
psmpy: Propensity Score Matching in Python! | Towards Data SciencePerforming propensity score matching in a python en...
Towards Data Science walkthrough of psmpy, a Python library for propensity-score matching (paywalled, fetch failed).
towardsdatascience.com - Technical
GitHub - leoheuler/flashtensors · GitHub
flashtensors: run 100 large models on a single GPU with minimal time-to-first-token impact via tensor swap.
github.com - Technical
Lakshya A Agrawal on X: \"GEPA featured in @OpenAI and @BainandCompany new cookbook tutorial, showing how to build sel...
GEPA featured in the OpenAI x Bain cookbook tutorial on building self-evolving agents that move beyond static prompts.
x.com - Technical
Modal: High-performance AI infrastructure
Modal: high-performance AI infrastructure with sub-second cold starts, instant autoscaling, elastic GPU access without quotas.
modal.com - Technical
GitHub - MoonshotAI/kosong: The LLM abstraction layer for modern AI agent applications. · GitHub
Kosong: Moonshot AI's open-source LLM abstraction layer for modern AI agent applications (now part of the kimi-cli monorepo).
github.com - Technical
GitHub - microsoft/agent-framework: A framework for building, orchestrating and deploying AI agents and multi-agent w...
Microsoft's Agent Framework: open-source framework for building, orchestrating, and deploying AI agents and multi-agent workflows in Python and .NET.
github.com - Technical
Agent Development Kit (ADK) - Agent Development Kit (ADK)
Google's Agent Development Kit (ADK): production-ready open-source agent framework in Python, TypeScript, Go, and Java.
google.github.io - Research
AI impact on growth: economists vs AI experts | Tom Cunningham posted on the topic | LinkedIn
Tom Cunningham compares AI-impact-on-growth forecasts from economists (0.1-1.5%/yr) vs AI experts (3-30%/yr) — large disagreement, debate why.
linkedin.com - Technical
Introduction - supermemory | Memory API for the AI era
Supermemory: a memory + RAG API for the AI era with graph memory, content types, multi-tenancy, and SDK integrations.
supermemory.ai - News
Santiago on X: \"The MiniMax M2 model is mind-blowing! It's open-source. It outperforms Gemini 2.5, Claude 4.1, and Qw...
Santiago: MiniMax M2 (open-source) outperforms Gemini 2.5, Claude 4.1, Qwen3 on coding/tool-use benchmarks at ~8% of Claude's cost.
x.com - Technical
Akshay 🚀 on X: \"Microsoft did it again! Building with AI agents almost never works on the first try. You spend days t...
Akshay highlights Microsoft's open-source Agent Lightning: train any AI agent (LangChain, AutoGen, CrewAI, etc.) with RL and prompt optimization.
x.com - News
Google's NotebookLM: 8x bigger brain, custom goals, and more | Neil Hoyne posted on the topic | LinkedIn
Neil Hoyne summarizes NotebookLM's upgrade: 1M-token context, 6x longer memory, custom personas, deeper research.
linkedin.com - Research
How to use behavioural science in social media for lasting change | Aleksandra Kuzmanovic posted on the topic | LinkedIn
WHO's Aleksandra Kuzmanovic on behavioural-science-informed framing for health communication on social media: a viewpoint with Meta researchers.
linkedin.com - Research
Time to start looking into pangram’s models for AI generated text detection. Seems like it
BFI Chicago working paper on Pangram's models for AI-generated text detection (PDF, page is binary so excerpt unparsed).
bfi.uchicago.edu - Technical
Johann Schopplich on X: \"JSON is token‑expensive for LLMs – just like @mattpocockuk frequently mentions. Meet TOON, t...
Johann Schopplich introduces TOON: Token-Oriented Object Notation, a JSON-like format that's 40-60% fewer tokens for LLMs.
x.com - Technical
Elastic Dev on X: \"All you need is a natural language agent definition, and you have a custom AI assistant to help yo...
Elastic Dev shows Agent Builder: define an agent in natural language and get a custom AI assistant for Elastic data.
x.com - Technical
LangChain on X: \"🔍🤖 Enterprise Deep Research A multi-agent system leveraging LangGraph to power enterprise research a...
LangChain shares Salesforce AI Research's Enterprise Deep Research, a multi-agent system on LangGraph with streaming and human steering.
x.com - Research
Ai2 on X: \"On olmOCR-Bench, olmOCR 2 scores 82.4 points, up from 78.5 in our previous release—increasing performance...
AI2 olmOCR 2 scores 82.4 on olmOCR-Bench (up from 78.5), with gains across every document category.
x.com - Technical
Meilisearch: Unified Search & AI Retrieval Platform
Meilisearch: unified search and AI-retrieval platform with sub-50ms full-text, semantic, hybrid, and multi-modal search.
meilisearch.com - Tutorials
ES|QL query builder for Python Elasticsearch Client - Elasticsearch Labs
Elastic Labs blog: ES|QL query builder for the Python Elasticsearch client (8.19+) with familiar Python syntax.
share.google - Technical
GitHub - atiilla/GeoIntel: GeoIntel using Google's Gemini API to uncover the location where photos were taken through...
GeoIntel: Python tool using Google's Gemini API to uncover photo locations through AI-powered geo-location analysis.
github.com - Technical
OpenAgent - The Open Source Agentic AISearch, think, and complete general tasks — Open-agent is a multimodal, agentic...
OpenAgent: open-source multimodal agentic AI for search, thinking, and general tasks (page returned empty).
open-agent.io - Technical
GitHub - elevenlabs/ui: ElevenLabs UI is a component library and custom registry built on top of shadcn/ui to help yo...
ElevenLabs UI: a shadcn-based component library and registry for building multimodal voice agents faster.
github.com - Tutorials
How BookMyShow handled 1M ColdPlay ticket requests in 10 mins | Animesh Gaitonde posted on the topic | LinkedIn
Animesh Gaitonde on how BookMyShow handled 1M ColdPlay ticket requests in 10 minutes: pessimistic vs optimistic vs in-memory locking.
linkedin.com - Technical
TwelveLabs: Video Intelligence Platform & API
TwelveLabs: video-intelligence platform with 60x real-time ingest, indexing 10k hours/day; turn raw footage into searchable AI-ready data.
twelvelabs.io - Technical
Aydyn Tairov on X: \"OpenZL - outperforms zstd, xz, gzip, and Blosc on multiple real-world datasets with 10x (!!!) spe...
Aydyn Tairov highlights Meta's OpenZL data-compression framework — graph-based composition of existing algorithms, 10x speedup over zstd.
x.com - News
Google removes num=100 search parameter, impacting startups and LLMs | Adarsh Appaiah posted on the topic | LinkedIn
Adarsh Appaiah: Google removed the num=100 search parameter, cutting LLM-accessible long-tail results 90% and dropping Reddit's stock 15%.
linkedin.com - News
Maxime Labonne on X: \"LFM2-Audio just dropped! It's a 1.5B model that understands and generates both text and audio I...
Liquid AI ships LFM2-Audio, a 1.5B model handling text and audio with 10x faster inference and parity with 10x larger models.
x.com - Data
ToolUniverse — 1,000+ Scientific Tools for AI Scientists
ToolUniverse: a registry of 1,000+ scientific tools for AI Scientist agents.
aiscientist.tools - Technical
Lingo.dev – The localization engineering platform
Lingo.dev: localization-engineering platform that persists glossaries, brand voice, and per-locale model chains as a stateful translation API.
lingo.dev - Technical
Parsera - Transform Websites into Data
Parsera: agent-based and API-based scraping that turns any website into a custom dataset via natural-language prompts.
parsera.org - Technical
Refine - AI-Powered Research Assistant
Refine.ink: AI-powered peer-review tool that flags accuracy, math, and internal-reference errors in research papers.
refine.ink - Technical
Turing Post on X: \"An open-source extension for LLM serving engines – LMCache It's like a caching layer for large-sca...
Turing Post on LMCache, an open-source KV-cache management layer for LLM serving — 4-10x reduction in RAG, lower TTFT, integrated with NVIDIA Dynamo.
x.com - Technical
Harrison Chase on X: \"Deep Agents - now on LangChain 1.0 We rewrote Deep Agents on top of LangChain 1.0, heavily util...
Harrison Chase: Deep Agents now run on LangChain 1.0 using new middleware — technical deep dive on what they are and how to use them.
x.com - Research
Valeriy M., PhD, MBA, CQF on X: \"🚀 Tsururu: A New Python Library for Time Series Forecasting (arXiv:2509.15843v1) Tsu...
Tsururu (arXiv 2509.15843): a Python time-series-forecasting library focused on strategies (recursive/direct/MIMO/hybrid) and preprocessing.
x.com - News
X (link)
X post no longer available (deleted/missing page).
x.com - Technical
Dub - The Modern Link Attribution Platform
Dub: modern link-attribution platform for short links, conversion tracking, and affiliate programs.
dub.co - Technical
Akshay 🚀 on X: \"Finally, an open-source, enterprise-grade RAG solution! If you're building an enterprise-grade RAG sy...
Akshay highlights MindsDB Knowledge Bases: open-source enterprise RAG over 200+ data sources with embeddings, reranking, real-time sync.
x.com - Technical
Adalat AI - End-to-End Justice Tech Stack
Adalat AI: India's end-to-end justice tech stack — courtroom transcription, case-lifecycle management, real-time updates.
adalat.ai - News
Today on Indicator: I found that 53 TikTok videos pushing clickbaity hoaxes about Charlie Kirk's assassination got al...
Alexios Mantzarlis: 53 TikTok hoax videos about Charlie Kirk's assassination got 32M views in three days; TikTok took 48 down after disclosure.
linkedin.com - Data
GitHub - allenai/awesome-open-source-lms: Friends of OLMo and their links. · GitHub
Allen AI's curated list of open-source language models — links and resources from the OLMo team's NeurIPS 2024 tutorial.
github.com - News
Crazy random line in the Cursor RL blog post saying they're collecting RL data from real users, updating the checkpoi...
Nathan Lambert flags Cursor's RL blog: collecting RL data from real users and updating checkpoints every 90-120 minutes — unthinkable a year ago.
linkedin.com - Research
Geoffrey Litt on X: \"If you're thinking about AI-generated UIs, recommend checking out JELLY by @YiningCao3, @peiling...
Geoffrey Litt on JELLY: structured AI-generated UIs that first build a data schema users can edit, then compose UIs from premade widgets.
x.com - Technical
Strands AgentsAI-powered agents for modern workflows
Strands Agents: AI-powered agents for modern workflows (page returned empty).
strandsagents.com - Technical
Advanced Context Engineering for Agents - YouTube
Dexter Horthy (Human Layer) on advanced context engineering for agents — spec-first, compaction strategies, subagents, planning workflows.
youtube.com - Technical
Using LongMemEval to Improve Agent Memory - YouTube
Sam Bhagwat (Mastra) on LongMemEval: tailored templates and targeted updates yield SOTA results on agent-memory benchmarks.
youtube.com - Technical
Context Engineering for Engineers - YouTube
Jeff Huber (Chroma) on context engineering: filtering and compaction matter more than long-context windows for reliable agent performance.
youtube.com - Technical
Context Engineering: Lessons Learned from Scaling CoCounsel - YouTube
Jake Heller on scaling CoCounsel — context-engineering lessons from building professional-grade legal AI from GPT-4 onward.
youtube.com - Technical
Agentuity — The Full-Stack Platform for AI Agents
Agentuity: full-stack platform for AI agents — typed APIs, frontends, sandboxes, evals, OpenTelemetry observability, evals on live traffic.
agentuity.com - News
Meet Chatterbox Multilingual: An Open-Source Zero-Shot Text To Speech (TTS) Multilingual Model with Emotion Control a...
Resemble AI's Chatterbox Multilingual: production-grade open-source zero-shot TTS in 23 languages with emotion control and watermarking.
marktechpost.com - News
smol ai (follow @latentspacepod for ainews) on X: \"[5 Sept 2025] Kimi K2‑0905 and Qwen3‑Max preview: two 1T open weig...
smol AI: Kimi K2-0905 and Qwen3-Max-preview both launched as 1T-parameter open-weight models on the same day.
x.com - Technical
Neo - Autonomous AI Agent to build and evaluate AI models, AI Agents, LLM prompts and ML systems
Neo: autonomous ML-engineer agent that automates training, fine-tuning, RAG pipeline construction, and evaluation.
heyneo.so - Technical
Agentic Design Patterns - Google Docs
Agentic Design Patterns Google Doc (login-gated; can't read content).
docs.google.com - Technical
Do the simplest thing that could possibly work
Sean Goedecke on system design: do the simplest thing that could possibly work, in fixing bugs, maintaining systems, and architecting new ones.
seangoedecke.com - Data
Social Forest - #1 YouTube Data API & YouTube Scraper Alternative
Social Forest: YouTube Data API and YouTube Scraper alternative for accessing YouTube data at scale.
social-forest.com - Technical
Charlie Marsh on X: \"Today, we're announcing our first hosted infrastructure product: pyx, a Python-native package re...
x.com - Technical
AngeTheGreat - YouTube
youtube.com - Technical
Overview | Embedding Atlas
apple.github.io - Technical
How Roblox Partners With Law Enforcement | Roblox
corp.roblox.com - Technical
Daytona - Secure Infrastructure for Running AI-Generated Code
daytona.io - Technical
GitHub - getzep/graphiti: Build Real-Time Knowledge Graphs for AI Agents · GitHub
github.com - Technical
Min Choi on X: \"@sama gpt-oss-20b running on 16GB GPU? 🤔\" / X
x.com - Data
openai/gpt-oss-120b · Hugging Face
huggingface.co - Technical
GPU graph rendering test - YouTube
youtube.com - Technical
LLM Evals: Everything You Need to Know – Hamel’s Blog - Hamel Husain
hamel.dev - Technical
Speeding Up the Webcola Graph Viz Library with Rust + WebAssembly - Casey Primozic's Homepage
cprimozic.net - Technical
Sid on X: \"Working on a side project with Claude called Granter that takes in info about your org, scans through avai...
x.com - Technical
GitHub - chiphuyen/sniffly: Claude Code dashboard with usage stats, error analysis, and sharable feature · GitHub
github.com - Technical
User Embeddings: How TikTok Knows You Better Than You Do - YouTube
youtube.com - Technical
Satya Nadella on X: \"Today we’re releasing GitHub Spark — a new tool in Copilot that turns your ideas into full-stack...
x.com - Data
Any_to_Any_RAG.ipynb · merve/smol-vision at main
huggingface.co - Data
Qwen/Qwen2.5-Omni-7B · Hugging Face
huggingface.co - Technical
Cloudflare launches pay-per-crawl, a game changer for SaaS and content creators | Greg Isenberg posted on the topic |...
linkedin.com - Technical
Ever wonder how platforms like TikTok and YouTube decide what your teen sees? We are developing a tool called Algorit...
linkedin.com - Technical
Why LinkedIn News Feed Is Showing Old Posts - Business Insider
businessinsider.com - Technical
The Big LLM Architecture Comparison
magazine.sebastianraschka.com - Data
Open ASR Leaderboard - a Hugging Face Space by hf-audio
huggingface.co - Technical
secemp on X: \"I couldn't believe whisper was SOTA and then found out there is actually a better model from nvidia (WE...
x.com - Technical
Indian Tech & Infra on X: \"🚨 Perplexity Pro is offering 12 months FREE exclusively to Airtel users in India. https://...
x.com - Technical
Amazon S3 Vectors
aws.amazon.com - Technical
New Health AI Models for Developers: MedGemma 27B and MedSigLIP | Google Research posted on the topic | LinkedIn
linkedin.com - News
Researchers Jailbreak AI by Flooding It With Bullshit Jargon
404media.co - Technical
Beautiful themes for shadcn/ui — tweakcn | Theme Editor & Generator
tweakcn.com - Technical
Me: I want an LLM server for 500 users TikTok: You have an LLM server for 500'000 users at home AIBrix, courtesy of T...
linkedin.com - Technical
Lloom: LLM-based concept induction on political-social media content
stanfordhci.github.io - Technical
Discover Web apps | Mobbin
mobbin.com - Technical
DeepSeek-R1-0528: How to Run Locally | Unsloth Documentation
docs.unsloth.ai - Technical
No credit card or crazy infra needed anymore. 🦥 Just Unsloth and a Colab notebook with a T4 GPU. Fine-tuning massive,...
linkedin.com - Technical
How I built a LinkedIn agent to find profiles fast | Abhijay Vuyyuru posted on the topic | LinkedIn
linkedin.com - Technical
GitHub - stanford-oval/storm: An LLM-powered knowledge curation system that researches a topic and generates a full-l...
github.com - Technical
WhatsApp AI Chatbot to give instant, accurate answers 24/7
joyz.ai - Technical
LangChain overview - Docs by LangChain
python.langchain.com - Technical
Joint Retrieval and Recommendation Modeling - by Janu Verma
januverma.substack.com - Technical
Exa | Web Search API, AI Search Engine, & Website Crawler
exa.ai - Technical
User Guide
gerrit.wikimedia.org - Technical
How Israel uses Google Ads in its information offensive against Iran
indicator.media - Technical
ChatPDF AI | Chat with any PDF | Free
chatpdf.com - Technical
GitHub - HumanSignal/awesome-data-labeling: A curated list of awesome data labeling tools · GitHub
github.com - Technical
GitHub - AykutSarac/github-rater: 📊 Check your GitHub rating, view results and enhance your profile quality. · GitHub
github.com - Technical
Seedance 1.0 Lite | Text to Video | fal.ai
fal.ai - Technical
ChatGPT convinced 3 people to do ketamine, fall in love with it and pushed them to domestic violence. Silicon can now...
linkedin.com - Technical
Hugging Face drops support for Google's AI frameworks | Gaurav Jain posted on the topic | LinkedIn
linkedin.com - Technical
jason on X: \"3M downloads per month 11k stars 0 money raised 1.4M top line revenue for https://t.co/YrBEtDDpo9 thank...
x.com - Technical
How a Danish News Service Made a Profit with its Transcription Tool | by Clare Spencer | Generative AI in the Newsroom
generative-ai-newsroom.com - Technical
Spinach Wikidata
spinach.genie.stanford.edu - Technical
GitHub - stanford-oval/spinach: SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions · G...
github.com - Technical
DeerFlow
deerflow.tech - Technical
Josh Miller on X: \"You can also get unique context into AI chat from within tabs too. ex: Highlight text on a tab to...
twitter.com - Technical
GitHub - triton-inference-server/server: The Triton Inference Server provides an optimized cloud and edge inferencing...
github.com - Technical
Phil Eaton on X: \"Great paper on how Google runs a commercial research lab, courtesy of Ankush Menat. Would love to r...
twitter.com - Technical
Philrox on X: \"As promised, here's the update on my AI powered SEO keyword research workflow from The vibe Marketer I...
twitter.com - Technical
Denislav Jeliazkov on X: \"I design apps for a living. I've spent 6+ years using every FinTech app on the market. Here...
twitter.com - Technical
Charly Wargnier on X: \"This is crazy. Nate Herkelman turned @n8n_io into a full marketing team! 🤯 His AI agent: ↳ gen...
twitter.com - Technical
AI just killed PowerPoint. 😱 No more endless hours creating PPT. Here are 10 websites to create presentations with AI...
linkedin.com - Technical
bilal on X: \"r/localllama anons huffman encoding model weights and inventing a new FP format DFloat11 so they can fit...
twitter.com - Technical
How I use AI playlist - YouTube
youtube.com - Technical
merve on X: \"Don't sleep on this! 🔥 @Meta dropped swiss army knives for vision with A2.0 license ❤️ > image/video...
twitter.com - Technical
GitHub - kortix-ai/suna: The Autonomous Company Operating System · GitHub
github.com - Technical
Visualizing PyTorch DTensor Sharding with JAX | Yi Wang posted on the topic | LinkedIn
linkedin.com - Technical
Pretty WILD - SoTA open source TTS model that beats ElevenLabs/ Sesame - Dia 1.6B - Apache 2.0 licensed! 🔥 > Ultra re...
linkedin.com - Technical
GitHub - birobirobiro/awesome-shadcn-ui: A curated list of awesome things related to shadcn/ui. · GitHub
github.com - Technical
Foundation Model for Personalized RecommendationBy Ko-Jen Hsiao, Yesu Feng and Sudarshan Lamkhede
netflixtechblog.com - Technical
PikTop
pik.top - Technical
Pravda Dashboard — Tracking Russia's Pravda Network
solatrix.github.io - Technical
Pravda in numbers - Content and Network analysis | Amaury L.
linkedin.com - Technical
Goodfire raises $50M Series A for AI understanding | Eric Ho posted on the topic | LinkedIn
linkedin.com - Technical
Potato | Accelerate Scientific Execution
readysetpotato.com - Technical
When building LLM-based applications that use RAG (Retrieval-Augmented Generation), splitting documents into small *c...
linkedin.com - Technical
GitHub - browserbase/stagehand: The SDK For Browser Agents · GitHub
github.com - Technical
The agents are coming and we can't catch them.
alexmreinhart.substack.com - Technical
Skilled Coder on X: \"Backend System Design for Rate Limiter (This is a high-level overview to help you understand how...
twitter.com - Technical
Convert FastAPI to MCP server with FastAPI-MCP | Akshay Pachaar posted on the topic | LinkedIn
linkedin.com - Technical
Build an equity research agent with LlamaCloud and o3 | LlamaIndex posted on the topic | LinkedIn
linkedin.com - Technical
kwindla on X: \"We wrote down everything we've learned building voice AI agents over the past two years. Core technolo...
twitter.com - Technical
Clustering Documents with OpenAI embeddings, HDBSCAN and UMAP – Dylan Castillo
dylancastillo.co - Technical
Manifold
manifold.markets - Technical
DSPy | 🦜️🔗 LangChainDSPy is a fantastic framework for LLMs that introduces an automatic compiler that teaches LMs how...
python.langchain.com - Data
Prompt Engineering Whitepaper (Kaggle / Google)
kaggle.com - Technical
Google Transparency Report | Zoe Darmé
linkedin.com - Technical
Compare Virtual Private Servers (VPS) by Price & Features | servers.fyi
servers.fyi - Technical
Note ranking algorithm
communitynotes.x.com - Technical
Social Media Behaviour
upb-ss1.github.io - Technical
Raindrop | AI Agent Monitoring & Observability
dawnai.com - Technical
Why MCP Won - Latent.Space
latent.space - Technical
TAO: Using test-time compute to train efficient LLMs without labeled data | Databricks Blog
databricks.com - Technical
Reve Image - AI Image Generator and Creative Tool
preview.reve.art - Technical
Perplexity
perplexity.ai - Technical
Releases · aria2/aria2 · GitHub
github.com - Technical
🚀 Big news for anyone building AI agents - we’ve built the fastest way to deploy AI Agents! In just seconds, you can...
linkedin.com - Technical
Briefer (YC S23) is launching its AI analyst—an intelligent agent that helps anyone on your team turn data into clear...
linkedin.com - Technical
John Horton on X: \"Gave a short, impromptu talk on working / Claude Code & LLM code generation generally 1/ https...
twitter.com - Technical
What is the Model Context Protocol (MCP)? - Model Context Protocol
modelcontextprotocol.io - Technical
AI Workflow Automation Platform - n8n
n8n.io - Technical
Independent Podcast & Audio Ad Measurement with Pixel-Based Attribution, Incrementality Testing & Cross-Channel Insights
podscribe.com - Technical
Gemini Embedding: Generalizable Embeddings from Gemini â Google DeepMind
deepmind.google - Technical
Choose Boring Technology
boringtechnology.club - Technical
Here’s how I use LLMs to help me write code
simonwillison.net - Technical
LLM: A CLI utility and Python library for interacting with Large Language Models
llm.datasette.io - Technical
Daryl Anselmo (@darylanselmo) • Threads, Say more
threads.net - Technical
Symbolic.ai - Powering Publishing with AI
symbolic.ai - Technical
Podscribe
app.podscribe.ai - Technical
LangSmith
smith.langchain.com - Technical
Langflow | Low-code AI builder for agentic and RAG applications
langflow.org - Technical
EarthKit Agent - Google Slides
docs.google.com - Technical
Why archive.org can't prove the authenticity of their snapshots - Jett's blog
blog.jettchen.me - Technical
Everyone knows your location
timsh.org - Technical
Emergency Communication Resources - AlertMedia
pyrratech.com - Technical
Mistral OCR is nice and fast but other models outperform it on document processing. We did a comprehensive benchmark...
linkedin.com - Technical
Mistral OCR | Mistral AI
mistral.ai - Technical
Gentelella Admin Theme (Colorlib) - DJ Unicode hackathon inspiration
colorlib.com - Technical
Wiz Research Uncovers Exposed DeepSeek Database Leaking Sensitive Information, Including Chat History | Wiz Blog
wiz.io - Technical
How to Backdoor Large Language Models - by Shrivu Shankar
blog.sshh.io - Technical
Disrupting malicious uses of AI by state-affiliated threat actors | OpenAI
openai.com - Technical
Countering Cognitive Warfare in the Digital Age – Information Professionals Association
information-professionals.org - Technical
How to leverage chat and support logs for RAG | Max Buckley posted on the topic | LinkedIn
linkedin.com - Technical
The public domain, digital commons, and digital public goods (DPGs): How Wikimedia projects advance a positive vision...
diff.wikimedia.org - Technical
Notion
mimansajaiswal-embedded-dbs.notion.site - Technical
Evolving Outline to Power Our Providers | by Jigsaw | Jigsaw | Medium
medium.com - Technical
The 2025 AI Agent Index
aiagentindex.mit.edu - Technical
Welcome to LM Studio Docs! | LM Studio
lmstudio.ai - Technical
O2 unveils Daisy, the AI granny wasting scammers’ time - Virgin Media O2
news.virginmediao2.co.uk - Technical
[RFC] LLM APIs for Ray Data and Ray Serve · Issue #50639 · ray-project/ray · GitHub
github.com - Technical
GitHub - docling-project/docling: Get your documents ready for gen AI · GitHub
github.com - Technical
Transformer²: Self-Adaptive LLMs
sakana.ai - Technical
Pandas vs. FireDucks Performance Comparison
dailydoseofds.com - Technical
Dan McAteer on X: \"this is an amazing way to think about prompting o1 from @benhylak https://t.co/byVj8wHmUT\" / X
twitter.com - Technical
@jagolinzer.bsky.social on Bluesky
bsky.app - Technical
Countering China’s Information Manipulation: A Toolkit for Understa
iri.org - Technical
Anton Osika on X: \"we launched publicly 8 days ago, hit $1M ARR today, and only took down one cloud provider along th...
twitter.com - Technical
China's AI models surpass global counterparts in diversity and diffusion | Stanford Institute for Human-Centered Arti...
linkedin.com - Technical
Using the DSA to Study Platforms
verfassungsblog.de - Technical
DSpace
openyls.law.yale.edu - Technical
Unpacking deceptive design
publicpolicy.google - Technical
PCIO Platform Interventions Codebook - Google Docs
docs.google.com - Technical
disinfodex.org
disinfodex.org - Technical
AI's Power Requirements Under Exponential Growth: Extrapolating AI Data Center Power Demand and Assessing Its Potenti...
rand.org - Research
“Community Guidelines Make this the Best Party on the Internet”: An In-Depth Study of Online Platforms’ Content Moder...
arxiv.org - Research
Friction-In-Design Regulation as 21st Century Time, Place, and Manner Restriction
papers.ssrn.com - Research
How can we combat online misinformation? A systematic overview of current interventions and their efficacy
papers.ssrn.com - Technical
Policy Implications of DeepSeek AI’s Talent Base | Stanford HAI
hai.stanford.edu - Technical
The UAE’s Trump-Era AI Strategy | Lawfare
lawfaremedia.org - Technical
list of policy newsletters and sources to mine using LLMs for improving our newsletter!list of policy newsletters and...
media.licdn.com - Technical
CAIDP Update 7.11 - AI Policy News (March 24, 2025) | Center for AI and Digital Policy
linkedin.com - Technical
We Need an Interventionist Mindset | TechPolicy.Press
techpolicy.press - Technical
About | Civil Rights Table
civilrightstable.org - Technical
Reports & Documents | WaTech
watech.wa.gov - Technical
policy framework for AI4Science
static.googleusercontent.com - Technical
from UC Berkeley
cltc.berkeley.edu - Technical
Artificial Analysis State of AI: China | Artificial Analysis
linkedin.com - Technical
International AI Safety Report | Jonas Freund
linkedin.com - Technical
youtube just announced they may do away with fact-checking
files.maldita.es - Technical
What does the public think about AI? | Harry Law | 11 comments
linkedin.com - Research
https://avikrishna.substack.com/p/eliciting-frontier-model-character?selection=2ddd1e4b-84e7-4cea-bfd3-41f1dc13f9ea&u...
open.substack.com - Research
Qalb: Largest State-of-the-Art Urdu Large Language Model for 230M Speakers with Systematic Continued Pre-training | T...
taimoor.xyz - Research
Nearly a third of social-media research has undisclosed ties to industry, preprint claims
science.org - Research
Training AI Co-Scientists Using Rubric Rewards | alphaXiv
alphaxiv.org - Research
Artificial intelligence tools expand scientists’ impact but contract science’s focus | Nature
nature.com - Research
Research into how narratives spread across social media platforms with some case studies f
scholarspace.manoa.hawaii.edu - Research
fact checking reduces engagement with false information
papers.ssrn.com - Research
Do reasoning models have real “Aha!” moments—mid-chain realizations where they intrinsically self-correct? In a new p...
linkedin.com - Research
[2510.24810] COMMUNITYNOTES: A Dataset for Exploring the Helpfulness of Fact-Checking Explanations
arxiv.org - Research
God of Prompt on X: \"R.I.P few-shot prompting. Meta AI researchers discovered a technique that makes LLMs 94% more ac...
x.com - Research
new deepseek paper on introducing geometric constraints when training, for less instabilit
arxiv.org - Research
alex zhang on X: \"Much like the switch in 2025 from language models to reasoning models, we think 2026 will be all ab...
x.com - Research
How LLMs result in increased rich elements and targeting by newsroomsHow LLMs result in increased rich elements and t...
papers.ssrn.com - Research
NATO releases research/report into cognitive warfare
sto.nato.int - Research
AI use in American newspapers is widespread | Peter Slattery, PhD | 11 comments
linkedin.com - Research
Shizhe Diao on X: \"✨Introducing ProfBench. LLM eval shouldn’t be limited to math/code/short QA. Real work is: read pr...
x.com - Research
Ethan Mollick on X: \"AI can help explain complex topics easily by throwing together a simulation. As Eric says later...
x.com - Research
Alex Prompter on X: \"This paper from Harvard and MIT quietly answers the most important AI question nobody benchmarks...
x.com - Research
Crémieux on X: \"The Science paper: https://t.co/km05CPqfcX (now viral online, unfortunately!) A welcome correction th...
x.com - Research
Recent discoveries on the acquisition of the highest levels of human performance
science.org - Research
apparently a super important theory ML paper by lenka zdeborova and crewapparently a super important theory ML paper...
arxiv.org - Research
Locke Cai on X: \"RL for reasoning often rely on verifiers — great for math, but tricky for creative writing or open-e...
x.com - Research
AI Chatbot’s are getting more relationship seeking but not more useful
arxiv.org - Research
https://andreyfradkin.com/assets/LLM_Demand_12_12_2025.pdf
andreyfradkin.com - Research
Andrey Fradkin on X: \"How much does intelligence cost? How concentrated is the AI market and is it winner take all? W...
x.com - Research
The Tip of the Iceberg: How the Social Media Production-Consumption Gap Distorts Public Opinion for Citizens and Researchers
osf.io - Research
Smartphones and Social Media Fuel Polarization Since 2008 | Jay Van Bavel, PhD posted on the topic | LinkedIn
linkedin.com - Research
ChatGPT does not replicate human moral judgments: the importance of examining metrics beyond correlation to assess ag...
nature.com - Research
Short-form video platforms drive mobile usage
osf.io - Research
[2508.08596] How Conversational Structure and Style Shape Online Community Experiences
arxiv.org - Research
using personas doesn't make an AI better at a task
papers.ssrn.com - Research
View of Searching for Elected Officials: Google’s Prioritization of Political Information
journalqd.org - Research
Rethinking news framing with large language models | Scientific Reports
nature.com - Data
Paper page - From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence
huggingface.co - Research
Reranking partisan animosity in algorithmic social media feeds alters affective polarization
science.org - Research
[2510.15951] Attention to Non-Adopters
arxiv.org - Research
New technical report on mixture of experts style models
storage.googleapis.com - Research
will brown on X: \"@dejavucoder plenty about it in the paper :) https://t.co/32O2NccA3D https://t.co/GnhAY4cJwu\" / X
x.com - Research
a really important paper to understand model trends
dataprovenance.org - Research
How Instacart is using LLMs for better e-commerce search | Yuanzheng (Ron) Zhu posted on the topic | LinkedIn
linkedin.com - Research
Understanding the impact of misinformation on adolescents | Nature Human Behaviour
nature.com - Research
Randall Balestriero on X: \"LeJEPA: a novel pretraining paradigm free of the (many) heuristics we relied on (stop-grad...
x.com - Research
[2510.08831] Everyone prefers human writers, including AI
arxiv.org - Research
Language models cannot reliably distinguish belief from knowledge and fact | Nature Machine Intelligence
nature.com - Research
LLMs destroy signals in marketplaces that separate high skill workers from low skill worke
jesse-silbert.github.io - Research
[2502.09992] Large Language Diffusion Models
arxiv.org - Research
\"Learn Text-to-SQL: Resources for Practitioners\" | Aman Chadha posted on the topic | LinkedIn
linkedin.com - Research
\"US-China scientific leadership shift: China's rise in global science\" | James Evans posted on the topic | LinkedIn
linkedin.com - Research
Smartphone use in adults and linked mental health and wellbeing concernsSmartphone use in adults and linked mental he...
pnas.org - Research
Patterns of news sharing and engagement across seven different social platforms
pnas.org - Research
[2509.00446] NEWSAGENT: Benchmarking Multimodal Agents as Journalists with Real-World Newswriting Tasks
arxiv.org - Research
LinkedIn
lnkd.in - Research
[2510.20171] Collective Communication for 100k+ GPUs
arxiv.org - Research
[2510.15053] The Physics of News, Rumors, and Opinions
arxiv.org - Research
[2508.18541] Uncovering Intervention Opportunities for Suicide Prevention with Language Model Assistants
arxiv.org - Research
Are these the happiest PhD students in the world?
nature.com - Research
What makes PhD students happy? Good supervision
nature.com - Research
Research into the value of online data to platforms
pubsonline.informs.org - Research
Imaging Time-Series to Improve Classification and Imputation
arxiv.org - Research
elvis on X: \"People are sleeping on Deep Agents. Start using them now. This is a fun paper showcasing how to put toge...
x.com - Research
Discourse Graphs | A Tool for Collaborative Knowledge Synthesis
discoursegraphs.com - Research
a massive community notes dataset
arxiv.org - Research
[2508.06445] Echoes of Automation: The Increasing Use of LLMs in Newsmaking
arxiv.org - Research
[2510.12323] RAG-Anything: All-in-One RAG Framework
share.google - Research
[2510.09263] SynthID-Image: Image watermarking at internet scale
arxiv.org - Research
the economics and geography of data centers
pubsonline.informs.org - Research
Aparna Dhinakaran on X: \"We improved @cline, a popular open-source coding agent, by +15% accuracy on SWE-Bench — with...
x.com - Research
Ideological fragmentation of the social media ecosystem: From echo chambers to echo platforms
academic.oup.com - Research
Simulating Social Networks with Hybrid Methodology | Lynnette Ng posted on the topic | LinkedIn
linkedin.com - Research
Less is More: Recursive Reasoning with Tiny Networks | alphaXiv
alphaxiv.org - Research
𝚐𝔪𝟾𝚡𝚡𝟾 on X: \"MODEL: https://t.co/hgHmWfu9b1 RELEASE: https://t.co/i0a5UL8r5C\" / X
x.com - Research
Rohan Paul on X: \"This paper introduces a new method called Agentic Context Engineering (ACE). It helps language mode...
x.com - Research
Rohan Paul on X: \"A 7B model, tuned for forms and docs, beats giant models at pulling structured data. Beats GPT-4.1...
x.com - Research
Rohan Paul on X: \"A beautiful paper from MIT+Harvard+ @GoogleDeepMind 👏 Explains why Transformers miss multi digit mu...
x.com - Research
Excited to share our latest paper from Meta Superintelligence Lab examining the factors that drive reasoning performa...
linkedin.com - Research
Sycophantic AI increases attitude extremity and overconfidence
osf.io - Research
synthesizing comments with LLMs to aid community notes
arxiv.org - Research
The complexity of misinformation extends beyond virus and warfare analogies | npj Complexity
nature.com - Research
Pricing | Pangram Labs
pangram.com - Research
[2402.14873] Technical Report on the Pangram AI-Generated Text Classifier
arxiv.org - Research
Rohan Paul on X: \"BIG claim. Giving an LLM just 78 carefully chosen, full workflow examples makes it perform better a...
x.com - Research
LoRA Without Regret - Thinking Machines Lab
thinkingmachines.ai - Research
Sarah Cen on X: \"We ran a longitudinal study of LLMs during the 2024 US election 🗳️ We queried 12 models on a survey...
x.com - Data
openai/gdpval · Datasets at Hugging Face
huggingface.co - Research
elvis on X: \"Federation of Agents This is a neat concept to convert static multi-agent coordination into dynamic capa...
x.com - Research
Ivan Zhou on X: \"Automated prompt optimization (GEPA) can push open-source models beyond frontier performance on ente...
x.com - Research
Gabriele Berton on X: \"[paper release!] Did you know that you can - speed up any LLM by 4x - and reduce its memory fo...
x.com - Research
Community Notes help reduce the virality of false information on X, study finds – UW News
washington.edu - Research
Current Real-World Use of Large Language Models for Mental Health
osf.io - Research
What remains after LLMs: technical knowledge moves from hubs to niches
papers.ssrn.com - Research
Jackson Atkins on X: \"MIT and Microsoft just made AI 64x better at planning, achieving 94% accuracy. 💥 Their PDDL-INS...
x.com - Research
[2502.16487] All That Glitters is Not Novel: Plagiarism in AI Generated Research
arxiv.org - Research
Chao Huang on X: \"Our team's AI-Researcher has been accepted by NeurIPS 2025 and selected as a Spotlight! 🌟 The proje...
x.com - Research
X (link)
x.com - Research
Rohan Paul on X: \"LLM for financial trading/decision making. A 4B model financial-domain model, Trading-R1, that writ...
x.com - Research
ingroup positivity drives engagement during crisis events
pnas.org - Research
The Digital Ethnography Collective Reading List - Google Docs
docs.google.com - Research
should ai nudge you? how people pay attention to AI signals
papers.ssrn.com - Research
communicating uncertainty can increase AI adoption
papers.ssrn.com - Research
how to improve knowledge accumulation in the social sciences
federicaizzo.com - Research
How to destroy your reputation??? By transparently disclosing your usage of AI… Whereas shaky studies without peer re...
linkedin.com - Research
[2509.11391v1] \"My Boyfriend is AI\": A Computational Analysis of Human-AI Companionship in Reddit's AI Community
arxiv.org - Research
Brian Armstrong on X: \"x402 + @Google just unlocked a new level for AI agents. Agents can actually pay each other now...
x.com - Research
DeepDive: Advancing Long-Horizon Search Agents with Knowledge Graphs and Multi-Turn Reinforcement Learning
arxiv.org - Research
the cost of selling tiktok on the ads market
pnas.org - Research
How People Use ChatGPT | NBER
nber.org - Research
[2506.11727] Forgetful by Design? A Critical Audit of YouTube's Search API for Academic Research
arxiv.org - Research
Turing Post on X: \"One of the most comprehensive Surveys of Reinforcement Learning for LRMs Covers: - LLMs ➝ LRMs via...
x.com - Research
Misha Teplitskiy | Science of Science on X: \"One of the craziest soc sci papers of all time: an email nudge generated...
x.com - Research
Arvindh Arun on X: \"Why does horizon length grow exponentially as shown in the METR plot? Our new paper investigates...
x.com - Research
the impact of LLM Adoption on online user behavior
papers.ssrn.com - Research
interesting study to see the effects of criticism and pushback against public health narra
pnas.org - Research
REAL Evals - Realistic Evaluations for Agents Leaderboard
realevals.xyz - Data
Paper page - NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings
huggingface.co - Research
Ravid Shwartz Ziv on X: \"The new OpenAI paper “Why Language Models Hallucinate” is more like PR than research. The cl...
x.com - Research
LLM hallucinations are compression artefacts, not bugs. We can predict them with EDFL. | Leon Chlon, PhD posted on th...
linkedin.com - Research
Why language models hallucinate | OpenAI
openai.com - Research
Domenico Ferraro on X: \"As more data come in, the contractionary impact of tariffs is becoming increasingly clear. Ou...
x.com - Research
[2507.10599] Emergence of Hierarchical Emotion Organization in Large Language Models
arxiv.org - Research
How can you more effectively talk with your hands? (Research just conditionally accepted at JMR) People often move th...
linkedin.com - Research
[2508.08285] The Illusion of Progress: Re-evaluating Hallucination Detection in LLMs
arxiv.org - Research
Social opinions prediction utilizes fusing dynamics equation with LLM-based agents | Scientific Reports
nature.com - Research
How To Become A Mechanistic Interpretability Researcher — AI Alignment Forum
alignmentforum.org - Research
[2507.00926] HyperFusion: Hierarchical Multimodal Ensemble Learning for Social Media Popularity Prediction
arxiv.org - Research
criticism of using AI simulations to infer causation in social network settingscriticism of using AI simulations to i...
science.org - Research
Another paper about the effects of GenAI on reduction in hiring early stage folksAnother paper about the effects of G...
papers.ssrn.com - Research
This new DeepMind research shows just how broken vector search is. Turns out some docs in your index are theoreticall...
linkedin.com - Research
Canaries in the Coal Mine? Six Facts about the Recent Employment Effects of Artificial Intelligence - Stanford Digita...
digitaleconomy.stanford.edu - Research
[2504.13279] Just Another Hour on TikTok: ID sampling to obtain a complete slice of TikTok
arxiv.org - Research
Detecting Child Objectification on Social Media: Challenges in Language Modeling - ACL Anthology
aclanthology.org - Research
ACM · 3706598.3713362
dl.acm.org - Research
Muyu He on X: \"Our EMNLP main paper presents a fun but very challenging benchmark for LLMs: to solve over 300+ Ace At...
x.com - Research
A Novel Multi-Document Retrieval Benchmark: Journalist Source-Selection in Newswriting - ACL Anthology
aclanthology.org - Research
🚨🚨New paper, now out in PNAS! We know that outrage and negativity go viral online, but is this *always* the case? No:...
linkedin.com - Research
Towards Interactive Evaluations for Interaction Harms in Human-AI Systems | Knight First Amendment Institute
knightcolumbia.org - Research
[2507.19373] Changes to the Facebook Algorithm Decreased News Visibility Between 2021-2024
arxiv.org - Research
racial discrimination in the follow-back rates to phd student twitter accounts by academic
docs.iza.org - Research
Proceedings of the ICWSM Workshops
workshop-proceedings.icwsm.org - Research
India's Cash Transfer Experiment: Boosting Nutrition and Development | Karthik Muralidharan posted on the topic | Lin...
linkedin.com - Research
[2504.18041] RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models
arxiv.org - Research
Jiayuan Zhu on X: \"🎉Happy to share that our paper \"Ask Patients with Patience (APP): Enabling LLMs for Human-Centric...
x.com - Research
[2505.16023] Prototypical Human-AI Collaboration Behaviors from LLM-Assisted Writing in the Wild
arxiv.org - Research
Data is infrastructure
degruyterbrill.com - Research
[2311.09730] Sociodemographic Prompting is Not Yet an Effective Approach for Simulating Subjective Judgments with LLMs
arxiv.org - Research
VerbaAI
verbaai.org - Research
[2508.15763] Intern-S1: A Scientific Multimodal Foundation Model
arxiv.org - Research
[2506.08292] From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium
arxiv.org - Research
Arnav Arora on X: \"🚨New pre-print 🚨 News often conveys different things in text vs. image. Recent work in comp. frami...
x.com - Research
What we can learn from TikTok through its Research API
gdfm.me - Research
Jiashuo Liu on X: \"We built FutureX, the world’s first live benchmark for real future prediction — politics, economy,...
x.com - Research
I've shared this earlier but once upon a time I built a reddit content classifier based on
ora.ox.ac.uk - Research
[2508.09809] A Comprehensive Review of Datasets for Clinical Mental Health AI Systems
arxiv.org - Research
ml model \"genetics\" and a \"family tree\"
arxiv.org - Research
[2503.17684] Can LLMs Automate Fact-Checking Article Writing?
arxiv.org - Research
👋 Jan on X: \"Introducing Jan-v1: 4B model for web search, an open-source alternative to Perplexity Pro. In our evals,...
x.com - Research
[2506.06299] How malicious AI swarms can threaten democracy: The fusion of agentic AI and LLMs marks a new frontier i...
arxiv.org - Research
[2507.02197] Do Role-Playing Agents Practice What They Preach? Belief-Behavior Consistency in LLM-Based Simulations o...
arxiv.org - Research
liberals and conservatives share information differently on social medialiberals and conservatives share information...
academic.oup.com - Research
An Vo on X: \"🚨 Our latest work shows that SOTA VLMs (o3, o4-mini, Sonnet, Gemini Pro) fail at counting legs due to bi...
x.com - Research
elvis on X: \"Tool-Augmented Unified Retrieval Agent for AI Search Nice paper showing how to effectively extend RAG to...
x.com - Research
Beyond Binary Rewards: RL for Calibrated LMs
rl-calibration.github.io - Research
estimation of emotion in the sharing of content on social media platformsestimation of emotion in the sharing of cont...
psycnet.apa.org - Research
[2505.20201] Reasoning Is Not All You Need: Examining LLMs for Multi-Turn Mental Health Conversations
arxiv.org - Research
Brendan Jowett on X: \"AI agents are taking off. But we may be building them the wrong way. A new paper from NVIDIA ar...
x.com - Research
Labeled Dataset for sensitive topics (conflictual language, profanity, sexually explicit m
arxiv.org - Research
[2506.06347] Unified Game Moderation: Soft-Prompting and LLM-Assisted Label Transfer for Resource-Efficient Toxicity...
arxiv.org - Research
[2312.12651] Toxic Bias: Perspective API Misreads German as More Toxic
arxiv.org - Research
[2507.17636] Who Attacks, and Why? Using LLMs to Identify Negative Campaigning in 18M Tweets across 19 Countries
arxiv.org - Research
case studies for arbiter to be a social sensemaking tool
journals.sagepub.com - Research
[2507.21206] Agentic Web: Weaving the Next Web with AI Agents
arxiv.org - Research
[2507.06268v1] A Collectivist, Economic Perspective on AI
arxiv.org - Research
Jackson Atkins on X: \"LLMs can now self-optimize. A new method allows an AI to rewrite its own prompts to achieve up...
x.com - Research
[2506.21734] Hierarchical Reasoning Model
arxiv.org - Research
Persona vectors: Monitoring and controlling character traits in language models \\ Anthropic
anthropic.com - Research
LinkedIn
lnkd.in - Research
LinkedIn
lnkd.in - Research
EuroCon: Benchmarking Parliament Deliberation for Political Consens
zowiezhang.github.io - Research
HateDay: Insights from a Global Hate Speech Dataset Representative of a Day on Twitter - ACL Anthology
aclanthology.org - Research
[2507.16045] Chameleon Channels: Measuring YouTube Accounts Repurposed for Deception and Profit
arxiv.org - Research
A Social Dynamical System for Twitter Analysis
arxiv.org - Research
Huge congratulations to the brilliant minds behind this groundbreaking work which won ACL outstanding paper award! Ca...
linkedin.com - Research
Introducing GSPO: A New RL Algorithm for LLMs | Alex Shan posted on the topic | LinkedIn
linkedin.com - Research
what happens to academics post tenure, interesting work
pnas.org - Research
neat recent work on improving the generation of research reports by Googleneat recent work on improving the generatio...
arxiv.org - Research
[2407.12034] Understanding Transformers via N-gram Statistics
arxiv.org - Research
When Claims Evolve: Evaluating and Enhancing the Robustness of Embedding Models Against Misinformation Edits - ACL An...
aclanthology.org - Research
What Does Consulting Do? | NBER
nber.org - Research
AlphaGo Moment for Model Architecture Discovery | alphaXiv
alphaxiv.org - Research
Rohan Paul on X: \"The paper builds a small simulated economy with 100 language‑model “workers” and one language‑model...
x.com - Research
#research #causal #ml | Ciarán M. Gilligan-Lee
linkedin.com - Research
Michael R. Bock on X: \"1/ Can AI file your taxes? Not yet. We tested the latest frontier models and the results were...
x.com - Research
[2507.07931] Meek Models Shall Inherit the Earth
arxiv.org - Research
Feature-based reward learning shapes human social learning strategies | Nature Human Behaviour
nature.com - Research
Karan Singhal on X: \"📣 Excited to share our real-world study of an LLM clinical copilot, a collab between @OpenAI and...
x.com - Research
[2507.13919] The Levers of Political Persuasion with Conversational AI
arxiv.org - Research
[2505.11711] Reinforcement Learning Finetunes Small Subnetworks in Large Language Models
arxiv.org - Research
UniverseTBD on X: \"📢 New dataset out! We introduce HypoGen💥, a dataset of ~5.5K structured problem–hypothesis pairs (...
x.com - Research
[2410.02724] Large Language Models as Markov Chains
arxiv.org - Research
[2504.11169] MuSeD: A Multimodal Spanish Dataset for Sexism Detection in Social Media Videos
arxiv.org - Research
Attributing news creation to AI doesn’t really reduce the perceived value and ideological
journals.sagepub.com - Research
Zhu Jian-Qiao on X: \"Centaur may have learned a shortcut that explains away psychological tasks. @PsychBoyH Link to p...
x.com - Research
Alex Imas on X: \"🚨New paper (link in reply)🚨 Are we underestimating AI use in self-report surveys? YES, by as much as...
x.com - Research
Frontiers: Generative AI and Personalized Video Advertisements
pubsonline.informs.org - Research
more research about how using LLMs can harm learning among students at PNAS this timemore research about how using LL...
pnas.org - Research
New prosocial design interventions paper is out
doi.org - Research
JSTOR paper 2118400
jstor.org - Research
Sukjun (June) Hwang on X: \"Tokenization has been the final barrier to truly end-to-end language models. We developed...
x.com - Research
A large-scale replication of scenario-based experiments in psychology and management using large language models | Na...
nature.com - Research
the governance and behavioral challenges from personalizable AI
doi.org - Research
[2507.03041] Optimas: Optimizing Compound AI Systems with Globally Aligned Local Rewards
arxiv.org - Research
[2507.04545] Measuring Social Media Network Effects
arxiv.org - Research
Interesting work by Anthropic on self labeling by LLMs that we should read for the benchma
arxiv.org - Research
Marcel Binz on X: \"Excited to see our Centaur project out in @Nature. TL;DR: Centaur is a computational model that pr...
x.com - Research
[2403.03744] MedSafetyBench: Evaluating and Improving the Medical Safety of Large Language Models
arxiv.org - Research
View of Changes in YouTube's Content Moderation Policy Had Little Detectable Impact on Election Denial Content
ojs.aaai.org - Research
Beyond Semantics: Unreasonable Effectiveness of Reasonless Intermediate Tokens | Hacker News
news.ycombinator.com - Research
[2502.05967] $μ$nit Scaling: Simple and Scalable FP8 LLM Training
arxiv.org - Research
Tracing the thoughts of a large language model \\ Anthropic
anthropic.com - Research
The Youth Vote in 2024 | CIRCLE
circle.tufts.edu - Research
Chain-of-Thought Is Not Explainability | alphaXiv
alphaxiv.org - Research
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning | Nathan Lambert | 14 comments
linkedin.com - Research
[2506.21718] Performance Prediction for Large Systems via Text-to-Text Regression
arxiv.org - Research
interesting piece on why privacy regulation can solve the disinformation probleminteresting piece on why privacy regu...
papers.ssrn.com - Research
Clint Jarvis on X: \"Stanford paid 35,000 people to quit social media. This was the largest study on emotional health...
x.com - Research
[2506.17729] Efficient Difference-in-Differences and Event Study Estimators
arxiv.org - Research
[2506.18167] Understanding Reasoning in Thinking Language Models via Steering Vectors
arxiv.org - Research
Following news on social media boosts knowledge, belief accuracy and trust | Nature Human Behaviour
nature.com - Research
GitHub - google-deepmind/videoprism: Official repository for \"VideoPrism: A Foundational Visual Encoder for Video Und...
github.com - Research
conway on X: \"latest moondream model is actually beating gpt-4o in several cases I've tested https://t.co/L09T015fvv\"...
x.com - Research
Updesh
aikosh.indiaai.gov.in - Research
What is LLooM? | LLooM
stanfordhci.github.io - Research
Ryan Marten on X: \"Announcing OpenThinker3-7B, the new SOTA open-data 7B reasoning model: improving over DeepSeek-R1-...
x.com - Research
Modulate | Frontier voice AI company
modulate.ai - Research
[2506.07667] Silencing Empowerment, Allowing Bigotry: Auditing the Moderation of Hate Speech on Twitch
arxiv.org - Research
Brian Christian on X: \"CREDITS: This work was done with @hannahrosekirk, @tsonj, @summerfieldlab, and Tsvetomira Dumb...
x.com - Research
BART: A Standard Tool for Data Science | Richard Hahn posted on the topic | LinkedIn
linkedin.com - Research
[2506.12349] Information Suppression in Large Language Models: Auditing, Quantifying, and Characterizing Censorship i...
arxiv.org - Research
addressing bias in financial decisionmaking by LLMs through representation engineeringaddressing bias in financial de...
papers.ssrn.com - Research
How we built our multi-agent research system \\ Anthropic
anthropic.com - Research
[2506.14295] The Impact of Generative AI on Social Media: An Experimental Study
arxiv.org - News
The Impact of Generative AI on Critical Thinking: Self-Reported Reductions in Cognitive Effort and Confidence Effects...
microsoft.com - Research
[2506.08872] Your Brain on ChatGPT: Accumulation of Cognitive Debt when Using an AI Assistant for Essay Writing Task
arxiv.org - Research
Primer: What We Know About Effective Misinformation Interventions
prosocialdesign.org - Research
worth reading this paper about \"how many days did you practice music\" as a treatment affec
rss.onlinelibrary.wiley.com - Research
GitHub - MCKnaus/dmlmt: Double Machine Learning for Multiple Treatments · GitHub
github.com - Research
OII | New study finds Republicans flagged for posting misleading tweets twice as often as Democrats on X/Twitter’s Co...
oii.ox.ac.uk - Research
Real People Don’t Use UTM Codes. UTM codes are a great way to track the… | by Felipe Hoffa | The Startup | Medium
medium.com - Research
Toward Informal Language Processing: Knowledge of Slang in Large Language Models
arxiv.org - Research
new programming benchmark showcasing that LLMs do really poorly on programming benchmarksnew programming benchmark sh...
arxiv.org - Research
This is wild. 🤯 Apple drops a paper saying AI \"reasoning\" is just fancy pattern-matching—models flop on stuff like To...
linkedin.com - Research
AIM2025
sites.google.com - Research
[2504.06435] Human Trust in AI Search: A Large-Scale Experiment
arxiv.org - Research
[2505.23802] MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks
arxiv.org - Research
The “multiple exposure effect” (MEE): How multiple exposures to similarly biased online content can cause increasingl...
journals.plos.org - Research
Voyager: An Open-Ended Embodied Agent with Large Language Models
arxiv.org - Research
TabM: Advancing Tabular Deep Learning with Parameter-Efficient Ensembling
arxiv.org - Research
LLM Evaluation research (look at the references)
arxiv.org - Research
[2506.08945] Who is using AI to code? Global diffusion and impact of generative AI
arxiv.org - Research
The Directory for Liquid ContentA scalable and modular taxonomy designed to map, describe, and standardize how digita...
liquidcontent.xyz - Research
How to inoculate AI models against misinformation | Sander van der Linden posted on the topic | LinkedIn
linkedin.com - Research
Beyond Semantics: The Unreasonable Effectiveness of Reasonless Intermediate Tokens | Dagmar Monett | 121 comments
linkedin.com - Research
World of Labour - Machine learning for causal inference in economics
wol.iza.org - Research
[2504.02234] LLM Social Simulations Are a Promising Research Method
arxiv.org - Research
ai models for scientific reasoning and potentially discovery
storage.googleapis.com - Research
How Malicious AI Swarms Can Threaten Democracy
osf.io - Research
Nikhil Garg on X: \"Interesting new paper: https://t.co/jbDdacS6q1\" / X
x.com - Research
the future of machine learning will come from new age RL methods using environmental feedb
storage.googleapis.com - Research
Sahar Abdelnabi 🕊 on X: \"Hawthorne effect describes how study participants modify their behavior if they know they ar...
x.com - Research
[2505.14617] The Hawthorne Effect in Reasoning Models: Evaluating and Steering Test Awareness
arxiv.org - Research
Zochi Publishes A* Paper
intology.ai - Research
Anne Ouyang on X: \"✨ New blog post 👀: We have some very fast AI-generated kernels generated with a simple test-time o...
x.com - Research
@here why isn't a bigger deal being made about this research does anyone know?@here why isn't a bigger deal being mad...
arxiv.org - Research
ValuesML: A new Multilingual Dataset for Values Detection in News and Political Manifestos
osf.io - Research
Algorithms for reliable decision-making need causal reasoning | Nature Computational Science
nature.com - Research
Paper2Poster
paper2poster.github.io - Research
Academic Library | Indicator
indicator.media - Research
research into modeling the half-life of a tweet based on some select empirical data and co
pnas.org - Research
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures | alphaXiv
alphaxiv.org - Research
Static network structure cannot stabilize cooperation among large language model agents | PLOS One
journals.plos.org - Research
d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning | alphaXiv
alphaxiv.org - Research
[2505.13775] Beyond Semantics: The Unreasonable Effectiveness of Reasonless Intermediate Tokens
arxiv.org - Research
[2505.13995] ELEPHANT: Measuring and understanding social sycophancy in LLMs
arxiv.org - Research
Rishi Jha on X: \"I’m stoked to share our new paper: “Harnessing the Universal Geometry of Embeddings” with @jxmnop, C...
x.com - Research
Tanishq Mathew Abraham, Ph.D. on X: \"Model Merging in Pre-training of Large Language Models \"We present the Pre-train...
x.com - Research
[2505.12546] Extracting memorized pieces of (copyrighted) books from open-weight language models
arxiv.org - Research
Robert W Malone, MD on X: \"This new peer-reviewed study shows that living close to a golf course significantly increa...
twitter.com - Research
[2402.04607] Google Scholar is manipulatable
arxiv.org - Research
[2411.13187] Engagement-Driven Content Generation with Large Language Models
arxiv.org - Research
Very very very fast counting within a certain accuracy that powers a lot of industry infra
algo.inria.fr - Research
Mapping the Institutional Pipeline for Global AI Talent | NBER
nber.org - Research
[2503.16527] LLM Generated Persona is a Promise with a Catch
arxiv.org - Research
Twitter (link)
twitter.com - Research
AI in Software Engineering at Facebook | IEEE Journals & Magazine | IEEE Xplore
ieeexplore.ieee.org - Research
[2503.16586] Big Help or Big Brother? Auditing Tracking, Profiling, and Personalization in Generative AI Assistants
arxiv.org - Research
ModelSlant.com
modelslant.com - Research
MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control
arxiv.org - Research
Neel Nanda on X: \"After supervising 20+ papers, I have highly opinionated views on writing great ML papers. When I en...
twitter.com - Research
Coordinated link sharing on Facebook | Scientific Reports
doi.org - Research
[2504.17004] (Im)possibility of Automated Hallucination Detection in Large Language Models
arxiv.org - Research
Detecting Synthetic, Doubting Authentic: AI Attribution Bias for Political Imagery
osf.io - Research
The Effect of Deactivating Facebook and Instagram on Users’ Emotional State | NBER
nber.org - Research
[2504.20879] The Leaderboard Illusion
arxiv.org - Research
[2502.02943] Behavioral Homophily in Social Media via Inverse Reinforcement Learning: A Reddit Case Study
arxiv.org - Research
[2502.07266] When More is Less: Understanding Chain-of-Thought Length in LLMs
arxiv.org - Research
content labeling and community notes research
papers.ssrn.com - Research
Could an AI Agent Become One of Your Coworkers? | by MIT IDE | MIT Initiative on the Digital Economy | Medium
medium.com - Research
Propensity Score Matching: A Guide to Causal Inference | Built In
builtin.com - Research
multiple period of treatment in DiD approaches
sciencedirect.com - Research
#misinformation #research #politicalcommunication #datascience #digitalethics #eupolicy #digitalservicesact | Anton G...
linkedin.com - Research
[2504.13859] DoYouTrustAI: A Tool to Teach Students About AI Misinformation and Prompt Engineering
arxiv.org - Research
Collaborating with AI Agents | Bugge Holm Hansen | 21 comments
linkedin.com - Research
World of Labour - Does increasing the minimum wage reduce poverty in developing countries?
wol.iza.org - Research
Andrew Gordon Wilson on X: \"Really excited about our new paper! It derives a generalization bound that predictably ge...
twitter.com - Research
Mind the (Language) Gap: Mapping the Challenges of LLM Development in Low-Resource Language Contexts | Stanford HAI
hai.stanford.edu - Research
https://andreyfradkin.com/assets/demandforllm.pdf
andreyfradkin.com - Research
\"Mi Abogado: A Study on Foster Care and Legal Aid\" | Experimental posted on the topic | LinkedIn
linkedin.com - Research
Making AI-generated code more accurate in any language | MIT News | Massachusetts Institute of Technology
news.mit.edu - Research
elvis on X: \"AgentA/B is a fully automated A/B testing framework that replaces live human traffic with large-scale LL...
twitter.com - Research
[2504.10157] SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-Worl...
arxiv.org - Research
genAI spurs passive engagement not active ones
papers.ssrn.com - Research
9 Yee Whye Teh - YouTube
youtube.com - Research
[2503.24322] NoProp: Training Neural Networks without Full Back-propagation or Full Forward-propagation
arxiv.org - Research
Genglin Liu on X: \"Excited to share my first project at UCLA! We built MOSAIC — a social network simulator where LLM-...
twitter.com - Research
LLMs can label data better than humans it turns out — for specific use casesLLMs can label data better than humans it...
journals.sagepub.com - Research
Digital media – A threat to democracy? | Max-Planck-Gesellschaft
mpg.de - Research
[2406.04236] Understanding Information Storage and Transfer in Multi-modal Large Language Models
arxiv.org - Research
Invisible Labor: The Backbone of Open Source Software
arxiv.org - Research
Kiran Garimella on X: \"Community-based fact-checking effectiveness relies heavily on sourcing. This paper shows that...
twitter.com - Research
Alexander Doria on X: \"A contrarian result I like a lot: smaller language models perform better on knowledge graphs t...
twitter.com - Research
[2501.19393] s1: Simple test-time scaling
arxiv.org - Research
How AI outperforms humans in decision-making | Abel Sanchez posted on the topic | LinkedIn
linkedin.com - Research
Tanishq Mathew Abraham, Ph.D. on X: \"Perception Encoder: The best visual embeddings are not at the output of the netw...
twitter.com - Research
Jillian Fisher on X: \"How do biased AI models effect human decision-making? 🤔 Our latest paper, “Biased AI can Influe...
twitter.com - Research
Abeer Aldayel (@Aldayelabeer@sciencemastodon.com) on X: \"📢**Persuasion takes different modes!** Instead of just askin...
twitter.com - Research
Nick Byrd, Ph.D. on X: \"Is #socialMedia bad for society? In two countries, following a couple mainstream #news accoun...
twitter.com - Research
Social Media, Ethics, and Automation — Social Media, Ethics, and Automation
social-media-ethics-automation.github.io - Research
GitHub - yuxiaw/OpenFactCheck · GitHub
github.com - Research
re-sharing to confirm whether read this? truth social and news shari
tandfonline.com - Research
Bots of a Feather: Mixing Biases in LLMs’ Opinion Dynamics | Springer Nature Link
link.springer.com - Research
Navigating the uncertainty: the impact of a student-centered final year project allocation mechanism on student perfo...
nature.com - Research
Large Language Models: A Survey with Applications in Political Science
osf.io - Research
[2503.02080] Linear Representations of Political Perspective Emerge in Large Language Models
arxiv.org - Research
TAIS RFP: Research Areas | Coefficient Giving
openphilanthropy.org - Research
Sign In | alphaXiv
alphaxiv.org - Research
[2504.03767] MCP Safety Audit: LLMs with the Model Context Protocol Allow Major Security Exploits
arxiv.org - Research
TelegramScrap: A comprehensive tool for scraping Telegram data
arxiv.org - Research
Participants listen to AI advice
sciencedirect.com - Research
mdpi.com
mdpi.com - Research
Wild stuff >20sec videos… up to 1 min with a 5B model????
arxiv.org - Research
Now we have a misinformation test for people!
sciencedirect.com - Research
How OpenAI's GPT-4 generates images with Transfusion | Max Buckley posted on the topic | LinkedIn
linkedin.com - Research
Better Feeds: Algorithms That Put People First – Knight-Georgetown Institute
kgi.georgetown.edu - Research
[1908.08313] Auditing Radicalization Pathways on YouTube
arxiv.org - Research
Elicit: AI for scientific research
elicit.com - Research
Perplexity
perplexity.ai - Research
A global comparison of social media bot and human characteristics | Scientific Reports
nature.com - Research
very much worth reading the way they manually research influence operations as we are seek
jns.scholar.princeton.edu - Research
[2502.16280] Human Preferences in Large Language Model Latent Space: A Technical Analysis on the Reliability of Synth...
arxiv.org - Research
[2503.21934] Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad
arxiv.org - Research
How a PhD student’s lab size affects their chance of future academic success
nature.com - Research
Introducing Ai2 Paper Finder | Ai2
allenai.org - Research
foreign interference tactics specifically from a media and comms standpoint that would be
eeas.europa.eu - Research
dr. jack morris on X: \"# A new type of information theory this paper is not super well-known but has changed my opini...
twitter.com - Research
AgentRxiv
agentrxiv.github.io - Research
AI Agents, Digital Twins, and the New Way to Manage Operations
business.columbia.edu - Research
The Cybernetic Teammate - by Ethan Mollick
oneusefulthing.org - Research
Chatbots as social companions
academic.oup.com - Research
Our paper we can learn a lot about effective framing from what they’ve done in their workOur paper we can learn a lot...
dl.acm.org - Research
[2503.05336] Toward an Evaluation Science for Generative AI Systems
arxiv.org - Research
GitHub - internetarchive/newsum: Daily TV News Summary using GPT · GitHub
github.com - Research
Inductive reasoning in minds and machines - PubMedInduction-the ability to generalize from existing knowledge-is the...
pubmed.ncbi.nlm.nih.gov - Research
[2503.02886] Exploring Political Ads on News and Media Websites During the 2024 U.S. Elections
arxiv.org - Research
When Incentives Backfire, Data Stops Being Human
arxiv.org - Research
arXiv · 2501.11433
arxiv.org - Research
Culturally Yours | Understanding cultural references in text
mbzuai.ac.ae - Research
[2501.09102] Tracking the Takes and Trajectories of English-Language News Narratives across Trustworthy and Worrisome...
arxiv.org - Research
[2503.01532] Unmasking Implicit Bias: Evaluating Persona-Prompted LLM Responses in Power-Disparate Social Scenarios
arxiv.org - Research
Detecting misbehavior in frontier reasoning models | OpenAI
openai.com - Research
Decentralized Society: Finding Web3's Soul by Puja Ohlhaver, E. Glen Weyl, Vitalik Buterin
papers.ssrn.com - Research
Red teaming ChatGPT in medicine to yield real-world insights on model behavior | npj Digital Medicine
nature.com - Research
[2503.02250] AI Automatons: AI Systems Intended to Imitate Humans
arxiv.org - Research
Optimizing language models for human preferences should be viewed as a causal problemOptimizing language models for h...
arxiv.org - Research
UCSD SMS Analytics Research Project
sms-analytics.sysnet.ucsd.edu - Research
Having an advisor with a strong publication record + past students who have been successfu
nber.org - Research
fbarchive.org
fbarchive.org - Research
Why Economists Should Conduct Field Experiments and 14 Tips for Pulling One Off
ideas.repec.org - Research
[2410.23506] The Belief State Transformer
arxiv.org - Research
[2411.10109] LLM Agents Grounded in Self-Reports Enable General-Purpose Simulation of Individuals
arxiv.org - Research
well structured paper conveying how to write clearly
questromworld.bu.edu - Research
Data & Society — Data Voids
datasociety.net - Research
Chain of Agents: Large language models collaborating on long-context tasks
research.google - Research
[2502.11264] Strategic Wealth Accumulation Under Transformative AI Expectations
arxiv.org - Research
The Economist: Generative AI and inequality | Rem Koning posted on the topic | LinkedIn
linkedin.com - Research
[2502.14143] Multi-Agent Risks from Advanced AI
arxiv.org - Research
factors that cause people to believe in misinformation
pnas.org - Research
Breakdown of the foundations of GenAI, applications, and lessons to learn about governance
papers.ssrn.com - Research
Engagement-based algorithms disrupt human social norm learning
osf.io - Research
GitHub - om-ai-lab/VLM-R1: Solve Visual Understanding with Reinforced VLMs · GitHub
github.com - Research
FUTURE-AI: international consensus guideline for trustworthy and de
bmj.com - Research
Safer Internet Day 2025: Staying Ahead and keeping the ecosystem safe
blog.google - Research
arXiv · 2502.06807
arxiv.org - Research
[2412.17847] Bridging the Data Provenance Gap Across Text, Speech and Video
arxiv.org - Research
Osf (link)
osf.io - Research
tracker?hashtag=%23YCPAntham WhatsApp trend tracker by Princeton’s digital wellness labtracker?hashtag=%23YCPAntham W...
digitalwitnesslab.org - Research
[2502.00873] Language Models Use Trigonometry to Do Addition
arxiv.org - Research
[2501.18649] Fake News Detection After LLM Laundering: Measurement and Explanation
arxiv.org - Research
[2501.18438] o3-mini vs DeepSeek-R1: Which One is Safer?
arxiv.org - Research
Been Kim on X: \"@karpathy We taught some superhuman chess moves from AlphaZero to Grandmasters some time ago (https:/...
twitter.com - Research
#mbzuai #llm #llm360 #ai | MBZUAI (Mohamed bin Zayed University of Artificial Intelligence)
linkedin.com - Research
Jiayi Pan on X: \"We reproduced DeepSeek R1-Zero in the CountDown game, and it just works Through RL, the 3B base LM d...
twitter.com - Research
🤖 How do chatbots respond to political questions? Large language models (LLMs) are reshaping our information environm...
linkedin.com - Research
Junxian He on X: \"We replicated the DeepSeek-R1-Zero and DeepSeek-R1 training on 7B model with only 8K examples, the...
twitter.com - Research
A lot of TikTok influencers are shifting to xiaohongshu or RED the Chinese platform in an
tandfonline.com - Research
Introducing “DomainDemo: a dataset of domain-sharing activities among different demographic groups on Twitter.” Today...
linkedin.com - Research
4.5 Million (Suspected) Fake \\faStar Stars in GitHub: A Growing Spiral of Popularity Contests, Scams, and Malware
arxiv.org - Research
Artificial Societies
societies.io - Research
We made an AI simulation of 1000 real VC investors and how they interact with each other on LinkedIn. Why simulate a...
linkedin.com - Research
joyojeet pal on X: \"Our latest work on politicians engaging YouTubers for outreach Summary: 1. Influencers routinely...
twitter.com - Research
[1004.4704] Homophily and Contagion Are Generically Confounded in Observational Social Network Studies
arxiv.org - Research
GenAI can harm learning
papers.ssrn.com - Research
someone built nice multiagent simulators
oasis.camel-ai.org - Research
[2407.00215] LLM Critics Help Catch LLM Bugs
arxiv.org - Research
Human study on AI spear phishing campaigns — LessWrong
lesswrong.com - Research
Evaluating the effect of viral posts on social media engagement | Scientific Reports
nature.com - Data
In collaboration with Nature, I investigated the impact of the Trump administration on US science one year after its...
linkedin.com - Data
Datawrapper: Create charts, maps, and tables
datawrapper.de - Research
US science after a year of Trump: what has been lost and what remains
nature.com - Data
Show HN: Self-host Reddit – 2.38B posts, works offline, yours forever | Hacker News
news.ycombinator.com - Data
distil-labs/distil-qwen3-4b-text2sql · Hugging Face
huggingface.co - Data
facebook/research-plan-gen · Datasets at Hugging Face
huggingface.co - Data
World News API: Pricing
worldnewsapi.com - Data
Searchable.City
searchable.city - Data
Exa on X: \"Introducing state-of-the-art People Search: You can now semantically search over 1 billion people using a...
x.com - Data
Data Types - Platform Data Guide | Show Me The Data
show-me-the-data.com - Data
Introduction - Platform Data Guide | Show Me The Data
show-me-the-data.com - Data
Archive on X: \"Bot made $900 -> $208k in 3 months on polymarket one of the most talked about bots on polymarket ri...
x.com - Research
Mapping the online manipulation economy
science.org - Research
I would look at the plots in this supplementary file for the science paper mapping online
science.org - Data
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool
paperzilla.ai - Data
Media Cloud
mediacloud.org - Research
[2510.23645] Global YouTube Trending Dataset (2022-2025): Three Years of Platform-Curated, Cross-National Trends in D...
arxiv.org - Data
Donating Your Social Media - UCSF Library
library.ucsf.edu - Research
lots of fact-check datasets here
dl.acm.org - Research
[2404.07340] RIP Twitter API: A eulogy to its vast research contributions
arxiv.org - Data
AVAILABLE DATASETS – Mobilize Center
mobilize.stanford.edu - Data
Postscope - Twitter/X Visualization Tool
postscope.pages.dev - Data
Webchiver: Build Your Own Personal Web Archive
webchiver.com - Data
GitHub - sherlock-project/sherlock: Hunt down social media accounts by username across social networks · GitHub
github.com - Data
SnapStream ✂️ on X: \"SnapStream lets you see which networks are taking events like the President speaking live &...
x.com - Research
a good paper showing evolution of sharing from one subreddit to another, across multiple d
arxiv.org - Data
Talk to the City
talktothe.city - Data
DSA: Risk Assessment & Audit Database | Alexander Hohlfeld
linkedin.com - Data
Oreocide on X: \"@captgouda24 This page may be of interest to you https://t.co/qV6Ftnojqj\" / X
x.com - Data
Nicholas Decker on X: \"I’ve started an ongoing project to collect all the datasets which economists can use, all in o...
x.com - Data
Releases · ArthurHeitmann/arctic_shift · GitHub
github.com - Data
Reddit subreddits metadata, rules and wikis 2025-01 - Academic Torrents
academictorrents.com - Data
Reddit - Please wait for verification
reddit.com - Data
Using the LessWrong API to query for events — LessWrong
lesswrong.com - Data
GitHub - HackerNews/API: Documentation and Samples for the Official HN API · GitHub
github.com - Data
Live Trade BenchLive evaluation of trading agents
trade-bench.live - Data
BAAI on X: \"We're releasing InfoSeek, a dataset that trained a 3B model to rival Gemini/Sonnet 4.0 on deep research t...
x.com - Data
gnews · PyPI
pypi.org - Data
GitHub - kharrigian/mental-health-datasets: An evolving list of electronic media data sets used to model mental-healt...
github.com - Data
An Emerging Lobby: An Analysis of Campaign Contributions from Indian-Americans, 1998-2022 – Joyojeet Pal
joyojeet.people.si.umich.edu - Data
REALLY COOL DATASET that we should immediately integrate into arbiter for the agent to sea
dataverse.harvard.edu - Data
OpenDataLab å¼é¢AIå¤§æ¨¡åæ¶ä»£ç弿¾æ°æ®å¹³å°
opendatalab.com - Data
Pratyush Maini on X: \"1/Pretraining is hitting a data wall; scaling raw web data alone leads to diminishing returns....
x.com - Data
Can we check if this data comprising 117 million posts is publicly accessible? It would be
academic.oup.com - Data
Substack API repo reaches 100 GitHub stars, thanks to community feedback | Nick Hagar posted on the topic | LinkedIn
linkedin.com - Data
Introducing FineWeb2: A 20TB multilingual dataset | Thomas Wolf posted on the topic | LinkedIn
linkedin.com - Data
GitHub - iptv-org/iptv: Collection of publicly available IPTV channels from all over the world · GitHub
github.com - Data
Multi-Token Attention | Research - AI at Meta
share.google - Data
share.google
share.google - Research
[2404.11988] The Emerging Generative Artificial Intelligence Divide in the United States
arxiv.org - Research
[2504.06318] The Schwurbelarchiv: a German Language Telegram dataset for the Study of Conspiracy Theories
arxiv.org - Data
Discord Fetch – discord_fetch
hamelsmu.github.io - Data
Launch YC: Clado: Deep Research for People | Y Combinator
ycombinator.com - Data
Reducto: AI document parsing & extraction software
reducto.ai - Data
GitHub - stanford-futuredata/ColBERT: ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'2...
github.com - Data
A great dataset: 560 podcast episodes, 300+ hours of content, and full transcripts from a covert Russian influence op...
linkedin.com - Data
Zack Kanter on X: \"It unfortunately seems that 37signals spent the last two and a half years on a Manhattan project t...
twitter.com - Data
Distill — Latest articles about machine learning
distill.pub - Data
On the Biology of a Large Language Model
transformer-circuits.pub - Data
Can Large Language Models Explain Their Internal Mechanisms?
pair.withgoogle.com - Data
Here Are All The ‘Bro’ Podcast Episodes With Trump
forbes.com - Data
Instagram Statistics Marketers Should Know in 2025 [Updated] | Sprout Social
sproutsocial.com - Data
Digital 2025: Global Overview Report — DataReportal – Global Digital Insights
datareportal.com - Data
platformabuse.org
platformabuse.org - Data
Cognition on X: \"Project DeepWiki Up-to-date documentation you can talk to, for every repo in the world. Think Deep R...
twitter.com - Data
Airtable - Social Media Monitoring Products Repository
airtable.com - Data
About - Sourcebase
sourcebase.ai - Data
OSINT +500 Tools - Start.me
start.me - Data
GitHub - cassidoo/scrapers: A list of scrapers from around the web. · GitHub
github.com - Data
Google Sheets now has AI for formulas | Simon Taylor posted on the topic | LinkedIn
linkedin.com - Data
Yushe on X: \"4chan just got hacked hard. The person who hacked them claimed they dumped the entire database. https://...
twitter.com - Data
I’ve built the Perplexity of the DarkWeb! Let me explain 👇 First, if you've been living in a cave, Perplexity is a se...
linkedin.com - Data
Pitch Decks That Helped Hot Startups Raise Millions - Business Insider
businessinsider.com - Data
[Interview] Mark Ledwich - Algorithmic Extremism: Examining YouTube's Rabbit Hole of Radicalization - YouTube
youtube.com - Data
Recfluence
recfluence.net - Data
John B. Holbein on X: \"Wow! This project looks amazing. In it, three scientists at Columbia, Michigan, and Maryland i...
twitter.com - Data
Home - Nielsen Kilts Datasets - Research Guides at New York University
guides.nyu.edu - Data
TVNewser
adweek.com - Data
Worldwide â X (Twitter) trending topics and hashtags today | trends24.in
trends24.in - Data
Ad Library
facebook.com - Data
Platform
openmeasures.io - Data
Trending narratives on social: ‘Deport all Muslims,’ Tesla fires are “terrorism,” Biden stranded the astronauts, Step...
mailchi.mp - Data
really love the data viz
thcostello.com - Research
[2303.05345] TGDataset: Collecting and Exploring the Largest Telegram Channels Dataset
arxiv.org - Data
Discord
discord.com - Data
The Top 100 Gen AI Consumer Apps - 4th Edition | Andreessen Horowitz
a16z.com - Data
javascript - Modifying the positions of streams in the D3 stream graph - Stack Overflow
stackoverflow.com - Data
Dataset - Democratic Erosion
democratic-erosion.org - Data
The GDELT Project
gdeltproject.org - Data
Reports – National AI Opinion Monitor
naiom.net - Data
DeepSeek R1 Distill Llama 70B offers a more accessible and fast way of accessing reasoning capabilities than the full...
linkedin.com - Data
Cloudflare
workers.cloudflare.com - Data
GitHub - BloombergGraphics/2025-youtube-podcast-men-for-trump: Data from the Bloomberg News analysis on streamers and...
github.com
Browse by topic.
CALL FOR READINGS
Have a paper or report we should add?
The library is curated by the team but suggestions are welcome. Send a one-line note via the contact form or DM us on LinkedIn and Twitter.
