Every link we've shared in our monthly newsletter, in one place. Policy briefs, datasets, technical write-ups, news features, tutorials, papers. Use the search box, filter by category, or stack multiple tags to narrow further.
Action-oriented taxonomy of digital-safety interventions across focus, scope, driver, user journey.
Cross-community bridging metric on Telegram disinformation networks. SimPPL paper.
MIT News write-up of Sakhi at Delta V Demo Day.
Inaugural India RCC cohort announcement; SimPPL was the first awardee.
Three-minute interview ahead of Meta's Bangladesh CIB takedown.
Tech Global Institute / SimPPL Info Lab report on Bangladesh election Pages and Groups.
SimPPL position paper. Multiagent + LLM-driven simulation as the public-research bridge.
Part 2: how information disseminates through coordinated structures.
Part 1 of Jhagrut Lalwani's two-part SimPPL blog on coordinated behaviour.
Multiagent Reddit simulation quantifying CIB's effect on recommender outputs.
Ransomware crew '8BASE' announcing a UK target on X, illustrating how threat actors broadcast on X for OSINT monitors to pick up.
Curated index of X accounts run by ransomware groups and cybercriminals — a seed list for OSINT cyber-threat monitoring.
Mollick highlights a study on 280K academic talk transcripts showing speakers increasingly use ChatGPT-favored words — model collapse for humans.
TigerData open-sourced pg_textsearch, bringing native BM25 ranking to Postgres so teams can drop their separate Elasticsearch clusters.
Elasticsearch reference on Reciprocal Rank Fusion — a tuning-free method for combining multiple retrievers into one ranked result set.
Origon's enterprise-AI-agent platform: agents trained on customer data and deployed to private cloud or on-prem for regulated industries.
Mixedbread serves 1B+ docs with multi-vector and multimodal search at sub-50ms p50, with a production engineering write-up forthcoming.
SimpleMem: efficient open-source lifelong-memory implementation for LLM agents across text and multimodal inputs.
Sprites: hardware-isolated persistent Linux sandboxes for running AI-agent code, with checkpoint and restore.
Packy McCormick co-essay with Standard Bots' Evan Beard on cumulative small advances in robotics adding up to a giant leap.
Paste any GitHub URL into CodeFlow and get an interactive architecture map showing how files connect — browser-only, no install.
json-render: a generative-UI framework that turns LLM JSON output into safe predefined components and actions.
Deedy Das post claiming Claude Code completed in 30 minutes a side project that took him two weeks in 2025.
Scholarly-ideas: an LLM-driven research-puzzle development tool that grounds academic puzzles in real empirical anomalies.
Embeddings + k-means + GPT-4o labeling applied to 6,912 of @deedydas's tweets to surface 25 thematic clusters.
Quadrillion Labs unveils Qualia, an agentic data-scientist product, in a teaser for its private beta.
E2B's enterprise AI-agent cloud, used by Perplexity, Hugging Face, Manus, Groq, and Lindy for sandboxed agent execution.
Open-source pipeline that turns unstructured text documents into LLM-generated knowledge graphs.
Jay Van Bavel promotes RageCheck, a free tool that flags emotionally manipulative framing patterns in online content.
Interactive walk-through of neural-network loss landscape geometry: convexity, saddle points, sharp vs flat minima.
Anthropic engineer Adam Jones on cutting MCP tool-token bloat by having agents write code to call tools rather than register every tool.
taws: a Rust-built terminal UI for AWS — navigate, observe, and manage resources without the console.
AI SDK 6 from Vercel ships tool-execution approval, DevTools, native MCP, reranking, and image editing for TypeScript AI apps.
Anthropic engineering case for letting agents call MCP servers via code execution, so tool definitions stay out of the context window.
Grin's docs on Merkle Mountain Ranges — an append-only alternative to Merkle trees used to store blockchain kernels and proofs.
Andy Pavlo's 2025-in-review on databases: Postgres momentum, MCP everywhere, MongoDB v FerretDB, file formats, and Turbopuffer's rise.
swyx on Turbopuffer overtaking Pinecone with $4-5M ARR contracts despite a late start, citing Pavlo's 2025 databases retrospective.
Devi Parikh: Yutori's blog on lessons building deep, broad, on-point, 24/7 agents that power Scouts — useful for anyone building agents.
ChapterPal: AI-assisted reading and note-taking app, marketing landing page.
Boris Cherny (creator of Claude Code) shares his vanilla setup and notes there's no one correct way to use the tool.
Balestriero highlights provable measures of alignment between pretraining setups and eval performance — early but promising work.
Seunghyun Seo's slides on training-horizon scaling, focusing on the role of weight decay (not just learning rate) when scaling.
LACUNA: a PPO RL agent trained to trade Polymarket 15-minute crypto markets by fusing Binance order flow with Polymarket orderbook data.
Jaya Gupta argues 'context graphs' that capture decision traces (not just data) are AI's next trillion-dollar opportunity.
Animesh Koratana on how to actually build a context graph — modeling decision traces is structurally hard, not 'add memory to your agent'.
Practical guide: when to use LLM-as-judge, with five rules (reference-based, debiasing, ensembling, reasoning before scoring, calibration).
Lab notes from Epistemic Garden on deriving narrative strands from quote-tweets + semantic search across the postrat community archive.
ElevenLabs becomes an official WhatsApp technology provider, deploying human-like voice agents inside WhatsApp business chats.
A hardware-aware guide to data structures for systems engineers, shared by Miguel Pastor.
Karan Lokchandani shares an open LLM-interview-questions PDF — handy prep for ML hiring loops.
Google Gemini API rate-limit reference: RPM, TPM, RPD across tiers, model-specific caps, and project-level scoping.
Google AI Studio launches the Interactions API, a unified foundation for building Gemini-based models and agents in public beta.
Perplexity Page on a malicious VPN extension allegedly stealing ChatGPT credentials (paywalled or login-gated).
LangChain shares Thomas Taylor's tutorial on deploying stateful LangGraph agents on AWS Lambda with DynamoDB checkpointing and CDK.
Bharat Biyani's visual guide to .NET concurrency: state machine, stack vs heap, async vs parallel, the .Result deadlock trap.
Bharat Biyani's design-patterns explainer: why they exist, what problems they solve, when (and when not) to use them, interview framing.
Belinda's interpretable-interface project for Sakana's AI Scientist-v2 lets you watch every hypothesis, failed experiment, and 'aha'.
Daniel Kokotajlo's 2021 LessWrong post extrapolating AI futures year-by-year through 2026 — frequently cited AI-timeline vignette.
AI 2027: a detailed scenario forecasting superhuman AI's next-decade impact, with slowdown vs race endings and quantitative trend extrapolations.
xjdr on why training MoEs under 20B params is hard: flop efficiency, load-balancing/router stability, and data quality/quantity.
Archie Sengupta's distributed-GPU-training explainer: from a single GPU's streaming multiprocessors to slicing models across racks.
arXiv preprint on C++ design patterns for high-frequency trading systems.
Eric Zakariasson shares Cursor's internal onboarding guide for non-technical team members joining the company.
Ax: a TypeScript DSPy port — declare signatures instead of prompts, with auto-prompt-tuning, GEPA optimizer, and 15+ LLM providers.
AWS Lambda durable functions: automatic checkpointing, suspend execution up to a year, recover from failures, no infra to manage.
Clerk Billing: drop-in React components for B2C and B2B subscription billing without writing payment-integration or UI code.
Polar: usage-based billing platform for the AI era — meter tokens, API calls, compute, GPU workloads.
K-Dense ships free Claude Scientific Skills + Writer that take a dataset to a submission-ready manuscript in one day.
Thariq launches a Deep Research demo for the Claude Agent SDK: parallel agents researching a topic and synthesizing into a report.
Pedro Sant'Anna recommends refine.ink: AI tool that proof-checks academic papers for internal consistency, typos, and equation proofs.
Towards Data Science walkthrough of psmpy, a Python library for propensity-score matching (paywalled, fetch failed).
flashtensors: run 100 large models on a single GPU with minimal time-to-first-token impact via tensor swap.
GEPA featured in the OpenAI x Bain cookbook tutorial on building self-evolving agents that move beyond static prompts.
Modal: high-performance AI infrastructure with sub-second cold starts, instant autoscaling, elastic GPU access without quotas.
Kosong: Moonshot AI's open-source LLM abstraction layer for modern AI agent applications (now part of the kimi-cli monorepo).
Microsoft's Agent Framework: open-source framework for building, orchestrating, and deploying AI agents and multi-agent workflows in Python and .NET.
Google's Agent Development Kit (ADK): production-ready open-source agent framework in Python, TypeScript, Go, and Java.
Tom Cunningham compares AI-impact-on-growth forecasts from economists (0.1-1.5%/yr) vs AI experts (3-30%/yr) — large disagreement, debate why.
Supermemory: a memory + RAG API for the AI era with graph memory, content types, multi-tenancy, and SDK integrations.
Santiago: MiniMax M2 (open-source) outperforms Gemini 2.5, Claude 4.1, Qwen3 on coding/tool-use benchmarks at ~8% of Claude's cost.
Akshay highlights Microsoft's open-source Agent Lightning: train any AI agent (LangChain, AutoGen, CrewAI, etc.) with RL and prompt optimization.
Neil Hoyne summarizes NotebookLM's upgrade: 1M-token context, 6x longer memory, custom personas, deeper research.
WHO's Aleksandra Kuzmanovic on behavioural-science-informed framing for health communication on social media: a viewpoint with Meta researchers.
BFI Chicago working paper on Pangram's models for AI-generated text detection (PDF, page is binary so excerpt unparsed).
Johann Schopplich introduces TOON: Token-Oriented Object Notation, a JSON-like format that's 40-60% fewer tokens for LLMs.
Elastic Dev shows Agent Builder: define an agent in natural language and get a custom AI assistant for Elastic data.
LangChain shares Salesforce AI Research's Enterprise Deep Research, a multi-agent system on LangGraph with streaming and human steering.
AI2 olmOCR 2 scores 82.4 on olmOCR-Bench (up from 78.5), with gains across every document category.
Meilisearch: unified search and AI-retrieval platform with sub-50ms full-text, semantic, hybrid, and multi-modal search.
Elastic Labs blog: ES|QL query builder for the Python Elasticsearch client (8.19+) with familiar Python syntax.
GeoIntel: Python tool using Google's Gemini API to uncover photo locations through AI-powered geo-location analysis.
OpenAgent: open-source multimodal agentic AI for search, thinking, and general tasks (page returned empty).
ElevenLabs UI: a shadcn-based component library and registry for building multimodal voice agents faster.
Animesh Gaitonde on how BookMyShow handled 1M ColdPlay ticket requests in 10 minutes: pessimistic vs optimistic vs in-memory locking.
TwelveLabs: video-intelligence platform with 60x real-time ingest, indexing 10k hours/day; turn raw footage into searchable AI-ready data.
Aydyn Tairov highlights Meta's OpenZL data-compression framework — graph-based composition of existing algorithms, 10x speedup over zstd.
Adarsh Appaiah: Google removed the num=100 search parameter, cutting LLM-accessible long-tail results 90% and dropping Reddit's stock 15%.
Liquid AI ships LFM2-Audio, a 1.5B model handling text and audio with 10x faster inference and parity with 10x larger models.
ToolUniverse: a registry of 1,000+ scientific tools for AI Scientist agents.
Lingo.dev: localization-engineering platform that persists glossaries, brand voice, and per-locale model chains as a stateful translation API.
Parsera: agent-based and API-based scraping that turns any website into a custom dataset via natural-language prompts.
Refine.ink: AI-powered peer-review tool that flags accuracy, math, and internal-reference errors in research papers.
Turing Post on LMCache, an open-source KV-cache management layer for LLM serving — 4-10x reduction in RAG, lower TTFT, integrated with NVIDIA Dynamo.
Harrison Chase: Deep Agents now run on LangChain 1.0 using new middleware — technical deep dive on what they are and how to use them.
Tsururu (arXiv 2509.15843): a Python time-series-forecasting library focused on strategies (recursive/direct/MIMO/hybrid) and preprocessing.
Dub: modern link-attribution platform for short links, conversion tracking, and affiliate programs.
Akshay highlights MindsDB Knowledge Bases: open-source enterprise RAG over 200+ data sources with embeddings, reranking, real-time sync.
Adalat AI: India's end-to-end justice tech stack — courtroom transcription, case-lifecycle management, real-time updates.
Alexios Mantzarlis: 53 TikTok hoax videos about Charlie Kirk's assassination got 32M views in three days; TikTok took 48 down after disclosure.
Allen AI's curated list of open-source language models — links and resources from the OLMo team's NeurIPS 2024 tutorial.
Nathan Lambert flags Cursor's RL blog: collecting RL data from real users and updating checkpoints every 90-120 minutes — unthinkable a year ago.
Geoffrey Litt on JELLY: structured AI-generated UIs that first build a data schema users can edit, then compose UIs from premade widgets.
Strands Agents: AI-powered agents for modern workflows (page returned empty).
Dexter Horthy (Human Layer) on advanced context engineering for agents — spec-first, compaction strategies, subagents, planning workflows.
Sam Bhagwat (Mastra) on LongMemEval: tailored templates and targeted updates yield SOTA results on agent-memory benchmarks.
Jeff Huber (Chroma) on context engineering: filtering and compaction matter more than long-context windows for reliable agent performance.
Jake Heller on scaling CoCounsel — context-engineering lessons from building professional-grade legal AI from GPT-4 onward.
Agentuity: full-stack platform for AI agents — typed APIs, frontends, sandboxes, evals, OpenTelemetry observability, evals on live traffic.
Resemble AI's Chatterbox Multilingual: production-grade open-source zero-shot TTS in 23 languages with emotion control and watermarking.
smol AI: Kimi K2-0905 and Qwen3-Max-preview both launched as 1T-parameter open-weight models on the same day.
Neo: autonomous ML-engineer agent that automates training, fine-tuning, RAG pipeline construction, and evaluation.
Agentic Design Patterns Google Doc (login-gated; can't read content).
Sean Goedecke on system design: do the simplest thing that could possibly work, in fixing bugs, maintaining systems, and architecting new ones.
Social Forest: YouTube Data API and YouTube Scraper alternative for accessing YouTube data at scale.
CALL FOR READINGS
The library is curated by the team but suggestions are welcome. Send a one-line note via the contact form or DM us on LinkedIn and Twitter.