9 ways researchers actually keep up with arXiv in 2026

What the workflow looks like for people who keep it running past six months: nine approaches, who each one fits, and the tradeoffs that have burned researchers we have talked to.

The reading-load problem is older than arXiv. It got worse in 2018 when transformer-era output volume started compounding, and worse again post-2022 when LLM tooling lowered the cost of writing a paper. The volume on cs.LG alone in June 2025 was roughly 200 new submissions per day. Nobody reads 200 papers a day. The question is how the people who stay current actually do it.

We built DIGEST after watching the same workflow collapse hit several researchers we know. Their first attempt was always "I will read everything in my subfield". Six weeks later they were buried, then guilty about being buried, then reading nothing. The researchers who stayed current past a year did one of the nine things below. Most combine two or three.

This is the survey, not the sales pitch. DIGEST fits in approach #4, and we will say so when we get there. The other eight are also valid; pick the one that survives your actual week.

1. Author-whitelist RSS feeds

The oldest approach and still the most surgical. You pick 30-80 authors whose work you want to track and subscribe to RSS feeds keyed to their arXiv submissions. Tools: arXiv's built-in author RSS, Feedly with custom feeds, or arxiv-sanity-lite.
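A minimal sketch of the whitelist step, assuming you already have a feed in Atom format as a string. The feed content, author names, and the `whitelisted_titles` helper are all made up for illustration; a real feed may be structured differently, and real author names usually need normalizing before they match.

```python
import xml.etree.ElementTree as ET

ATOM = "{http://www.w3.org/2005/Atom}"

# Hypothetical whitelist; in practice this would hold your 30-80 names.
WHITELIST = {"Ada Example", "Grace Sample"}

# Made-up Atom content standing in for a feed you have already fetched.
SAMPLE_FEED = """<feed xmlns="http://www.w3.org/2005/Atom">
  <entry>
    <title>Paper A</title>
    <author><name>Ada Example</name></author>
    <author><name>Someone Else</name></author>
  </entry>
  <entry>
    <title>Paper B</title>
    <author><name>Nobody Tracked</name></author>
  </entry>
</feed>"""


def whitelisted_titles(feed_xml, whitelist):
    """Return titles of entries that have at least one whitelisted author."""
    root = ET.fromstring(feed_xml)
    hits = []
    for entry in root.iter(ATOM + "entry"):
        authors = {n.text.strip() for n in entry.iter(ATOM + "name") if n.text}
        if authors & whitelist:
            hits.append(entry.findtext(ATOM + "title", default="").strip())
    return hits


print(whitelisted_titles(SAMPLE_FEED, WHITELIST))
```

Exact string matching is the weak point here: initials, middle names, and diacritics will all cause misses unless you normalize names first.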

Who it fits: Researchers deep in a specific area where the productive authors are a known small set. Postdocs in subfields with under 500 active researchers globally. Anyone who can name their "field's 50".

Tradeoff: Bias compounds. You will miss the surprise paper from a first-year PhD or a lab you never heard of. The "Attention is All You Need" (arXiv:1706.03762) paper came from a Google team, not from the NLP authors most pre-transformer NLP people were tracking. Author-whitelist subscribers in 2017 caught it late.

Maintenance: Quarterly prune. Add 2-3 new authors per quarter, drop 2-3 whose recent work is off your interest line. Without pruning the feed bloats and stops being useful inside a year.

2. arXiv category RSS, filtered by keyword

You subscribe to a full arXiv category (cs.CL, cs.CV, q-bio.QM) and run a local filter on titles and abstracts. Filter rules can be regex, BM25 over a keyword list, or a small embedding-similarity model against papers you have already starred.
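The simplest of those filter rules, a regex over title and abstract, might look like the sketch below. The keyword list, the paper dictionaries, and the `matches` helper are hypothetical; BM25 or an embedding model would replace the `matches` function but keep the same overall shape.

```python
import re

# Hypothetical interest keywords; tune these to your own subfield.
KEYWORDS = ["retrieval", "interpretab", "scaling law"]
PATTERN = re.compile("|".join(re.escape(k) for k in KEYWORDS), re.IGNORECASE)


def matches(paper):
    """True if any keyword appears in the title or abstract."""
    text = paper["title"] + " " + paper["abstract"]
    return PATTERN.search(text) is not None


# Made-up entries standing in for one day of a category feed.
papers = [
    {"title": "Scaling Laws for Sparse Models", "abstract": "We study compute."},
    {"title": "A Note on Graph Coloring", "abstract": "Bounds for planar graphs."},
]
kept = [p["title"] for p in papers if matches(p)]
print(kept)
```

Using stems such as "interpretab" rather than full words catches both "interpretable" and "interpretability" at the cost of the occasional false positive.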

Who it fits: Researchers covering a category where the productive subset is broader than 80 authors. ML engineers tracking applied research across multiple labs. Anyone whose interest is keyword-shaped ("retrieval", "interpretability", "scaling laws") rather than author-shaped.

Tradeoff: Keyword filters fail on the papers most worth catching. Novel work uses new vocabulary by definition. The first 6 months of "retrieval-augmented generation" work pre-dated the term. Even researchers filtering for that line of work missed those papers because their filter was keyed to the wrong word.

Maintenance: The keyword list ages every quarter. The mitigation is mixing keyword-match with citation-graph signal: if a paper is cited by something you already starred, surface it regardless of keyword match.
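The mixed rule can be sketched as a single predicate. This assumes you have some source of citation data shaped as a mapping from a paper id to the ids of papers that cite it; that `cited_by` mapping, the ids, and the `should_surface` helper are all hypothetical.

```python
def should_surface(paper, keywords, starred_ids, cited_by):
    """Surface a paper on a keyword match OR if a starred paper cites it."""
    text = (paper["title"] + " " + paper["abstract"]).lower()
    keyword_hit = any(k.lower() in text for k in keywords)
    # cited_by maps a paper id to the set of ids of papers that cite it.
    citation_hit = bool(cited_by.get(paper["id"], set()) & starred_ids)
    return keyword_hit or citation_hit


# Made-up data: one starred paper ("s1") that cites "p2".
keywords = ["retrieval"]
starred = {"s1"}
cited_by = {"p2": {"s1", "x9"}}

p1 = {"id": "p1", "title": "Dense retrieval at scale", "abstract": ""}
p2 = {"id": "p2", "title": "Grounding generation in documents", "abstract": ""}
p3 = {"id": "p3", "title": "Planar graph bounds", "abstract": ""}

for p in (p1, p2, p3):
    print(p["id"], should_surface(p, keywords, starred, cited_by))
```

Here `p2` has no keyword match and is surfaced only through the citation link, which is the case the keyword list alone would miss.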

3. Hand-curated editorial newsletters

TLDR AI (5 papers/day, editorial picks), The Batch (Andrew Ng's weekly, slower cadence), Import AI (Jack Clark, weekly long-form), Ben's Bites (consumer-AI-leaning, daily), Alpha Signal (auto-summarized blend). They share one structural property: a human or small team picks what matters.

Who it fits: Industry practitioners who want breadth, not depth. PhD students in their first year mapping the landscape. Anyone whose primary interest is "what is the field doing this week" rather than "what is happening in my exact subfield".

Tradeoff: Editorial curation skews toward big labs and big news. If your subfield is small or your interest is technical-detail-shaped, hand-curated newsletters will under-serve you. The papers that compound most for a niche researcher are usually NOT the papers the editorial newsletters cover.

Maintenance: None. Subscription does the work.

4. Auto-summarized arXiv digests, configured per reader

Pick the arXiv categories you care about, pick a reading style (we use "Quick Scan" for triage and "Researcher" for depth), get a tailored daily or weekly email. Alpha Signal does this. We built DIGEST to do it.

(Quick disclosure: we are the team behind DIGEST. We are biased. The comparison we are about to make is honest anyway and we think it holds.)

Who it fits: Researchers and practitioners whose interests do not match what editorial newsletters cover. Anyone who reads multiple subfields and wants per-category cadence. PhD students who need adjustable depth as their subfield knowledge grows.

Tradeoff: Auto-summarization is only as good as its source filtering and its summary quality. The summarization quality bar moves every 6 months as base models improve. The honest evaluation is to subscribe to one for 3 weeks and audit how often the summary misses the actual contribution of the paper. We do that audit internally; researchers should do it externally.
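One way to make that audit concrete is to log a yes/no judgment per summary and compute the miss rate at the end of the three weeks. The log format and the `miss_rate` helper below are assumptions for illustration, not a description of how we or anyone else runs the audit.

```python
def miss_rate(audit):
    """Fraction of audited summaries that missed the paper's contribution."""
    if not audit:
        raise ValueError("audit log is empty")
    misses = sum(1 for entry in audit if not entry["captured_contribution"])
    return misses / len(audit)


# Made-up audit log: one entry per summary you checked against its paper.
audit_log = [
    {"paper": "paper-1", "captured_contribution": True},
    {"paper": "paper-2", "captured_contribution": False},
    {"paper": "paper-3", "captured_contribution": True},
    {"paper": "paper-4", "captured_contribution": True},
]
print(f"miss rate: {miss_rate(audit_log):.0%}")
```

What counts as an acceptable miss rate is your call; the point of logging it is to compare services, or the same service over time, on the same footing.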

What DIGEST does specifically: 5 fixed reader profiles (Student / Researcher / Industry Pro / Curious Adult / Quick Scan), all arXiv categories supported, daily or weekly cadence, Resend-delivered email. Free tier is 1 recipe daily. Pro is €5/mo for unlimited recipes plus cross-references between papers in the same digest. We are not the only auto-summarizer in this category. Alpha Signal is the obvious comparison and works well for ML-only.

Maintenance: Recipe edit when your interest shifts. Profile change as your subfield knowledge deepens.

5. arXiv-sanity (and similar embedding-recommender tools)

Andrej Karpathy's arxiv-sanity and its descendants build embeddings over arXiv abstracts and let you train a recommender on papers you have rated. The result is a personalized feed that keeps surfacing surprises longer than keyword filtering does.
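To show the shape of the idea, here is a toy recommender. It is not how arxiv-sanity works internally: it swaps in plain bag-of-words counts for learned embeddings and a similarity average for a trained model, and every name and abstract in it is made up.

```python
import math
import re
from collections import Counter


def vectorize(text):
    """Bag-of-words term counts; a crude stand-in for a learned embedding."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def mean_similarity(vec, texts):
    """Average similarity of vec to a list of labeled abstracts."""
    if not texts:
        return 0.0
    return sum(cosine(vec, vectorize(t)) for t in texts) / len(texts)


def score(candidate, liked, disliked):
    """Similarity to thumbs-up abstracts minus similarity to thumbs-down ones."""
    vec = vectorize(candidate)
    return mean_similarity(vec, liked) - mean_similarity(vec, disliked)


# Made-up labels and candidates.
liked = ["sparse mixture of experts routing"]
disliked = ["image segmentation benchmark"]
candidates = [
    "routing tokens in mixture of experts models",
    "a new segmentation benchmark for medical image data",
]
ranked = sorted(candidates, key=lambda c: score(c, liked, disliked), reverse=True)
print(ranked[0])
```

Even in this toy form the cold-start problem is visible: with a single liked abstract, anything sharing none of its words scores zero or below.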

Who it fits: Researchers with a clear "I like this kind of paper" intuition that does not translate cleanly to keywords. Anyone willing to invest 15-20 minutes a week labeling thumbs-up / thumbs-down to keep the recommender calibrated.

Tradeoff: Recommender quality degrades if you stop labeling. Cold-start is brutal for niche subfields with few labeled positives. Embedding models age; a recommender trained in 2022 underweights post-2023 vocabulary unless the embeddings get re-rolled.

Maintenance: 15-20 min/week labeling. Re-roll embeddings whenever the underlying model changes (roughly yearly).

6. Twitter / X / Bluesky firehose

Following the active researchers in your area on social. The signal is what they retweet and quote-tweet, not what they post directly. Yannic Kilcher, Sebastian Raschka, Aran Komatsuzaki, and similar accounts function as decentralized editorial newsletters.

Who it fits: Researchers who already have a social presence in their area and want light maintenance. People who like the discussion-thread aspect (where the comments on a paper sometimes carry more signal than the paper).

Tradeoff: Highest hit rate on currency, lowest on signal-to-noise. The same firehose that surfaces real work surfaces hype-cycle reactions to that work. Twitter's algorithmic ranking actively hides papers if your engagement skews toward replies and screenshots. Bluesky is friendlier but has thinner ML coverage as of mid-2026.

Maintenance: Curate the follow list once a quarter. Mute the consistently-hype accounts. Read the firehose in batches, not as a feed.

7. Semantic Scholar + Connected Papers (citation-graph navigation)

When you find a paper you care about, you do not need a daily feed — you need a graph. Semantic Scholar's "Influenced by" and "Influences" links and Connected Papers' graph visualization let you walk from one paper to the related cluster in 10 minutes.

Who it fits: Researchers doing focused work where the question is "what came before / after this paper" rather than "what is new today". Literature review work. PhD students mapping the citation neighborhood of their thesis topic.

Tradeoff: Citation-graph navigation is reactive, not proactive. You only reach papers connected to one that something else already surfaced for you. Useful as a depth tool, not a discovery tool.

Maintenance: None. Use per project.

8. Lab newsletters, university group mailing lists, and Discord communities

The under-noticed channel. Most active research labs maintain an internal weekly that summarizes 5-15 papers the lab found interesting. Anthropic, OpenAI, FAIR, DeepMind have these for internal use; smaller labs at universities often share them publicly. Discord servers like Eleuther, LMSys, and most major-conference servers carry similar weekly readouts in dedicated channels.

Who it fits: Researchers whose interests overlap a known lab's research direction. ML engineers who want practical implementation discussion alongside paper recommendations. Anyone who values the social discussion thread.

Tradeoff: Discoverability. Most of these channels are not advertised. You find them through lab websites, conference Discord links, and individual researcher Twitter bios. Maintenance per channel is low; cumulative maintenance across 5+ channels is high.

Maintenance: Audit which Discords you actually read each quarter. Unjoin the ones where you have not opened a message in a month.

9. Conference paper sweeps (the seasonal pattern)

Skip the daily / weekly cadence entirely. Read whatever you can in the 2-3 weeks before and after major conferences (NeurIPS, ICML, ICLR, ACL, CVPR, EMNLP) when the dump of accepted papers happens. Use Papers With Code's conference-specific pages, OpenReview filters, and conference-track session lists.

Who it fits: Researchers in steady-state subfields where the velocity is real but bounded. Industry researchers who only need "what is the field doing" 4 times a year. People with cyclical project schedules that align with conference cycles.

Tradeoff: Latency. A paper from January gets your attention in May. For most subfields this is fine; for fast-moving ones (mid-2024 reasoning-model work, late-2025 inference-time-compute work) it is too slow.

Maintenance: Calendar-based. The conferences are the maintenance trigger.

How to pick

The researchers we have talked to who maintained their workflow past a year ran a combination of these, usually two or three, rather than just one.

There is no single right answer. There is an answer that survives your week and an answer that doesn't. The test is "after 6 months, is the workflow still running" — most workflows fail not because they are wrong but because they are too expensive to maintain.

Where DIGEST fits (the honest read)

We built DIGEST because approach #4 was the gap. Editorial newsletters (#3) were too broad for researchers we know who track 4 arXiv categories that don't overlap the editorial mainstream. arxiv-sanity (#5) requires labeling that most people stop doing in week 3. The auto-summarized space was thin: Alpha Signal works well but is ML-centric; nothing else covered the full arXiv breadth with per-reader configuration.

DIGEST is one option for #4. Try it if you are in the "auto-summarized, per-reader" cohort. The free tier is enough to evaluate (1 recipe, daily, 1 category). If you are in any of the other 8 cohorts, the other 8 approaches probably fit better.

If you want to see what a Quick Scan summary actually looks like and how cross-references work in the Pro tier, the how-to-use page walks through a live example with three Mixtral / Mistral / Switch-Transformer papers.

Tool roster + citations verified as of June 2026. Updated quarterly.
