priorwork

Find the relevant ML papers you're missing.

Upload your draft or describe your research β€” surface the most relevant prior work across 106,800+ papers and see how each result compares.

πŸŽ‰ Free Open Beta - Sign in to save your search history
Live Example
This is what you get β€” real output from a real search

Submit a paper or write a query β€” here's exactly what priorwork.fyi returns.

πŸ“„
Example: submitted paper
Attention Is All You Need
Vaswani, Shazeer, Parmar et al. Β· NeurIPS 2017
1
Cross-paper synthesis β€” what the field looks like from your vantage point
πŸ“Š State of the Field
Primary Trends:

The dominant technical theme is the architectural evolution of sequence-to-sequence models for natural language processing, specifically focusing on overcoming the computational and representational limits of recurrent neural networks. This shared focus on machine translation benchmarks (e.g., WMT'14) and sequence transduction complexity likely caused their co-retrieval.

Confidence: high
Secondary Trends:

Modifications to self-attention to achieve linear time complexity or faster autoregressive decoding.

Papers: [R5] [R7] [R10]

Enhancements or alternatives to positional encoding and structural sequence representations.

Papers: [R2] [R8] [R9]

Methodological Spectrum:

Predominantly empirical deep learning systems papers introducing novel architectural components, complemented by formal complexity/mathematical equivalence analyses in a few cases.

πŸ”— Your Work in Context:

The submitted paper is the foundational baseline that serves as the direct anchor for the entire retrieved set. While the retrieved papers largely propose modifications to fix the submitted model's quadratic complexity, lack of recurrence, or static depth, the submitted paper itself defines the exact paradigm they seek to improve. It uniquely establishes the core multi-head self-attention mechanism that all subsequent papers in this cluster either optimize, hybridize, or replace.

2
Ranked results β€” with per-paper analysis tailored to your work
Showing 1 of 10 results
[R1] Learning Source Phrase Representations for Neural Machine Translation [PDF]
πŸ‘₯ Hongfei Xu, Josef van Genabith, Deyi Xiong, Qiuhui Liu, Jingyi Zhang πŸ“ ACL 2020
πŸ“„ Abstract
The Transformer translation model (Vaswani et al., 2017) based on a multi-head attention mechanism can be computed effectively in parallel and has significantly pushed forward the performance of Neural Machine Translation (NMT). Though intuitively the attentional network can connect distant words via shorter network paths than RNNs, empirical analysis demonstrates that it still has difficulty in fully capturing long-distance dependencies...
Per-paper AI analysis
✨ What Sets It Apart

This paper introduces an end-to-end neural phrase extraction and cross-attention mechanism to address long-distance dependency failures in standard self-attention. New work on modeling long sentences must compare against its +1.72 BLEU improvement on sequences exceeding 45 tokens.

πŸ”— Relevance to Your Work

Both target sequence transduction and machine translation on the WMT14 benchmarks. This paper builds directly upon the submitted paper by adding explicit phrase pooling and cross-attention representations. Its length-based findings highlight a direct limitation in the submitted paper's standard self-attention mechanism, offering an explicit avenue for architectural improvement.

🎯 86.4% match ⭐ Suggested
[R2] Convolutional Sequence to Sequence Learning
πŸ‘₯ Jonas Gehring et al.πŸ“ ICML 2017
🎯 85.3% match⭐ Suggested
[R3] Sequence to Sequence Learning with Neural Networks
πŸ‘₯ Ilya Sutskever et al.πŸ“ NeurIPS 2014
🎯 83.8% matchπŸ” Worth a look
+ 9 more results
Submit your own paper below to see them

Ready to find prior work for your own research?

or
πŸ“„
Upload a paper or draft to find relevant prior work!
Drop PDF files here or click to browse β€’ Multiple PDFs supported β€’ Max 50MB per file