priorwork - ML Paper Discovery & Citation Analysis

Live Example

This is what you get — real output from a real search

Submit a paper or write a query — here's exactly what priorwork.fyi returns.

📄

Example: submitted paper

Attention Is All You Need

Vaswani, Shazeer, Parmar et al. · NeurIPS 2017

Cross-paper synthesis — what the field looks like from your vantage point

📊 State of the Field

Primary Trends:

The dominant technical theme is the architectural evolution of sequence-to-sequence models for natural language processing, specifically focusing on overcoming the computational and representational limits of recurrent neural networks. This shared focus on machine translation benchmarks (e.g., WMT'14) and sequence transduction complexity likely caused their co-retrieval.

Confidence: high

Secondary Trends:

Modifications to self-attention to achieve linear time complexity or faster autoregressive decoding.

Papers: [R5] [R7] [R10]

Enhancements or alternatives to positional encoding and structural sequence representations.

Papers: [R2] [R8] [R9]

Methodological Spectrum:

Predominantly empirical deep learning systems papers introducing novel architectural components, complemented by formal complexity/mathematical equivalence analyses in a few cases.

🔗 Your Work in Context:

The submitted paper is the foundational baseline that serves as the direct anchor for the entire retrieved set. While the retrieved papers largely propose modifications to fix the submitted model's quadratic complexity, lack of recurrence, or static depth, the submitted paper itself defines the exact paradigm they seek to improve. It uniquely establishes the core multi-head self-attention mechanism that all subsequent papers in this cluster either optimize, hybridize, or replace.

Ranked results — with per-paper analysis tailored to your work

Showing 1 of 10 results

[R1] Learning Source Phrase Representations for Neural Machine Translation [PDF]

👥 Hongfei Xu, Josef van Genabith, Deyi Xiong, Qiuhui Liu, Jingyi Zhang 📍 ACL 2020

📄 Abstract

The Transformer translation model (Vaswani et al., 2017) based on a multi-head attention mechanism can be computed effectively in parallel and has significantly pushed forward the performance of Neural Machine Translation (NMT). Though intuitively the attentional network can connect distant words via shorter network paths than RNNs, empirical analysis demonstrates that it still has difficulty in fully capturing long-distance dependencies...

Per-paper AI analysis

✨ What Sets It Apart

This paper introduces an end-to-end neural phrase extraction and cross-attention mechanism to address long-distance dependency failures in standard self-attention. New work on modeling long sentences must compare against its +1.72 BLEU improvement on sequences exceeding 45 tokens.

🔗 Relevance to Your Work

Both target sequence transduction and machine translation on the WMT14 benchmarks. This paper builds directly upon the submitted paper by adding explicit phrase pooling and cross-attention representations. Its length-based findings highlight a direct limitation in the submitted paper's standard self-attention mechanism, offering an explicit avenue for architectural improvement.

🎯 86.4% match ⭐ Suggested

[R2] Convolutional Sequence to Sequence Learning

👥 Jonas Gehring et al.📍 ICML 2017

🎯 85.3% match⭐ Suggested

[R3] Sequence to Sequence Learning with Neural Networks

👥 Ilya Sutskever et al.📍 NeurIPS 2014

🎯 83.8% match🔍 Worth a look

+ 9 more results

Submit your own paper below to see them

Ready to find prior work for your own research?

Recent Searches (Last 30 Days)

Account Settings

Account Information

Email Preferences

Danger Zone