অধ্যায় 57 — Reading AI Research Papers

📖 একটি ছোট গল্প

প্রথমবার একটি NeurIPS paper হাতে তুলে নিলাম — ১২ পাতা, ৫০টির বেশি equation, অপরিচিত notation। ৩ ঘণ্টা পড়ার পর বুঝলাম, "introduction" আর "conclusion" পড়লেই ৮০% বোঝা যায়। Paper পড়া একটি skill — জোরে জোরে নয়, systematic approach দিয়ে শেখা হয়।

The 3-Pass Method (Keshav)

Pass 1: The 5-Minute Scan (Big Picture)

Title, abstract, introduction পড়ুন — কী problem, কী contribution?
Section ও subsection headings দেখুন — structure বোঝা যায়।
Mathematical content skip করুন, শুধু figures ও tables দেখুন।
Conclusion পড়ুন — main result কী?
References scan করুন — কোন্ prior work-এর উপর দাঁড়িয়েছে?

Output: Category, context, correctness, contribution, clarity — ৫টির রেটিং দিন।

Pass 2: The 1-Hour Grasp (Content Without Details)

Figures ও diagrams carefully পড়ুন — প্রায়ই figure-এ main idea থাকে।
Method section-এর pseudo-code বা algorithm দেখুন।
Key equations identify করুন — কোনটা main result, কোনটা lemma।
Experiments section: datasets, metrics, baselines — comparison fair কিনা?

Pass 3: Deep Dive (Re-implementation Level)

Every equation derive করার চেষ্টা — কাগজে নিজে লিখুন।
Proof গুলো verify করুন — assumption valid কিনা?
Code re-implement করুন (if available) — toy version হলেও।
Limitations ও future work identify করুন — নিজের research idea পেতে পারেন!

Understanding Paper Structure

Abstract — ১-২ paragraph, problem + approach + result। কখনোই skip করবেন না।
Introduction — motivation, prior work gaps, main contributions (bullet points খোঁজে বের করুন)।
Related Work — context, কীভাবে এই paper টা different।
Method/Model/Approach — core technical contribution।
Experiments — empirical validation, ablations, comparisons।
Conclusion & Future Work — summary + open problems (research idea source!)।

Reading Math-Heavy Sections

AI paper-এর ৬০% সময় math section-এ যায়। কীভাবে approach করবেন:

Notation table খুঁজুন — না থাকলে নিজে বানান (কাগজের একপাশে)।
Dimension tracking — প্রতিটি variable-এর shape লিখে রাখুন।
Special cases — general formula-এর জন্য simple example (d=2, n=3) try করুন।
Backward from result — theorem → lemma → definition, reverse direction-এ চলুন।
Identify reusable components — কোন lemma অন্য paper-এও দেখেছেন?

✨ টিপ

Paper-এ "without loss of generality" (WLOG) বা "it is easy to show" লেখা থাকলে সতর্ক হোন — often এটাই সবচেয়ে tricky part! Author চেয়েছেন proof short দেখাতে।

Decoding Figures & Tables

Figure 1 (Architecture) — model structure, data flow, components। Color coding মনে রাখুন।
Figure 2-3 (Results) — x-axis ও y-axis carefully দেখুন, log scale কিনা? Error bars আছে?
Table 1 (Main Results) — best number bold হয়, কিন্তু second best দেখুন — consistent কিনা?
Ablations — কোন component সবচেয়ে important? Remove করে কত degrade হয়?
Appendix figures — often supplementary experiments থাকে, main result support করে কিনা check করুন।

Where to Find Good Papers

arXiv.org — preprint, same-day access, free। cs.LG, cs.AI, cs.CL, stat.ML sections।
Papers With Code — paper + code + results, reproducibility check।
OpenReview — ICLR papers, review + rebuttal + meta-review, decision process দেখা যায়।
Semantic Scholar — citation graph, influential citations filter।
Connected Papers — visual graph of related work, similar papers খুঁজে পেতে সহজ।

Python: Paper Metadata Parser

pythonPython · NumPy

import requests
from bs4 import BeautifulSoup

def extract_arxiv_info(arxiv_id):
    """Fetch and parse basic arXiv metadata."""
    url = f"https://export.arxiv.org/api/query?id_list={arxiv_id}"
    response = requests.get(url)
    soup = BeautifulSoup(response.content, 'xml')
    
    entry = soup.find('entry')
    info = {
        'title': entry.find('title').text.strip(),
        'summary': entry.find('summary').text[:300] + "...",
        'authors': [a.text for a in entry.find_all('author')[:3]],
        'published': entry.find('published').text[:4],
        'primary_category': entry.find('arxiv:primary_category')['term']
    }
    return info

# Example: Attention Is All You Need
info = extract_arxiv_info("1706.03762")
for k, v in info.items():
    print(f"{k}: {v}")

Practice Tasks

"Attention Is All You Need" (2017) — Pass 1 দিন, 5-minute-এ ৩টি contribution বের করুন।
Transformer-XL বা BERT paper — math section dimension tracking করুন।
একটি recent ICLR paper-এর OpenReview page দেখুন — reviewer concern কী ছিল?
নিজে একটি reading log শুরু করুন — প্রতি paper-এর summary + critique লিখুন।

Interview Questions

আপনি কীভাবে একটি নতুন research area-এর paper follow করবেন?
Paper-এর result replicate করতে পারছেন না — কী steps নেবেন?
Ablation study কেন গুরুত্বপূর্ণ? কোনো paper without ablation কীভাবে critique করবেন?
Preprint vs peer-reviewed — কোনটা কখন trust করবেন?

Summary · সারসংক্ষেপ

Paper reading = 3-pass method: scan → grasp → deep dive।
Introduction + conclusion = ৮০% understanding, math = details।
Notation table, dimension tracking, special cases — math-heavy section-এ survival tools।
Figures often contain the main idea — never skip them।
arXiv, Papers With Code, OpenReview, Connected Papers — essential tools।

পূর্ববর্তী · CH 56

Math Behind Generative AI

পরবর্তী · CH 58

Understanding Mathematical Equations in Papers