Hacker News new | past | comments | ask | show | jobs | submit | from login
Noteworthy LLM Research Papers of 2024 Megapost (sebastianraschka.com)
5 points by yaiml 7 days ago | past | discuss
Implementing a Byte Pair Encoding (BPE) Tokenizer from Scratch (sebastianraschka.com)
2 points by headalgorithm 10 days ago | past | discuss
Implementing a Byte Pair Encoding (BPE) Tokenizer from Scratch (sebastianraschka.com)
4 points by sbbq 13 days ago | past | discuss
AI Research Recap 2024: From New Scaling Laws to Scaling Inference Compute (sebastianraschka.com)
1 point by sbbq 15 days ago | past
Noteworthy AI Research Papers of 2024 (Part One) (sebastianraschka.com)
1 point by birdculture 26 days ago | past
Noteworthy AI Research Papers of 2024 (Part One) (sebastianraschka.com)
1 point by sbbq 29 days ago | past
Collection of 1k LLM Research Papers of 2024 (sebastianraschka.com)
4 points by sbbq 31 days ago | past
LLM Research Papers: The 2024 List (sebastianraschka.com)
5 points by ModelForge 43 days ago | past
LLM Research Papers: The 2024 List (sebastianraschka.com)
1 point by mdp2021 53 days ago | past
Understanding Multimodal LLMs (sebastianraschka.com)
2 points by lapnect 87 days ago | past
Understanding Multimodal LLMs: The Main Techniques and Latest Models (sebastianraschka.com)
4 points by sbbq 88 days ago | past
Building a GPT-Style LLM Classifier from Scratch (sebastianraschka.com)
2 points by mdp2021 4 months ago | past
Building LLMs from the Ground Up: A 3-Hour Coding Workshop (sebastianraschka.com)
970 points by mdp2021 5 months ago | past | 136 comments
Show HN: New LLM Pre-Training and Post-Training Paradigms (sebastianraschka.com)
2 points by rasbt 5 months ago | past
New LLM Pre-Training and Post-Training Paradigms: How Modern LLMs Are Trained (sebastianraschka.com)
5 points by sbbq 5 months ago | past
Developing an LLM: Building, Training, Finetuning (sebastianraschka.com)
1 point by Anon84 7 months ago | past
Understanding the LLM Development Cycle: Building, Training, Finetuning (sebastianraschka.com)
3 points by rasbt 7 months ago | past
The latest major open LLM releases: Mixtral, Llama 3, Phi-3, and OpenELM (sebastianraschka.com)
5 points by rasbt 8 months ago | past
Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com)
2 points by sbbq 10 months ago | past
Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com)
2 points by tosh 10 months ago | past
Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com)
1 point by Anon84 10 months ago | past
Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com)
2 points by rasbt 10 months ago | past
AI Research in Feb 2024 – LoRA Successor, "Small" LLMs, Transparent LLM Research (sebastianraschka.com)
3 points by rasbt 11 months ago | past
Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch (sebastianraschka.com)
96 points by rasbt 11 months ago | past | 10 comments
AI Research Papers in Jan 2024: Model Merging, Mixtures of Experts, Smaller LLMs (sebastianraschka.com)
20 points by rasbt 12 months ago | past
Naive Bayes and Text Classification I – Introduction and Theory (2014) (sebastianraschka.com)
2 points by vikrum on Jan 22, 2024 | past
Coding Self-Attention, Multi-Head Attention, Cross-Attention, Causal-Attention (sebastianraschka.com)
142 points by rasbt on Jan 14, 2024 | past | 11 comments
Ten Noteworthy AI Research Papers of 2023 (sebastianraschka.com)
128 points by danboarder on Jan 6, 2024 | past | 19 comments
Noteworthy AI Research Papers of 2023 (sebastianraschka.com)
3 points by rasbt on Jan 1, 2024 | past
Ten Noteworthy AI Research Papers of 2023 (sebastianraschka.com)
9 points by lucasus on Dec 30, 2023 | past

Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: