Mar 24, 2024
Reducing the Reversal Curse?
A follow-up on the reversal curse and how reverse training may help language models handle reversed relations.
Read post →Series
Short notes from the AI explained series: LLMs, agents, multimodal models, and the ideas that made the field move.
Mar 24, 2024
A follow-up on the reversal curse and how reverse training may help language models handle reversed relations.
Read post →Mar 17, 2024
Apple's MM1 paper distilled into practical lessons on data mixtures, image resolution, synthetic data, and multimodal architecture.
Read post →Feb 18, 2024
A look at CoT decoding, an approach that explores alternative decoding paths to reveal reasoning without explicit chain-of-thought prompts.
Read post →Feb 18, 2024
A look at active exploration for RLHF-style feedback collection and why choosing better questions can improve LLM training efficiency.
Read post →Feb 9, 2024
A short note on embodied agent foundation models, multimodal perception, planning, and what scale could mean for agent capabilities.
Read post →Jan 30, 2024
A short overview of how a large benchmark evaluates multimodal language models across generalization, trustworthiness, and causality.
Read post →Jan 21, 2024
An explanation of self-rewarding language models, where models generate prompts, judge responses, and iteratively train from their own feedback.
Read post →Jan 14, 2024
A short explainer on AMIE, Google's diagnostic dialogue system for medical interviews and clinical reasoning.
Read post →Jan 9, 2024
A follow-up note on why AI text detectors can fail, from perplexity and burstiness limits to false positives and bias against non-native writers.
Read post →Jan 7, 2024
A look at task contamination and why benchmark gains in zero-shot and few-shot LLM evaluation may be misleading.
Read post →Dec 27, 2023
A look at AppAgent, a multimodal agent that learns to operate smartphone apps through tapping, swiping, screenshots, and memory.
Read post →Dec 17, 2023
A study of how persuasive misinformation can change LLM responses, from rejection and uncertainty to acceptance.
Read post →Dec 11, 2023
A NeurIPS study comparing newborn chicks and Vision Transformers on view-invariant object recognition from limited visual experience.
Read post →Dec 8, 2023
An introduction to Mamba, selective state space models, and why linear-time sequence modeling is exciting for language models.
Read post →Dec 1, 2023
A look at how researchers extracted memorized training examples from ChatGPT and what that means for privacy and copyright.
Read post →Nov 24, 2023
A short explanation of System 2 Attention, a method for regenerating context before answering to reduce distraction and sycophancy.
Read post →Nov 24, 2023
An explanation of why LLMs can learn A equals B but fail to answer the reverse relation B equals A.
Read post →Nov 16, 2023
A practical explanation of Ghostbuster, perplexity features, and why AI text detection remains difficult.
Read post →Nov 12, 2023
A short recommendation list of high-quality machine learning blogs by Lilian Weng, Eugene Yan, and Chip Huyen.
Read post →Nov 9, 2023
An explanation of implicit chain-of-thought reasoning through hidden states and knowledge distillation.
Read post →Nov 8, 2023
A quick explanation of ControlNet and how diffusion models can generate QR-code and illusion-like images.
Read post →Nov 2, 2023
A short explainer on EmotionPrompt and how emotional stimuli can affect LLM task performance.
Read post →Oct 26, 2023
An explanation of step-back prompting and how abstraction can improve LLM reasoning on complex tasks.
Read post →Oct 26, 2023
A short practical note on using random seeds with DALL-E 3 to make image generation more reproducible and controllable.
Read post →Oct 20, 2023
A practical explanation of LLM chains, agents, tools, memory, and why autonomous planning changes how LLM apps behave.
Read post →Oct 13, 2023
A look at why visual tokenizers like MAGVIT-v2 make language-model-based image and video generation more competitive with diffusion models.
Read post →Oct 6, 2023
A short tutorial on downloading YouTube transcripts, restoring punctuation, and using the OpenAI API to summarize or query a video.
Read post →Sep 25, 2023
A short note on Claude's long-context advantage, why full-document context matters, and Anthropic's analysis of prompting strategies.
Read post →Sep 24, 2023
A short note on RAIN, a rewindable inference technique for making pretrained LLM outputs more helpful and harmless without weight updates.
Read post →Sep 7, 2023
A step-by-step tutorial for building an interactive CV chatbot with TRURL, Hugging Face embeddings, FAISS, and LangChain.
Read post →Aug 28, 2023
A short note on why LangChain, FAISS, and RAG made smaller embedding models like SentenceBERT important again.
Read post →May 1, 2021
By Agnieszka Mikołajczyk
A wrap-up of the 5-month non-profit project, roles, outputs, arXiv paper, repository, and blog series.
Read post →Mar 5, 2021
By Sylwia Majchrowska, Agnieszka Mikołajczyk
Using pseudo-labeling and OpenLitterMap data to expand a waste classifier beyond the labeled dataset.
Read post →Feb 9, 2021
By Sylwia Majchrowska, Agnieszka Mikołajczyk
EfficientDet results on 7-class and one-class waste detection, plus the motivation for separating detection and classification.
Read post →Dec 8, 2020
By Maria Ferlin, Agnieszka Mikołajczyk
Exploratory analysis of extended TACO annotations, category mapping, bounding boxes, and dataset imbalance.
Read post →Nov 20, 2020
By Maria Ferlin, Agnieszka Mikołajczyk
How the Detect Waste story began: motivation, recycling rules, waste categories, and the TACO dataset.
Read post →Jan 6, 2020
By Agnieszka Mikołajczyk, Magdalena Kortas
How a WiMLDS Trójmiasto team used deep learning, acoustics, and ornithology to classify bird species from sound.
Read post →