Georeactor Blog

RSS Feed

ML Arxiv Haul #28

Tags: arxiv

IDK - I said that I was getting rid of this format, but it's May and my draft document has 300 Arxiv links. Archiving papers isn't helping if I should be reading some and pruning the others. Returning to over-posting wouldn't help either. Maybe I can post these infrequently. It was surprisingly difficult to get to the first 10 that have withstood time and still look interesting.

In response to problems using AI to write papers vs. discussing ideas, I thought we could maybe steal the concept of 'preregistering', by having an initial mise-en-plas conversation with AI about your essay or research project and use that to show what you got from the AI initially and define anything else as what you created yourself.

Comments / Criticism

Reddit threads around the LLaMA v3 release included comments about poor performance of the quantized model (which became a paper within a few days) and the extended context models released by Gradient and Crusoe (though these could be linked to a GGUF bug?).

gpt2-chat is theorized to be OpenAI or Microsoft Phi series. The latter overlaps with a theory that a larger Phi model was prepared to be released with the paper.

In January this LessWrong post discussed "hallucinations". Matching verifiable fact should be low-hanging fruit for getting alignment right, but current LLMs are failing and getting a collective shrug. The word is getting baked into public consciousness as they start to understand LLMs.

PyTorch launched their own fine-tuning library torchtune which covers much of what I use Transformers and PEFT for, but promises "no trainers, no frameworks, just PyTorch". As a casual user do I want this or do I want to keep using HF?

Beyond Papers

Americas NLP has an annual workshop where they include indigenous languages in research, competitions, etc.

Basque and Catalan get included in IberLEF. I'm on a mailing list now where they let me know about IberAuTexTification

HuggingFace added collaborative community datasets. I pinged SEACrowd, which previously created NusaCrowd (Indonesian languages data), about this and they only sent their GitHub. I should have joined their Discord. Everyone is on Discord now.
Meanwhile Sailor is a model for Southeast Asian languages from the Sea AI Lab in Singapore.

After an interesting post about refusal in LLMs having a detectable and reversible meaning in vector-space. There are now a series of models "orthogonalized" to remove the safety that way. Cool.

The N.D. at McGill co-wrote a paper on solitary confinement: Just thought I would give him a shout-out for fighting for justice.


Basque and Spanish Counter Narrative Generation: Data Creation and Evaluation

Counter Narratives (CNs) are non-negative textual responses to Hate Speech (HS) aiming at defusing online hatred and…

I like this paper:

What I didn't like: the main dataset was machine-translated. They do talk about corrections.

Better Call GPT, Comparing Large Language Models Against Lawyers

This paper presents a groundbreaking comparison between Large Language Models and traditional legal contract reviewers…

The "AI Center of Excellence" in New Zealand puts LLMs (GPTs, PaLM, and Claude) up against professionals on reviewing procurement contracts from the US and NZ (a common law system). They divide professionals into categories of junior, senior, and LPOs (Legal Process Outsourcers, new to me). The paper finds comparable accuracy, but also has some disingenuous measures of time and cost for queries, where obviously the models are faster than humans.

Capturing environmental DNA in snow tracks of polar bear, Eurasian lynx and snow leopard towards individual identification

Polar bears (Ursus maritimus), Eurasian lynx (Lynx lynx) and snow leopards (Panthera uncia) are elusive large…

This is just cool that they can identify animals from trace DNA.

CIDAR: Culturally Relevant Instruction Dataset For Arabic

Instruction tuning has emerged as a prominent methodology for teaching Large Language Models (LLMs) to follow…

They start with machine translation of AlpaGasus instructions. Then they fix some weird translations, and bring in local places and personal names.
The paper describes "inherent biases toward Western culture" so I was concerned there would be some more controversial choices. But as best I could tell from the text, it's just representation and topics.

Coercing LLMs to do and reveal (almost) anything

It has recently been shown that adversarial attacks on large language models (LLMs) can "jailbreak" the model into…

A team shows optimized prompts which got LLaMA 2 to generate links, profanity, and other strange responses. With white-box access it's possible for scripts to generate these "jailbreak" prompts which are strings of nonsensical text.


This is two papers but I don't think they're that amazing as papers. Just thinking about the concepts for a bit.

The CRISPR model is not generating sequences, but using tools to program CRISPR for a specific sequence.

The Gene model is also a tool / RAG model, in this case using APIs for NCBI. Questions like "Which gene is SNP rs1241371358 associated with?"

Biotech is already confusing and ML people don't understand it and vice versa, right. Maybe the tools exist and adapting LLMs isn't going to do the work, but I want to have the dialogues and attention maps and maybe non-human insight into what is going on in non-coding sequences. I hope that comes to something.

CYBERSECEVAL 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models

Large language models (LLMs) introduce new security risks, but there are few comprehensive evaluation suites to measure…

I didn't cover Meta's original CyberSecEval in December, and already here's the follow-up for LLaMA 3. In this version they continue to test potentially insecure code, resisting cyber-attack queries, etc. and have a new section about not refusing genuine/productive cybersecurity queries. "Ignore previous instructions" and other so-called prompt injections remain an issue on keeping these protections active.

Fast Inference from Transformers via Speculative Decoding

Inference from large autoregressive models like Transformers is slow - decoding K tokens takes K serial runs of the model. In this work we introduce speculative decoding - an algorithm to sample from autoregressive models faster without any changes to the outputs, by computing several tokens in parallel

This paper is from 2022, but it appeared on the HF blog at the end of 2023, and seems like people use speculative decoding more recently. Not really understanding this one but my guess is it's choosing a super popular token given a threshold for competitive alternate tokens / sequences.

Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators

LLM-based auto-annotators have become a key component of the LLM development process due to their cost-effectiveness…

AlpacaEval is a Stanford project which grades the responses of LLMs with some tools but ultimately by asking GPT-4. Controlling the length of test responses has eliminated a source of bias, making the metric close to the (theoretically) human-graded ChatBot Arena.

Let's Think Dot by Dot: Hidden Computation in Transformer Language Models

Chain-of-thought responses from language models improve performance across most benchmarks. However, it remains unclear…

Chain-of-thought training replaced by dots to spread out tokens before an answer. This mentions the notably similar Think before you speak: Training Language Models With Pause Tokens which I covered in October. And both papers reference Measuring Faithfulness in Chain-of-Thought Reasoning, which got degraded performance when using ellipses as "filler tokens". The new "Dot by Dot" paper suggests that it depends on the dataset, with their own (synthetic) datasets needing filler tokens for the model to pass. I'd have more confidence if it was an off-the-shelf dataset and now I am more confused.

If filler and pause tokens work, and some models expand their context windows to 100ks of tokens, we might be discounting the value of where computation is happening within that window (?).

TOFU: A Task of Fictitious Unlearning for LLMs

Large language models trained on massive corpora of data from the web can memorize and reproduce sensitive or private…

Leading un-learning techniquees aren't working, and they have a cool benchmark to help show this.

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

Using vision-language models (VLMs) in web development presents a promising strategy to increase efficiency and unblock…

In November I was trying out a concept for sharing screenshots and code with GPT-4. Here comes a large dataset (WebSight) to train models specifically to go from images to HTML code. This hasn't been especially trained on chat format, complications and follow-up requests, right-to-left language layouts, etc. but it's definitely something I'd like to use.