Thought Eddies

2025-08-19 Logs

2025-08-19

It's been a while since I've attempted to use LLMs to solve Connections puzzles.

2024-09-21 Posts

Claude 3.5 Sonnet Connections Evals

I've continued experimenting with techniques to prompt a language model to solve Connections. At a high level, I set out to design an approach to hold the model to a similar standard as a human player, within the restrictions of the game. These standards and guardrails include the following: The...

connections evals

2024-09-19 Logs

2024-09-19

I finally found some time to run a more comprehensive evals of Connections with one guess at a time and using Python code to validate the guesses and give feedback. I ran about 100 puzzles with gpt-4o-mini, gp-4o, and claude-3-5-sonnet, but it became clear that Sonnet was going to perform the best,...

connections evals

2024-07-24 Logs

2024-07-24

I ran the code from my Fine-tuning "Connections" post using gpt-4o-mini. I was hoping the results might be a bit better, which could motivate an effort to fine-tune the model. I'm not sure where my original version of this code went, so I reconstructed a repo for it. Once I was done, I ran 100...

connections gpt-4o-mini

2024-03-24 Logs

2024-03-24

I got this result twice in a row.

connections

2024-03-23 Logs

2024-03-23

One of the greatest misconceptions concerning LLMs is the idea that they are easy to use. They really aren’t: getting great results out of them requires a great deal of experience and hard-fought intuition, combined with deep domain knowledge of the problem you are applying them to.

language_models claude3 gpt4 connections

2024-03-22 Logs

2024-03-22

Did a bit more work on a LLM evaluator for connections. I'm mostly trying it with gpt-4 and claude-3-opus. On today's puzzle, the best either did was 2/4 correct. I'm unsure how much more improvement is possible with prompting or even fine tuning, but it's an interesting challenge.

language_models connections

2024-01-16 Logs

2024-01-16

I spent another hour playing around with different techniques to try and teach and convince gpt-4 to play Connections properly, after a bit of exploration and feedback. I incorporated two new techniques Asking for on category at a time, then giving the model feedback (correct, incorrect, 3/4) Using...

language_models openai connections

2024-01-13 Posts

Fine-tuning gpt-3.5-turbo to learn to play "Connections"

I started playing the NYTimes word game "Connections" recently, by the recommendation of a few friends. It has the type of freshness that Wordle lost for me a long time ago. After playing Connections for a few days, I wondered if an OpenAI language model could solve the game (the objective is to...

language_models fine_tuning connections

Tag: connections

Keyboard Shortcuts

Global