Logs

Did a bit more work on an LLM evaluator for Connections. I'm mostly trying it with gpt-4 and claude-3-opus. On today's puzzle, the best either did was 2/4 correct. I'm unsure how much more improvement...
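
As a rough illustration of what a score like 2/4 means, here is a minimal sketch of one way to grade a model's answer, assuming each answer is four groups of four words and a group only counts when all four words match. The function name `score_groups` and the sample words are hypothetical and not taken from the actual evaluator.

```python
def score_groups(predicted, solution):
    # Normalize each group to a case-insensitive set so word order does not matter.
    def normalize(groups):
        return {frozenset(word.strip().lower() for word in group) for group in groups}

    # A predicted group counts only if all of its words match a solution group exactly.
    return len(normalize(predicted) & normalize(solution))


# Hypothetical puzzle data, for illustration only.
solution = [
    ["apple", "pear", "plum", "fig"],
    ["red", "blue", "green", "pink"],
    ["cat", "dog", "cow", "pig"],
    ["oak", "elm", "ash", "yew"],
]
predicted = [
    ["Pear", "apple", "fig", "plum"],
    ["red", "blue", "green", "ash"],
    ["cat", "dog", "cow", "pig"],
    ["oak", "elm", "pink", "yew"],
]

print(f"{score_groups(predicted, solution)}/{len(solution)} correct")  # prints "2/4 correct"
```

Exact-match scoring is strict: a group with three of four words right scores zero, which is also how the game itself treats a guess.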

I use the <kbd>hyper</kbd>+<kbd>u</kbd> keyboard shortcut to open a language model playground for convenience. I might use this 10-20 times a day. For the last year or so that I've been doing this,...

I spent some time exploring Deepgram's Next.js starter app. I was hoping I could use it to generate a transcription in real time, but it was more like real-time captions. The responses from the server...

I tried to get claude-3-opus to solve today's Connections. It did well enough that I am going to try doing this more consistently.

I like Nix a lot. I still don't understand a lot about it. I cannot lose an hour to installing dependencies when that is a solved problem using Homebrew.