One of the greatest misconceptions concerning LLMs is the idea that they are easy to use. They really aren’t: getting great results out of them requires a great deal of experience and hard-fought...
Logs
Did a bit more work on a LLM evaluator for connections. I'm mostly trying it with gpt-4 and claude-3-opus. On today's puzzle, the best either did was 2/4 correct. I'm unsure how much more improvement...
Setup a Temporal worker in Ruby and got familiar with its ergonomics.
I use the <kbd>hyper</kbd>+<kbd>u</kbd> keyboard shortcut to open a language model playground for convenience. I might use this 10-20 times a day. For the last year or so that I've been doing this,...
I tried setting up sqlite-vss with Deno following these instructions but got stuck on this error
I've spend almost a week, on and off, trying to install ollama using Nix in such a way that ollama serve will be run and managed automatically in the background. Initially, I had tried to install...
I spent some time exploring Deepgram's Next.js starter app. I was hoping I could use it to generate a transcription in realtime but it was more like real-time captions. The responses from the server...
I played around with AgentGPT using Reworkd's cloud hosted instance. I tried a few different goals. The first was travel related. I was pleasantly surprised (unnerved) to see the agent return links...
I tried to get claude-3-opus to solve today's Connections. It did well enough that I am going to try doing this more consistently.
I like Nix a lot. I still don't understand a lot about it. I cannot lose an hour to installing a dependencies when that is a solved problem using Homebrew.