Logs

I tried stacking multiple pages of a PDF vertically into a single image, passing that image to a model, and then extracting data from it. It didn't work. I imagine this is because models aren't trained on much data...

I built a Python app that calls a model to extract structured data from an image, screen-recording myself along the way and making heavy use of codegen with Cursor. The same protobuf is used as instructions...

This point resonates with me. The more time I spend prompting models, the clearer it becomes that the clarity of the instructions is what matters most. Writing clear, unambiguous instructions...

I ran the code from my Fine-tuning "Connections" post using gpt-4o-mini. I was hoping the results might be a bit better, which could motivate an effort to fine-tune the model. I'm not sure where my...

I've been wanting to create a chat component for this site for a while, because I really don't like quoting conversations and manually formatting them each time. When using a model playground,...

Incredible writing and insight by Linus in Synthesizer for thought. I will probably need to revisit this work several times.

How can I add videos to Google Gemini as context (is this even what their newest model is called anymore?), and why is it so hard to figure out? https://gemini.google.com only lets me upload...

Research and experimentation with models present different problems than I am used to dealing with on a daily basis. The structure of what you want to try out changes often, so I understand why some...