I tried stacking multiple pages of a PDF vertically into a single image, passing it to a model, and then extracting data from it. It didn't work. I imagine this is because models aren't trained on much data...
I wrote and screen-recorded myself building a Python app to call a model to extract structured data from an image, making heavy use of codegen with Cursor. The same protobuf is used as instructions...
This point resonates with me. The more time I spend prompting models, the more apparent it becomes that the clarity of the instructions is what matters most. Writing clear, unambiguous instructions...
I ran the code from my Fine-tuning "Connections" post using gpt-4o-mini. I was hoping the results might be a bit better, which could motivate an effort to fine-tune the model. I'm not sure where my...
Tried to join in on the llama3.1-405b hype using Groq but sadly, no dice
I've been wanting to create a chat component for this site for a while, because I really don't like quoting conversations and manually formatting them each time. When using a model playground,...
espanso
Incredible writing and insight by Linus in Synthesizer for thought. I will probably need to revisit this work several times.
How can I add videos to Google Gemini as context (is this even what their newest model is called anymore?) and why is it so hard to figure out? https://gemini.google.com only lets me upload...
Research and experimentation with models present different problems than I am used to dealing with on a daily basis. The structure of what you want to try out changes often, so I understand why some...