Show HN: PreCog AI – Automatic AI Model Selection for Any Task

(precog.ubik.studio)

23 points | by ieuanking 3 hours ago ago

13 comments

  • KaoruAoiShiho an hour ago

    Do you find that there are a lot of variations and intricacies in deciding which LLM to use? I find it pretty simple to just assume sonnet is best at most coding jobs, o1 best at complicated non-coding tasks, 4o for simple questions that don't require planning. and well, tbh that's it, no other LLMs are that interesting.

    • ieuanking 36 minutes ago

      We found that there is more nuance when it comes to different languages, we also update our leaderboard as new models drop so the best models are always available through PreCog. It's also nice to not have to tab switch constantly between chatbots and instead just use them in one central place. We added a feature so you can just pick the model you'd like to chat with if you dont want to be automatically matched with the leaderboard rankings (all matched responses provide an explanation in the reasoning behind the match so you can see why the model was chosen).

    • cma 17 minutes ago

      I think o1 is best at algorithm heavy coding tasks, opus at api/language-translation knowledge-breath heavy tasks (and also has better recency via later knowledge cutoff). But with latest update opus is pretty close at algorithm heavy coding.

      • ieuanking 12 minutes ago

        We have plans to put o1 on the leaderboard - but Opus is there RN! We should have a new ELO ranking soon, but the second o1 ranking are done it will be on our leaderboard.

  • swyx 2 hours ago

    you're doing model routing - any thoughts on https://github.com/lm-sys/RouteLLM and Martian?

    hope you dont raise funding before you figure out what they haven't

    • ieuanking 2 hours ago

      we have looked at route llm and played with it but we felt like they were focusing on cost minimization, we think theres more work to be done in focusing on routing for the best possible output without the cost constraint. Also just trying to provide easy ways for people to use tools like route llm that dont require coding knowledge. That being said we def wanna release a benchmark of our routing in comparison to some of the pre-trained defaults in route llm

  • okintheory an hour ago

    Ah yes, Minority Report, that story we're so eager to repeat.

    • ieuanking an hour ago

      fortunately we aren't policing anybody :* (we do love PKD tho) we thought precog perfectly described the function

      • JasonSage 36 minutes ago

        I think it’s a great name, it really does describe the function perfectly.

        I got a huge smile when I saw it.

        • ieuanking 28 minutes ago

          tysm - I got a huge smile reading this comment.

  • dvfjsdhgfv 2 hours ago

    I entered the query but was redirected to a login form. If you are honestly looking for feedback and no leads, unblock the app temporarily for HN. If you are for leads, for sure you will get some if this submission receives enough upvotes, but I wouldn't count on many. These days people are not so keen to leaving their data on random websites anymore.

    • ieuanking an hour ago

      Just took down the sign up for anyone who wants try it out! Thanks again for pointing that out - new to posting on here, super helpful hope you get some use out of our project.

    • ieuanking 2 hours ago

      Thanks so much for the feedback (new here for sure) - we are working on that rn, updating as fast possible gotta rebuild some stuff lmao