The future of code search is not regex – 100x faster than ripgrep

(fff.dmtrkovalenko.dev)

25 points | by neogoose 3 hours ago ago

15 comments

  • forrestthewoods 2 minutes ago

    Websites that don’t tell me what they’re doing are infuriating. I’m on mobile. This landing page experience is awful.

  • kristopolous 2 hours ago

    I ran across this fascinating tool a few days ago researching embedding models on hugging face.

    Advertised as "ColGREP Semantic code search for your terminal and your coding agents",

    I haven't put it in any harness yet but I probably should.

    https://github.com/lightonai/next-plaid/tree/main/colgrep

    I've also tried astgrep (also known as sg) but llms really mess up on them. I think you'd need to fine tune.

    If anyone has cracked that case I'd love to hear about it

  • genewitch an hour ago

    considering that ripgrep has marginal overhead over just reading the files to /dev/null, how exactly does this achieve 100x speedup?

    I have a lot of use for something that can search ~1GB of text "instantly", but so far nothing beats rg/ag after the data has been moved into RAM.

    • anilakar an hour ago

      The trick to optimization is not "doing faster" but "doing less". I already feel rg is missing a ton of results I want to see because it has a very large ignore list by default.

  • pjmlp 10 minutes ago

    It has never been ripgrep for decades for those of us on IDEs.

  • swiftcoder 27 minutes ago

    Is there a write up of the underlying approach? The summary on the repo mentioned SIMD, but not a whole lot else.

  • neogoose 3 hours ago

    I have open sourced the fastest code search implementation. Comprehensive SDK for both file finder and grep file search that is over 100x faster than ripgrep

    • siva7 an hour ago

      I don't get this submission title. Your tool uses regex but the title claims the future is not about regex.

      • molszanski 33 minutes ago

        I think it is about input. Before I had to type regex, now I just type text and fuzzy finds more, regex style. Awkward wording, but code seems cool.

    • MaxMonteil 3 hours ago

      This looks cool!

      You should add a link to the GitHub repo for the project itself, at first I wasn't even sure what it was called.

      I found this link https://github.com/dmtrKovalenko/fff.nvim

  • dig1 31 minutes ago

    ctags, GNU Global and even "ugrep -Q" would like to have a few words with you ;)

  • globular-toast an hour ago

    Why is it "for neovim"? Surely such a thing would be useful in many applications?

    • ramon156 an hour ago

      Because it's being dishonest from multiple angles.

      - it has regex, so the title is weird - it definitely wouldn't be 100x faster than rg - its an sdk, so its apples to oranges anyway

  • asdfadsfaf 34 minutes ago

    I don't get it how can I search anything but the file name?

  • schrodinger 2 hours ago

    How's it work? Embed tokens and use euclidean distance or something?