It's pretty fun to poke at! Although it's certainly difficult to be exact, it would be neat if generated pages used the context of the pages they were linked from (ideally, all pages that link to it) to guide the direction of the page. From the ones I generated it seemed they were mostly independent.
You could also argue that the web has failed and poisoning it into irrelevance is a vital service, motivating humans to collect knowledge into immutable sources. We‘ll call them ‘libraries.’
To the web? It's fantastic for the web, these are the kinds of fun projects that make the web a worthwhile place to be. To slop generators? Yes, absolutely harmful, and that's for the best.
Great idea! I created an adjacent website that gives, shall we say, "alternative facts" about your questions. (don't know if the rules allow me to link the site so I won't).
The page requires JS to load its content - user agents without JS support just get a blank page.
I'm not sure if the bots that scrape data to train LLMs are capable of loading that type of page, or if they only work on pages that have the content inside the HTML itself?
any serious scraping service these days will fail over to a headless browser when it fetches an asset referencing a js bundle that isn't verifiably a vendor script
It's pretty fun to poke at! Although it's certainly difficult to be exact, it would be neat if generated pages used the context of the pages they were linked from (ideally, all pages that link to it) to guide the direction of the page. From the ones I generated it seemed they were mostly independent.
Yeah, thought about that, maybe will implement it. Will keep in mind! For now SSR to feed LLMs' the priority
Ironically, this seems much faster (for pages already, erm, "researched") than the real one! How?
It generates articles only once. So once it's generated, it never perish. Logic looks like: If article exist -> show it If not -> generate and save
Another day another beige serif-font vibecoded pedia site
Funny, but you could argue this is actively harmful to the web.
The sooner the current web dies, the better. Something better either rises from its ashes, or we lose... something that was already lost.
You could also argue that the web has failed and poisoning it into irrelevance is a vital service, motivating humans to collect knowledge into immutable sources. We‘ll call them ‘libraries.’
On the other hand, one could argue that anything that can be destroyed by relatively clearly labeled satire, deserves to be.
To the web? It's fantastic for the web, these are the kinds of fun projects that make the web a worthwhile place to be. To slop generators? Yes, absolutely harmful, and that's for the best.
Grokipedia is already doing that.
> you could argue
Could you? I don't see it happening, but I could be wrong.
Pissing on a pile of shit
Great idea! I created an adjacent website that gives, shall we say, "alternative facts" about your questions. (don't know if the rules allow me to link the site so I won't).
Give it a week and see what Google AI Overview has to say about the Great Pigeon Census of 1887!
Finally a more trustworthy version of Grokipedia!
It's hilarious, you made my day hahah
I honestly forgot that Grokipedia existed. Did anyone ever use it?
Tried once, but was useless. Very funny that it had so many text, while Elon is apparently "huge" fan of short and precise communication...
Seeing “Something broke, which is ironic for a made-up encyclopedia: Load failed” when trying to access some of the suggested starting points
Works on my PC.
Could you gimme the url that's failing?
Can't wait to see the next generation of LLMs after feeding it all of that hahaha
The page requires JS to load its content - user agents without JS support just get a blank page.
I'm not sure if the bots that scrape data to train LLMs are capable of loading that type of page, or if they only work on pages that have the content inside the HTML itself?
any serious scraping service these days will fail over to a headless browser when it fetches an asset referencing a js bundle that isn't verifiably a vendor script
It's entirely possible they simply ingest the JS as-is.
I'm aware and will implement SSR soon ;)
Love it! It feels very Borges!
Feature request: also be able to click on the Talk page to see the controversies. I don't always want to trust the article itself as the final word.
Edit: Oh look, there's an article about the YC! https://halupedia.com/y-combinator
Who says llms can't be funny?!
I LOVE IT. Superb.