(Unless of course the "dump" or rawest form of database storage you can access has some field or notation for category, in which case it'd be even easier!) — Outlander
Next, we preprocessed the posts by generating embeddings. Using the MiniLM-L6-v2 model, each post was converted into a 384-dimensional numerical vector capturing its semantic meaning. These vectors were stored locally as embeddings.npy . To enable fast similarity search, we built a FAISS index from the embeddings, allowing the bot to retrieve only the most relevant posts for a user query rather than scanning all 29,918 posts each time.
We then integrated the BannoBot script, which takes a user’s question, converts it into an embedding, searches the FAISS index for top-k relevant posts, and constructs a prompt including these excerpts. This prompt is passed to a local LLM (Orca-Mini), which generates a natural-language answer in the style and content of your posts. All processing—embedding, search, and LLM inference—occurs on your laptop, ensuring privacy and avoiding cloud APIs.
I'll tempt you to do something like this with the entire data file... a master philosophy forum bot... — Banno
In my present immoral state, I'll tempt you to do something like this with the entire data file... a master philosophy forum bot... — Banno
Should I feed it into an LLM and build a Banno Chat Bot to deal with trivial posts with minimal intervention? — Banno
As it stands it is only using the top 4 posts. I'll have a play and see if it can do more without being too slow. — Banno
Out of curiosity, I am wondering whether Discourse was the only option able to accommodate the new laws. Were other options also capable? — Leontiskos
I don't suppose there's any chance of uploaded images being rehabilitated, on the archive site? — bongo fury
If not, is there a deadline for replacing them with linked ones? — bongo fury
Preferably also sending a 301 Moved Permanently header. Provided you or another staff member created the archive site from scratch (not using a pre-boxed framework or library) your knowledge in such fields seems sufficient enough to do so easily (and most importantly: properly or safely). — Outlander
This would mean having the comment ID fed to a script that pulls up the discussion ID and then redirects the user to the relevant discussion URL prefixed with a hash anchor containing the comment ID. — Outlander
If you work in a restaurant, you try to separate stored goods from actual food production. And the idea is central to other means of production. So, I applied a pedestrian truism to a current situation. Not expecting a Pulitzer prize for that observation. — Paine
The one thing I'm concerned about transitioning to another platform is that this website will just deteriorate over time without maintenance and the whole archive will be lost, something that's been happening to the rest of the internet with broken sourced hyperlinks everywhere. — Saphsin
Looking at the last page of those discussion results, it seems The Philosophy Forum has been around for just over 10 years, or since October 20th of 2015. That's curious and wonderful the website is a decade old. — Bret Bernhoft
How do you mean? — Christoffer
Yet, there's also a point to be made that many arguments that rely on facts tend to be called leftist. There's far more climate science deniers or general deniers of scientific results on the conservative right... so if a human's best attempt to reach a truth based on facts and scientific data is considered "leftist", then I guess that tells more about the political spectrum than that this forum is "unbalanced". — Christoffer
Since the old site will be preserved, I wonder if the "advanced search" functions that work now will work there — Paine
Thanks Jamal, the projected new forum sounds great. Is there any way of exporting one's posts from the old forum as a word file? — Janus
