Bonum Certa Men Certa

Improving Site Navigation and Discovery

posted by Roy Schestowitz on Dec 12, 2023

An open book

THE site is growing fast and people have a hard time searching for much older material. We fully recognise this limitation. It's a real peril. Many sites have the exact same limitation. This problem isn't limited to digital media, either (volumes of material, some of it outdated or unlinked).

In WordPress we used code that checks references in reverse; for any given article, it would (at the bottom) show later (future) articles that link to it. This was very CPU-intensive (at the database level), resulting in pages taking far longer to load. Unless properly cached, it would require scanning about 10 GB of text (or 40,000 blog posts' bodies, not counting drafts/revisions).

We needed to move on. Better sooner than later. Having a server screaming 24/7 to serve requests (whose growing proportion is rogue bots) is not a long-term strategy. Running a Web server on a machine with almost 100 CPU cores isn't cheap.

Before the very final post from Pamela Jones of Groklaw (just over 10 years ago) she wrote about the challenges of preserving old material. She had quit before, then came back, then retired. Fair enough, she wasn't getting young, but it was important for her to ensure the information remains accessible for many years to come (debunking lies about the GPL and origins of Linux). Some time later the site was converted into static pages (still hosted at ibiblio.org), but some material such as old comments disappeared in the process. Geeklog had its share of limitations and apparently it's still being maintained.

Anyway, unlike Groklaw we're still going. I'm 41 and in good health. I receive help from many people and we're good to go. Nothing can stop us, even though some extremists are trying. We won't let wackadoodles waste our time. They just validate what we wrote months ago and they try to attack my wife. Misogynists are like that; they love picking on women.

So what next for search? We've long envisioned this site having self-hosted search, not that lousy WordPress search our blog used to have (it's just some lousy WordPress database scan, which is notoriously weak at delivering relevant results).

No, we don't want to rely on third parties either. We don't want to hear, "how about Google?" or "why not ClownFlare?" (Wherever or whenever there are DDoS attacks)

Any third party means Outsourcing. Outsourcing does not solve the issue; it typically creates additional issues, even if they are temporarily not visible (ClownFlare does not make money yet, so a "big squeeze" is impending and Google is not search anymore).

Several of our articles this month got over 3,000 views and we do not depend on Google, social control media, Gulag Noise (Google News), "Hacker" "News" etc. We have our loyal readership, i.e. people who come back not because "Google told me to..." (so-called 'search')

Many people don't know this, but way back in 2006 we made a "download site" option available (our database was relatively small back then and a WordPress plugin existed to make a database available sans sensitive things like user accounts). For about a year this whole site was available for download, but the site grew too big and it was no longer feasible to generate the dump on the fly and serve requests. These requests were nightmarish. They caused PHP timeouts and MySQL strain.

So what next for data?

Well, we considered what we can install for self-hosted search, seeing what's available that is Free software and is also more potent than just a database scan (over fields like title and body).

Search can help, wiki pages can help even more, but ideally we may go back in time and turn the site into a kind of hierarchical 'book' (a big project! Big but still feasible). It's still debated in IRC.

I quit my job so that I can devote more of my time to promotion of Software Freedom, abolition of software patents etc.

While we continue to discuss the best way to organise information in this site (suggestions welcome, IRC would work best) we remind readers that we're actively seeking help with server bills. We want to keep going for more than a decade to come and help from readers enables us to spend more time researching, writing, tidying up existing material (lots of wiki refactoring to come over the Christmas period), maybe adding a self-hosted search facility.

Dog Golden Retriever Card: Watercolor painting of a golden retriever dog holding a leash

Other Recent Techrights' Posts

Microsoft-Connected Sites Trying to Shift Attention Away From Microsoft's Megebreach Only Days Before Important If Not Unprecedented Grilling by the US Government?
Why does the mainstream media not entertain the possibility a lot of these talking points are directed out of Redmond?
[Video] 'Late Stage Capitalism': Microsoft as an Elaborate Ponzi Scheme (Faking 'Demand' While Portraying the Fraud as an Act of Generosity and Demanding Bailouts)
Being able to express or explain the facts isn't easy because of the buzzwords
Microsoft ("a Dying Megacorporation that Does Not Create") and IBM: An Era of Dying Giants With Leadership Deficits and Corporate Bailouts (Subsidies From Taxpayers)
Microsoft seems to be resorting to lots of bribes and chasing of bailouts (i.e. money from taxpayers worldwide)
 
Windows in Lebanon: Down to 12%?
latest from statCounter
Links 18/05/2024: Caledonia Emergency Powers, "UK Prosecutor's Office Went Too Far in the Assange Case"
Links for the day
US Patent and Trademark Office Sends Out a Warning to People Who Do Not Use Microsoft's Proprietary Formats
They're punishing people who wish to use open formats
Links 18/05/2024: Fury in Microsoft Over Studio Shutdowns, More Gaming Layoffs
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Friday, May 17, 2024
IRC logs for Friday, May 17, 2024
Links 18/05/2024: KOReader, Benben v0.5.0 Progress Update, and More
Links for the day
[Meme] UEFI 'Secure' Boot Boiling Frog
UEFI 'Secure' Boot: You can just ignore it. You can just turn it off. You can hack on it as a workaround. Just use Windows dammit!
The Market Wants to Delete Windows and Install GNU/Linux, UEFI 'Secure' Boot Must Go!
To be very clear, this has nothing to do with security and those who insist that it is have absolutely no credentials
In the United States Of America the Estimated Share of Google Search Grew After Microsoft's Chatbot Hype (Which Coincided With Mass Layoffs at Bing)
Microsoft's chatbot hype started in late 2022
Techrights Will Categorically Object to Any Attempts to Deny Its Right to Publish Informative, Factual Material
we'll continue to publish about 20 pages per day while challenging censorship attempts
Links 17/05/2024: Microsoft Masks Layoffs With Return-to-office (RTO) Mandates, More YouTube Censorship
Links for the day
YouTube Progresses to the Next Level
YouTube is a ticking time bomb
Journalists and Human Rights Groups Back Julian Assange Ahead of Monday's Likely Very Final Decision
From the past 24 hours...
[Meme] George Washington and the Bill of Rights
Centuries have passed since the days of George Washington, but the principles are still the same
Daniel Pocock: "I've Gone to Some Lengths to Demonstrate How Corporate Bad Actors Have Used Amateur-hour Codes of Conduct to Push Volunteers Into Modern Slavery"
"As David explains, the Codes of Conduct should work the other way around to regulate the poor behavior of corporations who have been far too close to the Debian Suicide Cluster."
Video of Richard Stallman's Talk From Four Weeks Ago
2-hour video of Richard Stallman speaking less than a month ago
statCounter Says Twitter/X Share in Russia Fell From 23% to 2.3% in 3 Years
it seems like YouTube gained a lot
Journalist Who Won Awards for His Coverage of the Julian Assange Ordeals Excluded and Denied Access to Final Hearing
One can speculate about the true reason/s
Richard Stallman's Talk, Scheduled for Two Days Ago, Was Not Canceled But Really Delayed
American in Paris
3 More Weeks for Daniel Pocock's Campaign to Win a Seat in European Parliament Elections
Friday 3 weeks from now is polling day
Microsoft Should Have Been Fined and Sanctioned Over UEFI 'Lockout' (Locking GNU/Linux Out of New PCs)
Why did that not happen?
Gemini Links 16/05/2024: Microsoft Masks Layoffs With Return-to-office (RTO) Mandates, Cash Issues
Links for the day
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Thursday, May 16, 2024
IRC logs for Thursday, May 16, 2024
Ex-Red Hat CEO Paul Cormier Did Not Retire, He Just Left IBM/Red Hat a Month Ago (Ahead of Layoff Speculations)
Rather than retire he took a similar position at another company
Linux.com Made Its First 'Article' in Over and Month, It Was 10 Words in Total, and It's Not About Linux
play some 'webapp' and maybe get some digital 'certificate' for a meme like 'clown computing'
[Meme] Never Appease the Occupiers
Freedom requires truth. Free speech emancipates.
Thorny Issues, Violent Response
They say protests (or strikes) that do not disrupt anything are simply not effective. The same can be said about reporting.
GNU/Linux in Malaysia: From 0.2 Percent to 6+ Percent
That's like 30-fold increase in relative share
Liberty in Liberia? Windows Falls Below 10% and Below iOS
This is clearly a problem for Microsoft
Techrights Congratulates Raspberry Pi (With Caution and Reservations)
Raspberry Pi will "make or break" based on the decisions made in its boardroom
OSI Makes a Killing for Bill Gates and Microsoft (Plagiarism and GPL Violations Whitewashed and Openwashed)
meme and more
The FSF Ought to Protest Against UEFI 'Secure Boot' (Like It Used To)
libreplanet-discuss stuff
People Who Defend Richard Stallman's Right to Deliver Talks About His Work Are Subjected to Online Abuse and Censorship
Stallman video removed
GNU/Linux Grows in Denmark, But Much of That is ChromeOS, Which Means No Freedom
Google never designs operating systems with freedom in mind
Links 16/05/2024: Vehicles Lasting Fewer Years, Habitat Fragmentation Concerns
Links for the day
GNU/Linux Reaches 6.5% in Canada (Including ChromeOS), Based on statCounter
Not many news sites are left to cover this, let alone advocate for GNU/Linux
Links 16/05/2024: Orangutans as Political Props, VMware Calls Proprietary 'Free'
Links for the day
The Only Thing the So-called 'Hey Hi Revolution' Gave Microsoft is More Debt
Microsoft bailouts
TechTarget (and Computer Weekly et al): We Target 'Audiences' to Sell Your Products (Using Fake Articles and Surveillance)
It is a deeply rogue industry that's killing legitimate journalism by drowning out the signal (real journalism) with sponsored fodder
FUD Alert: 2024 is Not 2011 and Ebury is Not "Linux"
We've seen Microsofers (actual Microsoft employees) putting in a lot of effort to shift the heat to Linux
Links 15/05/2024: XBox Trouble, Slovakia PM Shot 5 Times
Links for the day
Windows in Times of Conflict
In pictures
Over at Tux Machines...
GNU/Linux news for the past day
IRC Proceedings: Wednesday, May 15, 2024
IRC logs for Wednesday, May 15, 2024