Category: Search Engines

Posts related to search engines.

PageRank Lives: OpenPageRank by Domcop

In the early days of Google, PageRank was a very important piece of information about a website. It let you know the general authority level of a site and how well it would tend to rank against similar content on another site. The PageRank toolbar, released in 2000, became an important tool in the SEO

Top-Level Domain Popularity

In a crawl of just over 32 million pages, this is the number of domains that I discovered for each top-level domain (TLD). “Known Domains” is the number of domains with that extension that were found in links, while “Crawled Domains” is the number of domains from which pages were retrieved.

WbSrch Offline Again

I put the WbSrch search engine back online in March of 2018. I spent a lot of time improving it over the 16 months since, but it’s the sort of thing that always manages to demand more time and energy. It’s time to stop giving it either; though it’s grown and improved a lot, it’s

WbSrch Online Again

A while back I open-sourced the code for the WbSrch search engine. It’s online now in a much-reduced form at wbsrch.com. It’s not the full search engine. Far from it. It’s just a tiny database of about 10,000 URLs to demo the source code, but it’s possible you’ll actually find what you’re looking

Setting Up a Redash Dashboard

This was originally posted on wbsrch.com. It is reproduced here to preserve history. The more WbSrch evolves, the more it becomes necessary to keep track of a bunch of metrics. Until now we’ve been using a mix of simple report pages and raw SQL queries. It has worked well enough, but not having a clean

An Experiment with Project Wonderful

This was originally posted on wbsrch.com. It is reproduced here to preserve history. I’m always looking for new and efficient ways to let people know about WbSrch. That’s why I decided to try advertising with Project Wonderful. Project Wonderful was built as a banner ad network for web comics. That doesn’t mean you can only

Analysis of Search Engine Crowdfunding Campaigns on IndieGoGo

This was originally posted on wbsrch.com. It is reproduced here to preserve history. In the process of researching crowdfunding campaigns, I searched IndieGoGo for search engine pitches. I found 22 attempts to fund “actual search engines”. Here is a list (with links to the IndieGoGo campaign): TheNet101 Thumbar Jixty.com Xense Iyiyes Aspinosa Rexyo Asim Shah

AdSense Alternatives for Startups and Small Websites

This was originally posted on wbsrch.com. It is reproduced here to preserve history. In starting WbSrch, a search competitor to Google, I knew that at some point Google would find a way to “invite us to leave” AdSense. The Terms of Service make it clear that AdSense is incompatible with a search engine (can’t have ads on

Why I Decided to Build a Search Engine (And You Should Too)

This was originally posted on wbsrch.com. It is reproduced here to preserve history. I’ve always wanted to build something big, but never had a burning desire to create any one specific thing. Instead, I built lots of little things – small desktop apps, weekend websites, etc. It wasn’t until AltaVista shut down that I realized that the world