Background

Wikimedia Discovery Is The Foundation's Weapon For improving Wikimedia Experience, Including Its Search

Wikipedia logoWikimedia Foundation, the U.S-based non-profit organization behind the largest online encyclopedia Wikipedia and many of its sisters, is quietly building its own advanced search feature. As the first phase of undertaking the attempt, it received a granted sum of $250,000 by the John S. and James L. Knight Foundation, to research how people access and interact with Wikimedia content.

To initiate its goal, Wikimedia Discovery, previously referred as Knowledge Engine, will help Wikimedia in exploring new ways to make the search and discovery of high-quality, trustworthy information on Wikipedia and more accessible and open.

The grant is to be used "To advance new models for finding information by supporting stage one development of the Knowledge Engine by Wikipedia," reads the Knight Foundation's grant letter to the Wikimedia Foundation. And according to a post on its website, it "will support an investigation of search and browsing on Wikipedia and other Wikimedia projects, with the goal of improving how people explore and acquire information."

Here, the aim of Wikimedia Discovery and its Discovery team is to better encapsulate the scope and goals of the project.

The grant that was approved by the Foundation's Board in December 2015 wasn't disclosed at that time to the public. Wikimedia's initial reluctance to detail the restricted grant, was a factor in the departure of community-elected WMF board member James Heilman.

In the grant document from the Knight Foundation for media innovation, Wikipedia owner Wikimedia describes:

Knowledge Engine by Wikipedia will democratize the discovery of media, news and information - it will make the internet's most relevant information more accessible and openly curated, and it will create an open data engine that's completely free of commercial interests.

Today, commercial search engines dominate search-engine use of the Internet, and they're employing proprietary technologies to consolidate channels of access to the Internet’s knowledge and information.

Wikipedia Search

The Transparent Search Feature, Not A Search Engine

As one of the few internet services which have the most pages, Wikipedia is relying on search engines and its own search feature to point people to the things they seek. Wikimedia and its team of engineers are trying to create better links between all of the assets it owns so it can become a truly searchable open knowledge hub.

To make this happens, Wikimedia Discovery emphasizes into six areas:

  • Publication curation mechanisms for quality.
  • Transparency, telling users exactly how the information originated.
  • Open data access to metadata, giving users the exact data source of the information.
  • Protected user privacy, with their searching protected by strict privacy controls.
  • No advertising, which assures the free flow of information and a complete separation from commercial interests.
  • Internalization, which emphasizes community building and the sharing of information.

Wikimedia Foundation claims the Knowledge Engine to be the internet's first transparent search engine and the first one originated by the Wikimedia Foundation. But there has been a slight confusion where people sees it as building a 'search engine' when it is actually 'advancing its search feature'.

The rumor of the first concepts were revealed from investigations by Andreas Kolbe, a board member of Wikipedia's Signpost. Wikimedia and founder Jimmy Wales denied the project, calling the report as "a lie". Wikimedia tries to make things clear by saying:

We are not building Google… The Wikimedia movement's vision is to make the sum of all human knowledge freely available to everyone. Wikipedia is our largest and most well-known project, but there are many other projects like Wikimedia Commons and Wikidata that move us towards realizing our vision. These projects have millions of users every month! So, yes, if we can make a search system that's good enough and meets the needs of our users, people absolutely would use it.

"Through this project, the Wikimedia Foundation will test ways to make relevant information more accessible and investigate transparent methods for collecting, connecting and retrieving this information consistent with the values of Wikipedia and the open Web," the Foundation says.

Samantha Lien, a Communications Associate and spokesperson at Wikimedia has made further correction. In an email she sent to us at Eyerys, she said that:

We do not have plans to build a new search engine. Our objective is to improve people's ability to find content across Wikipedia and the Wikimedia projects. As the first phase of undertaking these improvements, we received a grant to research how people access and interact with Wikimedia content.

Wikipedia has no plans to compete with Google or any other companies for that matter.

With the funding, the Wikimedia Foundation has started its six months of deep research, testing and prototyping on user search habits and other practices they do on the Wikipedia encyclopedia. The research, tests and prototyping results will be shared and discussed publicly.

The initial discovery stage process includes an exploration of prototypes of future versions of Wikipedia.org which are "open channels" rather than an encyclopedia. It analyzes the query-to-content path, and embedding the Knowledge Engine "via carriers and Original Equipment Manufacturers".

A budget submitted by the Wikimedia Foundation and included in the grant announcement gave a total of $2,445,873, which will be divided among 14 of its staffs (eight engineers including programmers, two data analysts and four team leaders including a director and a vice president), hardware, and associated costs that include and not limited to travel and medical expenses.

The $250,000 grant is just the first stage funding for the $2.5 million project.

Wikipedia Concept Discovery

Exploring New Ways From An Old Idea

Search is nothing new to Wikimedia Foundation since it originally relied on a "homegrown" search engine written by Wikimedia volunteers and based off Apache Lucene (later replaced with Elasticsearch and the CirrusSearch extension integrated with MediaWiki sites). Founder Jimmy Wales himself was once focused on creating a search engine when in 2006 he made a passing comment regarding the possibility of a wiki-based internet search.

The result was Wikia Search by Wikia, a for-profit wiki-hosting company he founded in late 2004 with and Angela Beesley.

Wikia Search was an open-source search engine intended to challenge Google and introduce transparency and public dialogue. Wales stated that Google's random tests and its closed algorithm were different from the open, community-oriented crowdsourcing attempts of Wikia Search.

While Google and other search engines approach a given query by matching keywords, subjects and the popularity of pages, Wikia Search showcased short articles written by people on topics.

While Wikia did enhanced it with additional features, the project didn't last long. The development and its service were brought to a halt on May 14th, 2009.

Wales who abandoned the project, said that it had "not been enjoying the kind of success that we had hoped." But later during that time, he also said about his interest about search, saying that "I will return again and again in my career to search, either as an investor, a contributor, a donor, or a cheerleader."

The chance then came when Wikipedia and its sisters (Wiktionary, Wikiquote, Wikibooks, Wikisource, Wikimedia Commons, Wikispecies, Wikinews, Wikiversity, Wikidata, Wikivoyage, Wikimedia Incubator, and Meta-Wiki) were having more than 35 million articles across nearly 300 languages.

"As the quantity, quality, and diversity of this content has grown, so too has the challenge of helping people find the most relevant information," said Wikimedia in a blog post.

But Wikimedia isn't Wikia, and it's not building a "search engine". Instead, it's creating an "advanced search" that focuses on improving Wikimedia experience on areas such as relevance, user experience, multi-language support, multi-projects search, and incorporating new data sources.

Further reading: As Wikipedia Turns 15, The Online Encyclopedia Is Still Facing Its Evergrowing Pain