Interview With Simon Descarpentries from Meta-Press.es

"We should not rely on search engines to free us from the effort to know the world"

The Meta-Press.es search engine lets you explore the news without middlemen or trackers. Creator and lead developer Simon Descarpentries has been a free software enthusiast for over twenty years. A former Framasoft employee, he is currently CEO of Acoeuro.com and treasurer of the Fund for Defense of Net Neutrality. We interviewed Simon for FreeWebSearchDay. You can listen to the recording of the interview or read the edited transcript below.

Hand-picked sources

Question: Can you tell us something about Meta-Press.es?

Answer: Meta-Press is free software that lets everyone search through the online press. With one click you can search through 930 online newspapers. All these sources are indexed one by one by humans. Currently, if you query GAFAM news search engines, you'll get results which include fake newspapers. They just copy content from other sites to sell advertising. These search tools can’t tell the real sources from the fake ones because everything in the process is automated. In Meta-Press we did it all by hand, so every source is verified by a human. Meta-Press also includes about 300 newspapers which give free access to their content because that is the economic model they chose. Unlike Google News, Meta-Press does not lead you to a dead end, where the article exists but you can’t reach it.

No censorship

Meta-Press is a Firefox browser add-on and is free software, accessible to everyone. It's built with a software architecture which guarantees there are no bottlenecks, no single point of failure and no censorship, because once you install the add-on in your web browser your requests are not sent to a hypothetical Meta-Press server. Instead, your computer is instructed how to do the search. It’s your web browser that has gained the superpower to request results from nearly 1000 newspapers. It is therefore virtually uncensorable, because you have plenty of newspapers on one side and plenty of computers on the other. And that is how the internet works: no central point that we all have to cross. The more people use Meta-Press, the better it will work.
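Editor's illustration: the client-side architecture described here can be sketched in a few lines. This is not Meta-Press code (the add-on itself runs in the browser); it is a minimal Python sketch under stated assumptions. The function search_all, the three source functions and the example.* addresses are all hypothetical stand-ins, and each "source" is a plain function so the sketch runs offline. It shows the two properties claimed above: the client queries every source itself, and one unreachable source does not break the search.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable

# A "source" is any function that takes a query and returns a list of results.
# In a real add-on this would be a request to one newspaper's own search page;
# here the sources are hypothetical stand-ins so the sketch runs offline.
Result = dict[str, str]
Source = Callable[[str], list[Result]]


def search_all(query: str, sources: dict[str, Source]) -> tuple[list[Result], list[str]]:
    """Query every source from the client; return (results, failed source names)."""
    results: list[Result] = []
    failed: list[str] = []
    with ThreadPoolExecutor(max_workers=8) as pool:
        futures = {name: pool.submit(fetch, query) for name, fetch in sources.items()}
        for name, future in futures.items():
            try:
                results.extend(future.result())
            except Exception:
                # One unreachable newspaper must not break the whole search.
                failed.append(name)
    # Newest first; ISO dates sort correctly as plain strings.
    results.sort(key=lambda r: r["date"], reverse=True)
    return results, failed


# Hypothetical sources, for illustration only.
def daily_example(query: str) -> list[Result]:
    return [{"title": f"{query} in the news", "url": "https://daily.example/1", "date": "2023-09-02"}]


def weekly_example(query: str) -> list[Result]:
    return [{"title": f"A week of {query}", "url": "https://weekly.example/7", "date": "2023-09-05"}]


def blocked_example(query: str) -> list[Result]:
    raise ConnectionError("source unreachable")


SOURCES = {"daily": daily_example, "weekly": weekly_example, "blocked": blocked_example}
found, unreachable = search_all("net neutrality", SOURCES)
```

Here found holds the two reachable results, newest first, and unreachable lists only the blocked source. No server sits between the reader and the newspapers, so there is nothing central to block or overload.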

Problems with search today

Q: What problems do you see with search today?

A: My main concern with GAFAM news search engines is what I mentioned before: they serve fake newspapers. It is easy to feed those engines fake content designed precisely to get through their ranking algorithm. And since nobody ever checks, you can fool it. You can do it, a government can do it, a company can do it: the system is flawed. We need something else. It is bloated, dishonest and not working anymore. And it will only get worse because there is money at stake. And there are political stakes too.

Mass surveillance as an economic model

Another big problem with search online is that it's currently dominated by a handful of megacorporations following no rules except their own and whose economic model is based on mass surveillance. They follow everyone, knowing who came back to a website, what they were interested in and what they clicked on. If you had a screen next to the newspaper displaying all the information they extract from your online activity, you would just turn off the computer. They sell that information to companies, governments and political parties. That is not a conspiracy theory. We know it to be true from Snowden and the Cambridge Analytica scandal. It would be more comfy to live in a world where we could forget about those scandals, but it’s a reality we have to face.

Decreasing accuracy of search results

A third problem is that the accuracy of Google search is decreasing. More and more content is created just to score high in their ranking algorithm. It shows up even if it was not exactly what you wanted to see. It's called an injection attack. Every website should be protected from injection attacks. Google is an open security breach regarding injection attacks. That is a technical problem in search today.

We're losing our ability to be inquisitive

And last but not least, regular humans are abandoning their ability to search in favor of those dishonest tools. You could compare it to GPS. When you use GPS for navigation you’ll slowly lose your ability to read a map, along with other navigation skills. If GPS were dishonest and provided you with the wrong information, you would get lost and abandon the tool. It is exactly the same thing with online search. The tool is dishonest and you have to abandon it. You need to work on your skills to search for things: cross-check your information, compare your sources, publish your results so others can verify them. You must become a bit of a journalist. As Viktor Lofgren of Marginalia Search said: the more you use Google, the more you'll become fenced into a small park Google allows you to reach. But the best search tool is to think.

Information has a price

Q: How does Meta-Press address these problems?

A: Meta-Press addresses the fake news problem because it only searches through sources that are validated by humans. You're guaranteed to search in real newspapers with articles written by real humans. Humans who were trained and paid for it, which is what we call journalists. This type of information has a price and we should pay that price. If you do not pay for the information you get, then it's you who is the product sold in the transaction. And you're not getting the information that you need. You should pay for the services you use or run them on your own computer. If you don’t change the way the world is turning, it will continue to turn the wrong way.

A solution to decentralized indexing

There are multiple projects working on free software search engines to make general purpose search available, such as YaCy, Searx, and Marginalia. They are addressing a difficult problem: how to make a distributed index of the world that is reliable and honest. With Meta-Press we addressed this by limiting our scope to online newspapers. Newspapers provide honest indexes, because their reputation is at stake. With Meta-Press we just stitched those indexes together. This is a small window onto the rest of the world, with only information created by journalists. But hopefully it is information that covers the entire world, because journalists are looking everywhere.

General purpose indexes are a complicated problem and I am better at solving simple problems. When I started the project it was small compared to Google. But I decided I would pull one hair out of the head of Google. I invite you to do the same. Pull your own hair out of the head of Google and we will win quite fast. It is my hummingbird (colibri) approach to this thing. You address a small part of the problem but you address it well.

The future of search should be collaborative

Q: What do you think will happen with search in the future?

A: ChatGPT could be the end of Google Search in a year if it continues to rise like this. Not because the results are better. ChatGPT is a stochastic parrot. It has no idea what it says. You can ask it who the presidents of the French Republic were and you get the right list. Ask it who the female presidents were and you'll also get a list. But no such list exists. ChatGPT works because it is simple for people to use. That is always what wins.

ChatGPT works with so-called AI algorithms and fortunately for us they are not the same kind of AI as in the Matrix movies. It’s just a statistics matrix, and this technology, which is 40 years old, can be used for the good of humanity. For instance you have Pl@ntNet. You send it a picture of a plant and it will tell you what it is. This is access to knowledge. This is the same technology but used in a good way. This is search made well. iNaturalist.org helps you find out which insect you are seeing. BirdNET from Cornell University will tell you what bird you are hearing.

These are examples of the online search tools I am most excited about. They are collaborative efforts. The more you use them, the more accurate they get. Take Pl@ntNet for instance. If you upload a picture of a rare plant, something that is missing in their database, they will display a pop-up inviting you to upload more photos of it once it is flowering or the seeds are fully grown, to improve the database. That way humanity works together to get better knowledge of what surrounds us. That looks like the way to go for me. Going to Mars is not an interesting thing to do as long as we don’t have maps of the ocean floors.

We mustn't rely on search engines to know the world

Q: Is that how you would like to see search evolve in the future? It becoming more collaborative?

A: Yes. There are two sides to the problem. Is it search that has to be improved? Or is it the way we publish things? Searching for something in the library is easy because it is an organized world. You have shelves, you have books in alphabetical order, and that works great. If we publish things better we won’t have a problem searching for them. But hoping that search engines will free us from the effort to know the world and to sort it is a bad way to go. It won’t work, and it will trap people like small birds caught in glue.

Keep in mind the limits of the dream sold to us nowadays. Artificial intelligence is just another algorithm to sort things automatically and we have absolutely no idea of what the content is. Tools won’t solve society’s problems. If humans work together it will be better.

Collaborative efforts to discover the world and to map it, like OpenStreetMap, are the way to go. People working together and sharing knowledge that helps us know the world and search through it. For instance, you can help Meta-Press map the newspapers of the world. It is a collaborative effort, and the project is open to all your contributions. Help us to map the world! And thanks to NLnet it will soon be possible for people who aren’t computer science engineers to do this, I promise.

Ways to get involved with Meta-Press

Q: How can we contribute to Meta-Press?

A: You can add your own sources to Meta-Press. On Meta-Press.es you will find plenty of documentation on how Meta-Press sees the world and how it can see your favorite newspaper. It’s currently a long process, accessible only to someone who knows how to write a CSS selector. But with funding from NLnet I am working on an interface in which you will just have to copy and paste the address of the source and click: "here is the search engine, here is the title of the results, here is the link", and that will be enough.

Once you have this result, it is recorded in Meta-Press. A button in the settings lets you manage your local sources, and there you will find the JSON object describing how to fetch the results from this source. You can send it by mail to Meta-Press or you can create a pull request on FramaGit.org. You can reach me via IRC or Mastodon: no limitation, free communication, free software.
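Editor's illustration: to give an idea of what "a JSON object describing how to fetch the results" could look like, here is a minimal Python sketch. It does not reproduce the real Meta-Press source format. The field names (search_url, result_item, title_link), the Example Gazette and its address are hypothetical, and the sketch assumes that the title and the link of a result sit on the same anchor element. Only the simplest selector form, tag.class, is supported.

```python
import json
from html.parser import HTMLParser
from urllib.parse import quote_plus, urljoin

# Hypothetical source description; NOT the real Meta-Press schema.
SOURCE = json.loads("""
{
  "name": "Example Gazette",
  "search_url": "https://gazette.example/search?q={}",
  "result_item": "li.hit",
  "title_link": "a.headline"
}
""")


def search_url(source, query):
    """Fill the query into the source's search address."""
    return source["search_url"].format(quote_plus(query))


def matches(selector, tag, attrs):
    """Match only the simplest selector form, 'tag.class'."""
    want_tag, _, want_class = selector.partition(".")
    classes = (dict(attrs).get("class") or "").split()
    return tag == want_tag and (not want_class or want_class in classes)


class ResultExtractor(HTMLParser):
    def __init__(self, source):
        super().__init__()
        self.source = source
        self.item_tag = source["result_item"].partition(".")[0]
        self.title_tag = source["title_link"].partition(".")[0]
        self.depth = 0        # > 0 while inside a result item
        self.current = None   # result being collected, if any
        self.results = []

    def handle_starttag(self, tag, attrs):
        if self.depth == 0:
            if matches(self.source["result_item"], tag, attrs):
                self.depth = 1
            return
        if tag == self.item_tag:
            self.depth += 1   # nested element with the same tag name
        if matches(self.source["title_link"], tag, attrs):
            self.current = {"title": "", "link": dict(attrs).get("href") or ""}

    def handle_data(self, data):
        if self.current is not None:
            self.current["title"] += data

    def handle_endtag(self, tag):
        if self.depth == 0:
            return
        if self.current is not None and tag == self.title_tag:
            self.current["title"] = self.current["title"].strip()
            self.results.append(self.current)
            self.current = None
        if tag == self.item_tag:
            self.depth -= 1


def extract(source, page, base):
    """Return the results found in a page, with absolute links."""
    parser = ResultExtractor(source)
    parser.feed(page)
    return [{"title": r["title"], "link": urljoin(base, r["link"])} for r in parser.results]


# A made-up result page, standing in for what the newspaper would return.
PAGE = """
<ul>
  <li class="hit"><a class="headline" href="/2023/net-neutrality">Net neutrality vote</a></li>
  <li class="ad"><a class="headline" href="/promo">Sponsored</a></li>
  <li class="hit"><a class="headline" href="/2023/open-maps">Mapping the ocean floor</a></li>
</ul>
"""

url = search_url(SOURCE, "net neutrality")
hits = extract(SOURCE, PAGE, url)
```

The point of the design is that the description is pure data: a contributor who can name the right selectors can add a newspaper without writing any code, and the same description can be shared by mail or pull request.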

Adding sources is one contribution you can make. But every kind of help to the project is welcome. You can help with translations. Translation is managed with Weblate, which is a great free software project and enterprise. You can also help the project by just talking about it. Introduce it to people or to your local university. There are a lot of configurations possible in Meta-Press, to search for a specific topic or within one language or country. Just try it, use it and make it known.


Funding

Meta-Press.es received funding through the Search and Discovery fund. The funds are established by NLnet.nl with financial support from the European Commission's Next Generation Internet programme.

Do you also have an open source project that needs funding? You can apply for one of the theme funds of NLnet.

