Forget future generations
I'm wondering why we're trusting a Californian consumer data processing corporation which sells ads, right now.
Google has never liked privacy laws, and it really hated Europe’s “Right to be Forgotten” ruling in 2014. The RTBF ruling gave private citizens the right to ask for the removal of search entries that contained personal information that was old and irrelevant, for which there was no public interest. It was a privilege …
I'm wondering why we're trusting a Californian consumer data processing corporation which sells ads, right now.
I have a professional duty to read through Terms & Conditions before I accept anything. I don't have the luxury of TL;DR because the buck stops with me, and what hides in the T&Cs of Google, Facebook and any other volume "free" provider is enough to avoid them like Ebola.
Google's are actually the worst of the lot, because they snuck perpetual licensing conditions in there (although they changed the words in later versions because that was obviously far too direct and alarming), even after you cancel your account. You may be forgotten, but all the useful stuff they'll keep for themselves anyway, and thank you very much for that lovely free content.
However, that's not the worst trait IMHO. The worst aspect about Google is their sanctimonious stance as soon as something happens that may improve the rights of their users or oh-my-God-that-should-NEVER-happen worse: something affects their ability to milk money out of the personal details of their victims.
So, I hope they make good on their threat to leave Europe. It'll be a fine day, and I'm looking forward to it.
This is excatly right - you can even see it in the screenshots. The Bing search engine ignores quote marks, whereas Google uses quote marks to ensure that exact sequence of words appears in the page. This is verified by the bootnote, which states that when the quote marks are removed, the Google results return thousands of results.
This is just very sloppy/blinkered reporting from The Register.
It is important to report any potential negative effects even if the data isn't complete. You can clarify later if necessary. I have lots of rules I have to obey regarding stuff like this.
If you're manipulating data for the public it requires establishing clear priorities and no small measure of responsibility.
I would assume "data" is too common a word. While Bing etc are happy to search with that term, Google may have removed it from a library or two of search terms to save processing power.
For a fare comparison, we'd need the search without the abbreviation (I find these rarely work well on Google) and without the speech marks.
Perhaps you could use a search engine to check ?
Quote
"The Basics
Bing’s basic search operators work similarly to Google’s. Search for an exact phrase by surrounding it with quotes:"
http://www.howtogeek.com/106751/how-to-use-bings-advanced-search-operators-8-tips-for-better-searches/
http://bingtricks.com/bing-advanced-search
Sure, bing supports quotes, but actually try the query, you'll note that the results it returns don't actually contain the exact phrase. In bing, the quotes make it push results with the exact phrase ahead of those with just the words, with Google, you just plain don't get results lacking the exact phrase.
I just tried the search strings myself.
With Google, I got "1020 results", which somehow translates to only one page of actual clickable links. (12 results, to be exact, of which one was to this story, and one other was to a word-for-word copy of it appearing on myinforms.com. Two more were to another Reg story of two days ago, and a copy of that appearing in sffchronicles.com.)
One particular interesting result is a paper at the University of Otago, New Zealand, which contains the phrase "right to be forgotten", but not "RTBF". Hmm.
With Bing, I got three results. They didn't include this story, but did include the myinforms.com copy, so at least I wouldn't be missing out.
Trust Google? Well, maybe I'll think a little better of them when they give me a way to see the other 1008 results. But probably not.
That is indeed for online advertising. You get a bit blinkered when you're working in online media and see the main revenue stream drying up before your eyes.
"Google finds only their question (the one you just read)."
Had that just recently. I'm on a contract with a Major Financial Institution(tm) which uses an obscure (but commercial) library to connect to some pretty old hardware. If there was any documentation for it, it's been lost years ago.
So I go onto Google to see if there's any guidance available. I got a little over a page of results, every one of which was from former employees/contractors of this noble institution, all asking the same thing as me.
Talk about consistent results!
When I search for the same string I get a "full" set of results in Google ( about 501 results). I think Google just don't like you.
The irony of your comment is that that is indeed actually possible for Google, which is a worrying development in itself.
Having said that, I don't think it changes the content of the article much as it's still quite on the ball. Even if the stats don't quite add up, there's something profoundly evil about Google which everyone is at pains to keep looking away from, a sort of Microsoft 2.0.
If you use the acronym RTBF then you don't get that many results, but use "right to be forgotten", with or without the data processing business bit, and you get thousands. Perhaps Google's search algorithms don't give much weight to fairly obscure acronyms? I think you're letting your attitude to Google lead you to paranoid conclusions.
Thank you Credas for explaining the acronym. I got all the way to your comment while wondering what the hell was going on between Google and a Belgian broadcaster.
Bing: RTBF "data processing business" : About 10,000 results. Of those which do have the actual phrase "data processing business" AND RTBF - about zero.
For this conspiracy to work the search engines have to process the search string the same way and you have to find pages that have the specific phrase and other search terms on the same page and is indexed but isn't shown.
I prefer Google's stance on the search terms. Anything you have in quotes must appear exactly as stated on the page with a verbatim option to stop allowing replacement for similar words.
Ding Ding! Correct answer.
If you look at the article screenshots, you can see the Google results highlighting the quoted string. In the Bing result, you see single separated words highlighted, meaning Google searched for two rare strings, and Bing etc searched for one rare string, and up to three common ones.
Right now, Bing only returns two results for rtbf "data processing business", at least for me. And both of them are on the Register. Google says it finds a thousand results… But there's actually only one page of them. Go figure.
"Right now, Bing only returns two results for rtbf "data processing business", at least for me. And both of them are on the Register"
That implies that before it spidered the Reg article it had NO matching results for BOTH fixed strings, so Bing stripped the quotes and tried again.
Which implies that Bing's results pool is smaller than Google's.
The dumbing-down of search engines over the last 10+ years has left us with a situation where the majority of the popular ones (except Google, interestingly) completely ignore double-quotes as a way of trying to specify only a specific string of words in a particular order.
So the first problem is that the non-Googly services are likely interpreting your query entirely differently than how Google does.
And then of course there's the infamous "tunnel vision" issue with Google and others serving you customized results. Are you searching from a clean device/browser, cookies cleared, not logged-in to any other Google services/sites?
Just tried it on Dolphin (with all cookies nuked). "About 1030 results", of which this article is in top position. Is using Google.fr as Google.com redirected me based upon location.
And then of course there's the infamous "tunnel vision" issue with Google and others serving you customized results. Are you searching from a clean device/browser, cookies cleared, not logged-in to any other Google services/sites?
The author covered that by using "duck duck go" no? They also got the same results.
Actually he described DDG as "Google scraping", eg, uses Google search results.
Which would explain that particular bit, but I was actually not of the impression that DDG used Google as a source. I thought they pulled their results from a variety* of sources, including their own spider.
*(~400 sources, was the figure I recall. Neither do I think Google is amongst them.)
Google fiddle with their search algorithms all the time, and these days it tries to fit the results to what it "thinks" you are looking for or likely to purchase. In terms of accurate search "results" it's been deteriorating for some time. Given their market share of course we have to kowtow to Google but functionally they are following the same evolutionary path as most large tech companies - sooner or later (it could be 10 years or so) their search engine business will hire a Marissa Mayer-like leader to revitalize their market position and that will be the end of them after a few more years.
to get your name on the Interwebs in the first place.
If you ain't there, you can't be forgotten now can you.
As this applies to most of the population then...
As for me, (posting anon so as to not temp fate) I have a name that is not that uncommon (there were three of us in my secondary school). I google for myself about every six months and so far (touching wood) the Chocolate Factory does not have a file on me.
Long may that continue.
"Why would any mature society trust its digital archive to a Californian consumer data processing corporation that sold ads."
Why would any mature society allow the same company to build a database of everyone's reading list, the music they listen to, the videos they watch, where they go, and who they communicate with?
If this was a State we'd have some very harsh words to say about it. As it is this company appears to be at the centre of governments. Pumping $millions into political lobbying every week or so.
Adam 52 is correct.
If you use ixquick (the EU's favoured search engine) you get just the Bruce Schneier link plus this very article from El Reg if you include the quotes in the search string.
Without them you get pages and pages and pages...ixq doesn't seem to give a hit count by default and I don't use it much so I can't be arsed looking for the setting.
Sorry Reg, but this story is nonsense, and you can see this just by looking at the screenshots you processed:
1) Bing is finding extra results because it's ignoring the double quotes and finding results that contain the words "data" "processing" and "business" even when they're not consecutive. (this is actually exactly why I don't use Bing - respect the user, dammit!)
2) Bing is also finding extra results because it's treating RTBF as a synonym of the string "Right to be forgotten", and is matching on both. Google is only showing results containing the acronym "RTBF". Again, Bing is trying to be clever on the user's behalf, Google is being strict to what the user asked for.
No conspiracy. Move along, people, nothing to see here...
They do this all the time.
When they first introduced their data collection exclusion option, you could not find it via google once you gave up the fruitless hunt via their own instructions.
Other search engines found it easily.
BTW. I used google UK to search for rtbf "data processing business" and see 9 results, expandable to "About 1,010 results" 4 pages of results.
If I try to access one of the other pages, google says there are no results.
I just did the search through Tor and this is what I got
Google.ca:
403. That’s an error.
Your client does not have permission to get URL /sorry/IndexRedirect?continue=https://www.google.ca/search%3Fsclient%3Dpsy-ab%26site%3D%26source%3Dhp%26q%3Drtbf%2B%2522data%2Bprocessing%2Bbusiness%2522%26btnK%3DGoogle%2BSearch&q=CGMSBE33taIYl7bfuAUiGQDxp4NL_ZQribTKbn6YleNEIVl9i7_S-Ms from this server. That’s all we know.
DDG:
the same as El Reg
Bing
same as DDG
Yahoo.com
Sorry, Unable to process request at this time -- error 999.
Yahoo!
Unfortunately we are unable to process your request at this time. This error is usually temporary. Please try again later.
If you continue to experience this error, it may be caused by one of the following:
You may want to scan your system for spyware and viruses, as they may interfere with your ability to connect to Yahoo!. For detailed information on spyware and virus protection, please visit the Yahoo! Security Center.
This problem may be due to unusual network activity coming from your Internet Service Provider. We recommend that you report this problem to them.
While this error is usually temporary, if it continues and the above solutions don't resolve your problem, please let us know.
Return to Yahoo!
Please try Yahoo! Help Central if you need more assistance.
from the article: "Or is it? Google's defence in the RTBF trial was that it was a mere "data processing company""
so why google for "data processing business"
surely it would be more accurate to google for "data processing company" ??
and as much as i don't like google, if others have the right to have listings removed, then surely that give Google the right to remove listings about themselves that they don't want? Good for goose and all that. Sometimes you have to be careful what you wish for...
Read the comments you just don't understand the way Google searching works. It's no mystery, if you search for RTBF "data processing business" with verbatim on you still only get a few results (more than the initial two as some sites have added this story.
However the reason you are getting over 1000 results is due to Google deciding to expand the initialism and search for both RTBF and Right To Be Forgotten, which it wasn't before. So it is now allowing more ways to find the information. Bing et al are still ignoring quotes for phrases with limited results.
This really is a non-story with the current 'evidence' provided.