Google's troll-destroying AI can't cope with typos • The Register Forums

Thursday 2nd March 2017 07:00 GMT Anonymous Coward

Google's AI

is fsck'ed

Thursday 2nd March 2017 07:24 GMT Anonymous Coward

Re: Google's AI

Since Google's geek army programmed it, that is probably the one misspelling it recognizes.

9 0 Reply
Thursday 2nd March 2017 12:30 GMT Rafael #872397

Re: Google's AI

Google AI is a *wonderful* idea that will benefit all mankind. I bet it will be as successful as Wave, Orkut, Google Reader, Knol, iGoogle, Buzz, and many, many others.

(wondering if this kind of message would be detected by Google AI)

13 0 Reply
Thursday 2nd March 2017 15:32 GMT Anonymous Coward

Re: Google's AI

"AI" seems to be appended to the name of any third-rate algorithm these days simply to add gravitas and, in Google's case, to help marketing and share price.

I would guess underneath it's nothing more than a word and phrase permutation spotter program, the like of which have been around for decades.

5 0 Reply
Thursday 2nd March 2017 16:13 GMT Oh Homer

Plz fx ths ggl

I nd bttr spm flterz 4 \/a1gra spm n ubfskat3d urlz.

4 0 Reply

Thursday 2nd March 2017 08:42 GMT frank ly

Details

I'd like to see the difference (if any) between "S c r e w you Trump supporters" and "B l e s s you Trump supporters". I suspect they are identical.

7 0 Reply

Thursday 2nd March 2017 08:47 GMT AndyS

Re: Details

No, one is a perfectly normal reaction to current world events, the other is offensive. Unless they sneezed.

9 6 Reply
Thursday 2nd March 2017 18:05 GMT Lord Elpuss

Re: Details

"No, one is a perfectly normal reaction to current world events, the other is offensive. Unless they sneezed."

Aaaaaaand that's why we can't trust AI. An automated system that picked up the snark in your comment and scored it accordingly would be seriously impressive :-)

5 1 Reply
Thursday 2nd March 2017 19:43 GMT Anonymous Coward

Re: Details

That comment doesn't even make it to snark.

1 4 Reply
Monday 10th July 2017 11:50 GMT Not That Andrew

Re: Details

You mean go full on old Southern lady on them? Bless their little hearts.

0 0 Reply

Thursday 2nd March 2017 09:02 GMT Gordon Pryra

Not only that but it seems biased towards Hillary as well!!

Personally I feel the words "moron" and "idiot" carry the same weight on the Toxicity scales.

I AM from the UK, so my tolerance of swearwords is probably higher than that found in the New World or amongst the other lesser countries. These are words taught in reception and primary education to give our British children a good platform on which to build their vocabulary.

But I digress. How can calling a Trump supporter a "moron" only get a score of 80%, whilst calling a Hillary supporter an "idiot" gains 90% toxicity!?

8 2 Reply

Thursday 2nd March 2017 09:23 GMT WaveyDavey

Ugh

Those deliberate mis-spellings look like the toxic sludge that the #DMReporter keeps pointing out.

2 0 Reply

Thursday 2nd March 2017 11:09 GMT John Styles

Is a bunch of regular expressions an AI now?

14 0 Reply

Wednesday 4th December 2019 05:21 GMT ttlanhil

REs are consistent and reliable.

And when they don't work, you can figure out why and how to fix them

Well... in theory.

"AI" is none of the above (as someone who's trained a few "AI" models... and fixed a lot of REs)

0 0 Reply

Thursday 2nd March 2017 12:16 GMT phuzz

Misspeling

If I was writing an automated tool to look for trolls, pretty much the first thing I'd do is flag up the ones full of spelling errors. It's not a foolproof test, but it'll pick up 90% of the idiots.

15 0 Reply

Thursday 2nd March 2017 13:30 GMT Ugotta B. Kiddingme

Re: Misspeling

"If I was writing an automated tool to look for trolls, pretty much the first thing I'd do is flag up the ones full of spelling errors. It's not a foolproof test, but it'll pick up 90% of the idiots."

To further improve the catch rate, include grammar detection. For example: "They're st.upid, it's getting warmer..." only scores 2% toxicity, whereas "Their st.upid, its getting warmer..." should score at least as high as the original phrase.

8 0 Reply
Wednesday 4th December 2019 05:21 GMT ttlanhil

Re: Misspeling

butt... illetracy r kewl, mkay?

Plus twitter is often a source of training data for language-based "AI" work...

I'd like to see how well you can handle all of the common abbrevs

0 0 Reply

Thursday 2nd March 2017 12:44 GMT matchbx

It's a completely useless fight

Most forums and comments are already set up to block most swear words, which is the reason folks started using s.w.e.a.r words.

If you train some new AI bot to block s.w.e.a.r words, then folks will simply start using sVVear words....

They might as well be trying to push a 10 thousand pound boulder up hill with a twig.... during an ice storm.... at night....... while an earthquake is happening.....

6 0 Reply

Thursday 2nd March 2017 13:08 GMT Swarthy

Re: It's a completely useless fight

So you are saying that AI filtering is a Sisyphean task?

I don't know.. they could just team up with Microsoft and use Tay to train the filter (while allowing 4Chan to train Tay)

3 0 Reply
Thursday 2nd March 2017 21:46 GMT Nat C.

Re: It's a completely useless fight

Exactly. This is how sexually explicit material became known as "pr0n" in Internet slang -- because forum filters were blocking the word "porn". Nothing new at all....

2 0 Reply

Thursday 2nd March 2017 12:59 GMT not.known@this.address

EsSEX and Scunthorpe are in trouble again then...

So if I post that "$NAME is a bastard" because his (or her, mustn't be sexist!) parents are not married and that is what the word means, I will get in trouble?

And how do they decide which words or phrases should be "banned"? If it is by number of complaints, how many people have to complain before a word or phrase ends up on the hit list, and how many have to support it before it can be marked as 'do not delete'? It doesn't take much imagination to see the possibilities here, surely - "My name is Joe and I think the word [insert name of religion here] should be banned, and so do my $NUMBER mates"...

Maybe we should start calling the people behind this idea the relevant bit of Sgooglehorpes instead?

5 1 Reply
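
For anyone who hasn't met the Essex/Scunthorpe problem mentioned above, here is a minimal Python sketch of how it arises. It assumes a naive filter that looks for blocked terms anywhere inside the text; the `BLOCKLIST` contents and both function names are made up for illustration and say nothing about how Google's tool actually works.

```python
import re

# Hypothetical blocklist, for illustration only.
BLOCKLIST = ["sex"]

def substring_hit(text: str) -> bool:
    """Naive filter: flags the text if a blocked term appears anywhere in it."""
    lowered = text.lower()
    return any(term in lowered for term in BLOCKLIST)

def whole_word_hit(text: str) -> bool:
    """Stricter filter: flags only blocked terms that stand alone as whole words."""
    pattern = r"\b(?:" + "|".join(map(re.escape, BLOCKLIST)) + r")\b"
    return re.search(pattern, text, flags=re.IGNORECASE) is not None

print(substring_hit("I live in Essex"))   # the naive filter trips on the place name
print(whole_word_hit("I live in Essex"))  # the whole-word filter lets it through
```

Whole-word matching avoids the false positive on place names, but it does nothing about deliberate misspellings, which is the other half of the complaint in this thread.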

Thursday 2nd March 2017 13:23 GMT Cuddles

Benign settings

"Machine learning models are generally designed to yield the best performance on clean data and in benign settings."

While that may generally be the case, you'd hope software specifically written to detect non-benign situations would avoid assuming benevolence.

2 0 Reply

Thursday 2nd March 2017 13:25 GMT Moosh

"to provide an automated way to detect "toxic" language in social media"

Might I ask... Why?

Will I now have the pleasure of being reported to her majesty's finest automatically whenever I tell someone to f-ck off and call them a moron on social media?

How do they define social media? Facebook? Twitter? Message boards? Comments sections?

At what point does saying "I disagree" become a punishable offence?

5 1 Reply

Thursday 2nd March 2017 14:06 GMT m0rt

Re: "to provide an automated way to detect "toxic" language in social media"

"At what point does saying "I disagree" become a punishable offence?"

When you are being told that you need to give your password/keyphrases to the plod.

0 0 Reply
Thursday 2nd March 2017 15:27 GMT Anonymous Coward

Re: "to provide an automated way to detect "toxic" language in social media"

"At what point does saying "I disagree" become a punishable offence?"

I guess that would be right around the point when your style is overly offensive, for locally defined values of "overly". And who said anything about Her Majesty? So-called western governments will likely never have anything to do with the whole mess-- to be silenced within a community *by* the community is the only "punishment" that I'd ever expect to come down. Just disagreeing and/or flatly saying so isn't ordinarily offensive, or if it is, that particular community isn't where I'd want to be ITFP and that conversation probably wasn't worth having. Of course, mindlessly repeating 'i disagree' would be childish and qualifies as offensive style if not offensive substance. So it's a hard problem, like self-driving cars needing to be able to weigh risks. And then, sarcasm... good luck with that one.

As far as why... limited moderators with limited serotonin, I guess.

obligatory xkcd ref (alt text ftw)

obligatory RvB PSA

2 0 Reply

Thursday 2nd March 2017 14:43 GMT Bob Wheeler

Out in the real world, hygiene and benevolence cannot be assumed.

Have you met my co-workers?

5 0 Reply

Thursday 2nd March 2017 15:46 GMT Sleep deprived

Must be based on the Gmail spellchecker...

Given that the spellchecker in Gmail often has no suggestion to make when a letter is missing or swapped or mis-accented in a word, this should be no surprise. Haters can keep on hating safely.

2 0 Reply

Thursday 2nd March 2017 18:06 GMT John Styles

Re: Must be based on the Gmail spellchecker...

Glad I'm not the only person to think this about the Google spelling checker. I have noticed in particular that it is utterly useless at spotting obvious typos where you have mistyped the first character of the word.

1 0 Reply

Thursday 2nd March 2017 15:48 GMT Steve Evans

I wonder what it would score "spawny-eyed parrot-faced wazzock"

https://youtu.be/I2AcJSkUw6M?t=1m18s

1 0 Reply

Thursday 2nd March 2017 16:02 GMT Anonymous Coward

You don't even need swearwords to troll some people.

Some people are so lacking in terms of intelligence that you can criticise them to their face and any AI would be none the wiser to pick it up.

1 0 Reply

Thursday 2nd March 2017 16:08 GMT Anonymous Coward

Re: You don't even need swearwords to troll some people.

er... "No AI would be the wiser" sounds better, on second thought. Correcting myself.

0 0 Reply

Thursday 2nd March 2017 17:35 GMT GrapeBunch

Willie Bee

Google AI: the real reason he's William the Conqueror.

Analbuddy hoo votted gogg.le iz.za wasteof.space

No deprecation intended, just wanted to raise the spectre of insulting terms also being possible to interpret as links, and th.us triggering different automation modules in filter.space

0 0 Reply

Thursday 2nd March 2017 18:28 GMT Stevie

Bah!

So El Reg's articles about Yahoo should still get indexed by headline in the Google search results then.

0 0 Reply

Thursday 2nd March 2017 19:41 GMT Long John Baldrick

Isn't there a way to remove ..

characters that are not alphanumeric?

Just sayimg.

0 0 Reply
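
Stripping non-alphanumeric characters is easy enough to sketch in Python. The example below assumes a plain word blocklist applied after normalisation; the `BLOCKLIST` words and the function names are hypothetical, and this is not a description of what Google's tool does.

```python
import re

# Hypothetical blocklist, for illustration only.
BLOCKLIST = {"stupid", "idiot", "moron"}

def strip_non_alnum(text: str) -> str:
    """Lower-case the text and drop everything except a-z, 0-9 and whitespace."""
    return re.sub(r"[^a-z0-9\s]", "", text.lower())

def flagged_words(text: str) -> list[str]:
    """Return the blocklisted words found once punctuation has been removed."""
    return [word for word in strip_non_alnum(text).split() if word in BLOCKLIST]

print(flagged_words("They're st.upid, it's getting warmer"))
print(flagged_words("i d i o t"))
```

This catches the dotted "st.upid" trick, but letters separated by spaces still slip through, and substitutions such as "VV" for "w" are untouched, so it only moves the goalposts.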

Thursday 2nd March 2017 20:21 GMT Anonymous Coward

Nationalism(ist) is flagged?

Pisses me off that the word is flagged at all, this is the crap they are using on Reddit, Twitter, and FB right now to flag anyone who isn't a globalist shill. Sorry that people aren't like Germany who are afraid to wave their OWN flag. https://www.youtube.com/watch?v=_Rcc7xgD2dM

Or declared the Christmas Market Terror attack an "Accident".. http://www.express.co.uk/news/world/773602/Germany-Berlin-Christmas-market-compensation-terror-ISIS-attack-Fabrizio-Di-Lorenzo-lorry

1 1 Reply

Friday 3rd March 2017 02:06 GMT Herby

I'm just happy that...

El Reg doesn't have such filters on stories or comments.

I suspect that there are subjects that get 80%-90% ratings in either comments or the stories.

Of course :-) I would never get such a score.........

Trolls?? Nah, wouldn't happen here :-).

2 0 Reply

Friday 3rd March 2017 08:39 GMT Gordon Pryra

Not much point in this anyway

As has been said, most forums have filters to block the obvious and the less obvious (forum mods generally know the language of their posters, with all its variation in spelling). Add to this that the MOST abusive posts are generally in pretty good English, not requiring any swearing or nasty words.

The idea is to hurt the person you are trolling, and generally this is accomplished by making them feel small. Physical size doesn't come into it, so the attacker can generally beat their target into submission by putting up a post that is just "better written" than the target can respond to. (A case of the small people being able to pick their battlefield and attack from a position of strength.)

Check out some of the threads on El Reg for examples: those in the lower leagues tend to be left looking like the only surviving brain transplant donor by those who can string a few non-swearing insults together.

And then we have the final issue of language and words having their meaning changed, or just having a generally accepted double meaning. Those mugs at the AI lab will have a hard time working out what's an actual attack by only looking at the language being used. (See what I did there? English is great for this kind of thing.)

Syntax and placement of words are almost as important as the words themselves, but I guess that is the point of using this as a training ground for an AI. After all, a self-learning system would come out of this exercise either totally broken and crying or able to work for IGN forums as a mod.

2 0 Reply
