There is actually an important distinction in there, as the venerable Mr O would be delighted to explain.
You want your site to appear on Google, so you need to let the Googlebot in, but if they take that content, like news or user-reviews, and put it into a Google News or Google Comparison thingy then they just boosted their own 'competing' product by taking your hard work, and you can't stop them as you need to be Googleable.
If the snippet of News is too large, people don't need to click on the link (as most news is a small fact dressed up in puff). If they take your reviews they improve their product ranker and people don't then look for product rankers like yours. These are the things a Giant can step on. But remember, the *public* want to search for news summaries and reliable products quickly. Google may have to actively not do what the market demands in order to protect itself from the EU protecting the market.
Now, it looks like GoogleBot has a bunch of clones these days who declare what service they're collecting the data for, so you can choose which Bot you give what data to, so you can make yourself Googleable, yet not feed any ancillary Googlenesses. This is why I'm curious about how much of all this scraping noise is still *actually* valid, and hence why people are banging drums about it.