Tell me more ×
Answers OnStartups is a question and answer site for entrepreneurs looking to start or run a new business. It's 100% free, no registration required.

There are these websites that have raw data feed going on. I'm scraping those data, process them and present in a digest-able format to user, and charge users for viewing it.

What will actually happen is the user will go to a certain page from a website that I don't own, click the bookmarklet that I provide, which will inject a script to scrape data, process open up a popup or a floating modal window right in that page and display processed data.

Is that OK or not?

share|improve this question

3 Answers

It depends on the terms and conditions of each individual site, as scraping may be called out as a forbidden activity. It sounds like you are providing a data manipulation tool rather than a tradtional scrapper (scrappers normally run automatically from one or more machines and pull the data to a central place) as your requests are orginating from the distinct IP of the user which will receive the data. If the extractor actually requests from your server IP rather from the users machine you will likely be spotted and have your IP address banned.

If the site owner does not specifically forbid data extraction in the ts and cs, you should be ok, but be wary of how much data you pull as anyone who pays attention to data usage on a site might change the format of getting at the data.

share|improve this answer
Data won't be pulled from a server. In-fact the users will run my script right on their page pulling client-to-client on the fly. And only from that client, it will upload to my server. – Moe Sweet Oct 11 '11 at 8:05
Even if there are not terms of service, you are not ok because the default is that everything is copyrighted. Which by definition means it can't be copied. But it really depends on where the data is coming from. – Alain Raynaud Oct 11 '11 at 9:58
@AlainRaynaud: I don't think that's quite right. Yes, copyright is the default. But there's at least some sort of implied licence for every public webpage, because visitors need to copy the site simply to view it on their computers. How broad is that implied licence? As far as I know that's an open question in the US. – Zopa Oct 13 '11 at 17:25

What you have described (a) likely will constitute copyright infringement in many instances and (b) may violate terms of service in some instances, so the answer is "No, it's not [legally] OK."

Disclaimer: This information does not constitute legal advice and does not establish an attorney-client relationship.

share|improve this answer

This idea sounds similar to Instapaper. According to their privacy policy, Instapaper works with the publishers of the content they scrape to provide information about how people use the product to interact with the publisher's site. You would probably want to talk to a lawyer about this, but to me the Instapaper example indicates that there are legal ways to do this.

share|improve this answer

Your Answer

 
discard

By posting your answer, you agree to the privacy policy and terms of service.

Not the answer you're looking for? Browse other questions tagged or ask your own question.