Skip to content
×
Try PRO Free Today!
BiggerPockets Pro offers you a comprehensive suite of tools and resources
Market and Deal Finder Tools
Deal Analysis Calculators
Property Management Software
Exclusive discounts to Home Depot, RentRedi, and more
$0
7 days free
$828/yr or $69/mo when billed monthly.
$390/yr or $32.5/mo when billed annually.
7 days free. Cancel anytime.
Already a Pro Member? Sign in here

Join Over 3 Million Real Estate Investors

Create a free BiggerPockets account to comment, participate, and connect with over 3 million real estate investors.
Use your real name
By signing up, you indicate that you agree to the BiggerPockets Terms & Conditions.
The community here is like my own little personal real estate army that I can depend upon to help me through ANY problems I come across.
Real Estate Technology
All Forum Categories
Followed Discussions
Followed Categories
Followed People
Followed Locations
Market News & Data
General Info
Real Estate Strategies
Landlording & Rental Properties
Real Estate Professionals
Financial, Tax, & Legal
Real Estate Classifieds
Reviews & Feedback

Updated over 12 years ago on . Most recent reply

User Stats

213
Posts
265
Votes
Kenneth E.
  • Davenport , IA
265
Votes |
213
Posts

Do-it-yourself Screen Scraping/ Data mining

Kenneth E.
  • Davenport , IA
Posted

Hi all. I am posting this info about this program because I couldn’t find it by searching in the forums search box, so maybe it will help someone who happens to be lost ……where I got stuck.

If you have access to a public website and need to ‘capture’ the data from it to create your database or spreadsheet for your leads (assuming your county doesn’t already offer the information in a spreadsheet format), the keywords you need to learn are “data mining,” “screen scraping”, and “data extraction” (I had never heard of either term to describe capturing info from a website until I went to craigslist and found someone under the ‘services’ section who offered IT services….and explained what I needed. They in turn taught me what it’s called which was the first step! Ha). These are the processes of electronically combing through websites and snatching specific data from them.

You can hire people to do this type of process for you, but if you are a small-time operation like I am, I ended up going with a service called Mozenda (www.mozenda.com). It is a program (you download to your computer) that allows you to customize it and how it clicks through the pages on a website and captures the data you need…on multiple pages. It’s a bit complicated to set up but the customer service people were SUPER helpful. Once it was set up, I pressed ‘go’ and turned it loose for the next few hours. It combed through numerous public records and pulled only the data I told it to. It did all the rest and put the results into spreadsheet format for me, which I then used to create mailers with (mail-merge, anyone?). :)

I recommend it because it worked for me. But, keep in mind the two disadvantages: 1. It can get costly if you have thousands of pages to take data from, and 2. It is a bit complicated to set up.

I understand that a program called Python can do something similar and cheaper….but it may require some programming experience (though you can probably hire some IT person through craigslist to do the programming for you).

Also, I have no affiliation with mozenda at all. Just trying to help others out.

Most Popular Reply

User Stats

16,121
Posts
5,816
Votes
Joshua Dorkin
#2 Questions About BiggerPockets & Official Site Announcements Contributor
  • BiggerPockets Founder
  • Maui, HI
5,816
Votes |
16,121
Posts
Joshua Dorkin
#2 Questions About BiggerPockets & Official Site Announcements Contributor
  • BiggerPockets Founder
  • Maui, HI
Replied

Kenneth Elshoff - Scraping data or content from websites, sometimes dubbed as "Automated Data Collection" is likely a violation of their terms of use.

Ann Bellamy - To continue my point above, just check out the TOS of the site you mentioned and you get the following:

You can certainly see that they would not be fond of scrapers.

I STRONGLY advise against scraping content. It is a fast way to find yourself in court. If a website is going to make their content available for use, they are likely to do so via an API.

Loading replies...