Quantcast
Channel: The Unofficial Scrapebox FAQ
Viewing all 240 articles
Browse latest View live

Pinging your links to get them indexed

$
0
0

Pinging your links to get them indexed

If you want to Ping your links to get them indexed you need to use the RSS ping function, which is labeled simply RSS in the commenter section of scrapebox. The option labeled PING is for inflating page views and won't get your urls indexed.

RSS Ping is an XML-RPC spec http://www.xmlrpc.com

So the way to do it is import the file that contains the urls you want to get indexed,  into the harvester grid, go to Export URL List >> Export as RSS XML List. Then scan the URL’s which fetches the link Title and Descriptions, set how many entries in each feed and export. It saves as an .xml file(s) which then needs to be uploaded to your domain and will look like: http://www.scrapeboxfaq.com/feed.xml

Then select RSS in the commenter section. Load the RSS Services and feed URL’s to ping them.  There are default RSS services that come with scrapebox or you can use your own.  The feed urls are the ones you uploaded to your domain, like above.


What format do I put my username and password in for proxies?

$
0
0

What format do I put my username and password in for proxies?

The way that you format proxies is like this:

IP:PORT

or

IP:PORT:USERNAME:PASS

127.0.0.1:80:username:password

Scrape emails from Craigslist

$
0
0

Scrape emails from Craigslist

You can grab emails with the email grabber in the harvested urls section. It will let you harvest emails from a url or a local file.

Say you wanted to harvest emails from the Jobs category on Craigslist.

In a regular web browser open up Craigslist. Find the category you want to harvest from, in the case of the jobs category, most major cities it looks like this:

http://losangeles.craigslist.org/jjj/

I got this by selecting the city I wanted, and then clicking the "jobs" link at the top of the category.

Then you would copy down that url, which is what is above.  Note: make sure that if it gives you a spam warning you follow thru to get the actual url of the page that lists the ads.

If you like you can also copy down the urls of the "Next 100 results".

Then save off all of the urls from the categories you want.

Then import them into the Link Extractor addon.

Choose Internal only.

Then let it harvest all the urls from those pages.  This will give you all the current craigslist ads for each category from all the pages you choose.

Then export the results to a txt file.

Then import that txt file into the urls harvester section.

Then use the email grabber to get the emails from those urls.  Thus you have scraped all the emails from Craigslist for the current ads from the categories you have chosen.

The best part is the category urls are static, but the urls that you harvest from them change daily, so you can repeat this process over and over.

Scraping not working or returning no results

$
0
0

Scraping not working or returning no results

If you are scraping a engine, especially Google and you are not getting any results, its generally due to one of a couple reasons.

1.) The terms you are scraping simply do not have any results to return.  For example if you manually go to Google.com and search for:
inurl:car inurl:house inurl:cat site:purple.com purple cows

Google is then going to return this:
Your search - inurl:car inurl:house inurl:cat site:purple.com purple cows - did not match any documents.

So if you put that same string into scrapebox its not going to harvest any results either.

2.) Your proxies are blocked or some other error is happening.  The easiest way to check this is go to the settings menu.  Uncheck "use multi threaded harvester".   Then try to harvest.  Scrapebox will display each query, the proxy used and the result, including any error messages.

If you see lines that say:
Results 0 completed using proxy xxx.xxx.xxx.xxx:xxx   - then it means that the request finished, but the engine returned no results, same as it would do if you actually went to the engine and manually searched for it.

If you see:
Results 0 Error xxx received using proxy xxx.xxx.xxx.xxx:xxx - Then you can look at the error message and generally determine the problem.  Most common for this one are:

Error 302 - Your IP is blocked
Error 404 - the proxy is bad or was never found
Error 407 - your proxy need authentication, if your using private proxies you need to get with your provider to see if you need to use IP authentication and how to do it, or if you need a username and password

I submitted for a license activation or transfer and its been over 12 hours and its not active.

$
0
0

I submitted for a license activation or transfer and its been over 12 hours and its not active.

99% chance you submitted the wrong info.  Make sure you submit the correct Paypal transaction ID.  You can find your transaction ID in paypal, here is a video that shows you how:
http://www.youtube.com/user/looplinescrapebox#p/u/9/k8PWovAQaiU

Also make sure you submit the email address that the money was sent from.  It is the email that you received the "thanks for purchasing scrapebox" email too.  Also it is the email address that paypal has labeled as "primary" in your paypal account (unless you have changed your primary email address since you purchased).

Else if you submitted the correct info then it could be that a anti-virus or firewall is blocking scrapebox from reaching the authentication servers to activate your license.  Go to help >> test server connection and see if you have 6 green lights.  If you do not have all 6 lights green then you need to find out what is blocking scrapebox.  Make sure you add allow rules in all of your anti-virus/malware checkers/firewalls for scrapebox.

Does scrapebox work with https proxies?

$
0
0

Does scrapebox work with https proxies?

No, scrapebox lacks the elements to support and function with HTTPS proxies.

How does the link checker treat spinnable text {spinnable}?

$
0
0

How does the link checker treat spinnable text {spinnable}?

The link checker ignores any text inside of {}.

Such as:

http://www.site.com/id=220 {anchor|anchor2|anchor3|John Smith}

It will check for the link only and ignore the rest.

How does the link checker treat the / on the end of links?

$
0
0

How does the link checker treat the / on the end of links?

The link checker will automatically check for links both with and without the trailing slash.


How do I transfer my scrapebox liscense to a new PC?

$
0
0

How do I transfer my scrapebox liscense to a new PC?

You are permitted to transfer your ScrapeBox license to another PC once per month for free in case you get a new PC, re-install Windows etc.  For complete instructions on how to do this go here: http://www.scrapebox.com/scrapebox-license-transfer

How does Fast Poster Connection Balancing work?

$
0
0

How does Fast Poster Connection Balancing work?

It splits your list in to 500 URL batches internally, the connections go down to zero momentarily after each 500 URL "burst' before the next 500 are posted to. This gives Windows and the network a short break to process outstanding messages etc and everything to free up.

It will slow down the comment run slightly, but can provide more stability on some peoples systems.

What determines the success rate when posting?

$
0
0

What determines the success rate when posting?

When you manually post to a blog, sometimes it will kick back and say something like "Your comment was successful".  When scrapebox receives this message it reports the post as successful.

If scrapebox does not receive a message to this extent, it reports the post as failed.  If you have done much manual blog commenting you know that that doesn't mean it failed.  Sometimes it simply accepts your comment and redirects you back to the post without any notification, or something else.  The comment might be successful in this case, but it just didn't kick back the notification so scrapebox can't report that it was successful, so it reports failed.

How do I run scrapebox on multiple machines?

$
0
0

How do I run scrapebox on multiple machines?

You need to purchase a separate license for each physical machine that you want to run scrapebox on.

Does scrapebox have and affiliate program?

$
0
0

Does scrapebox have and affiliate program?

No, not at this time.

Can I scrape more then 1 million ulrs at a time?

$
0
0

Can I scrape more then 1 million ulrs at a time?

Yes.  You can virtually scrape and unlimited amount of ulrs.  Urls are stored in 1 million chunks in the following folder:

Scrapebox Folder>> Harvester Sessions >> Harvester_ XXX_XXX

Each session creates a new folder.  You may want to delete these from time to time.

Where are harvested results stored?

$
0
0

Where are harvested results stored?

Urls are stored in 1 million chunks in the following folder:

Scrapebox Folder>> Harvester Sessions >> Harvester_ XXX_XXX

Each session creates a new folder.  You may want to delete these from time to time.


How to whitelist an email

$
0
0

Please Whitelist:

loopline@scrapeboxfaq.com

Information on how to whitelist this address in any popular platform/program can be found on this helpful site  – Whatcounts.com (Note: WhatCounts.com is not owned or Affiliated with Scrapebox FAQ)

Is there a monthly cost for Scrapebox?

$
0
0

Is there a monthly cost for Scrapebox?

No, not at this time.

What is the best general Troubleshooting steps?

$
0
0

What is the best general Troubleshooting steps?

Generally speaking when you have a problem with scrapebox these are the best steps to follow.  Work thru them until your problem is resolved.

1.) Restart your computer

2.) Update scrapebox to the latest version if there is an update available.  Sometimes things change with Google, or Wordpress etc...  and the updates fix these issues.

3.) Take a moment to go back thru your settings and double check all of the most obvious things.  Often a simple setting can cause great grief.

4.) Reinstall Scrapebox.  Completely delete all of the scrapebox operating files (of course back up any files you are working with, like your names, emails etc... You only need to delete the actual scrapebox files that come with scrapebox originally when you download it).  Then re-download the latest version of scrapebox and try it.  (download here: http://www.scrapebox.com/payment-received)

5.) If all else fails try posting in the sales thread where you purchased scrapebox to see if the user community can help you (if you purchased on a forum),  or you can try the sales thread at BHW, they are very helpful: http://www.blackhatworld.com/blackhat-seo/buy-sell-trade/129096-scrapebox-ultimate-serp-scraper-auto-blog-commenter-prstorm-mode.html

6.) If that doesn't help then you can contact support directly: http://www.scrapebox.com/contact-us

Scrapebox says it is unable to connect to the servers, are they down?

$
0
0

Scrapebox says it is unable to connect to the servers, are they down?

All scrapebox servers are monitored by Pingdom 1 minute monitoring.  Scrapebox has multiple servers, the publicly monitored ones, including the scrapebox licensing servers an scrapebox.com can be found in the Pingdom summary below.

Pingdom summary of Scrapebox servers

What is a Trackback?

$
0
0

What is a Trackback?

In short a trackback is when site A links to site B and in turn site B links back to site A.  (more or less).  You do not have to link to the sites from your site for this to work, scrapebox sends trackbacks automatically and fakes it like you are linking to their site.  So that they will link to your site without you having to link to theirs.

For a technical definition you can read more on trackbacks here.

Viewing all 240 articles
Browse latest View live