SEO Blog

Posts Tagged ‘January’


Another January Mozscape Index Has Been Released!

Posted by:  /  Tags: , , , , ,

Posted by carinoverturf

Just 13 days ago on January 11th, we released the first Mozscape index for 2013. And today, we're launching the latest January Mozscape index – another two indexes in one month! Mozscape data has been refreshed across all our applications so you can see the latest data in Open Site Explorer, the MozbarPRO campaigns, and the Mozscape API.

This index finished up in record time, running smoothly on the high power cluster compute machines in AWS. Our Mozscape processing team (Doug, Martin, Brandon, and Stephen) has spent the past few months really cleaning up and optmizing the software that produces these indexes. Changes are slow going with this software – big data is big and changes are big! There is a lot of testing and optimizing that must be done before changes even make it into the production index, but these guys are dedicated to getting you index twice a month! 
 
We're eagerly waiting for our first index to be released from our new colocation in Virginia – hopefully in the month of February. With some new configurations and master network tuning from our Tech Ops team, we currently have an index churning away, so far with promising performance!
 
Here are the metrics for this latest index:
  • 70,278,347,012 (70 billion) URLs
  • 1,516,212,211 (1.5 billion) Subdomains
  • 145,518,352 (145 million) Root Domains
  •  783,206,227,396 (783 billion) Links
  • Followed vs. Nofollowed
    • 2.24% of all links found were nofollowed
    • 56.43% of nofollowed links are internal
    • 43.57% are external
  • Rel Canonical – 15.11% of all pages now employ a rel=canonical tag
  • The average page has 78 links on it
    •  66.68 internal links on average
    •  11.07 external links on average
And the following correlations with Google's US search results:
  • Page Authority – 0.36
  • Domain Authority – 0.19
  • MozRank – 0.24
  • Linking Root Domains – 0.30
  • Total Links – 0.25
  • External Links – 0.29
Crawl histogram for the January 25th Mozscape index
 
Since this index was kicked off January 14th, the latest crawl data is really fresh! There is just over 30 days of crawl data in this index, the majority being crawled in January, but some crawl data as old as mid-December. There was a significant increase in the number of subdomains crawled for this index compared to the our previous index. Further investigation revealed we found a fairly small increase of root domains that had a substantial number of new subdomains associated with them. Because they are such low authority, the increase won't have any impact on our metrics, but does significantly increase the number subdomains in this index.  
 
We always love to hear your thoughts! And remember, if you're ever curious about when Mozscape is updating, you can check the calendar here. We also maintain a list of previous index updates with metrics here.

Sign up for The Moz Top 10, a semimonthly mailer updating you on the top ten hottest pieces of SEO news, tips, and rad links uncovered by the Moz team. Think of it as your exclusive digest of stuff you don’t have time to hunt down but want to read!


SEOmoz Daily SEO Blog

January Mozscape Index is Live!

Posted by:  /  Tags: , , ,

Posted by carinoverturf

Today, we are releasing the latest Mozscape index – just three weeks after our last release on 12/21! Mozscape data has been refreshed across all our applications so you can see the latest data in Open Site Explorer, the MozbarPRO campaigns, and the Mozscape API.

The top focus of the Big Data team in the next couple months is to get our Mozscape index releases to be more frequent – consistently releasing every three weeks, and then, ultimately, every two weeks. We're utilizing both the high power compute AWS machines as well as our own virtual private cloud setup in our Virginia colocation. At this point, processing in AWS is still slightly faster than processing in our private cloud. However, our top notch Tech Ops team is working closely with us to fine tune our implementation of Open Stack, open source software that allows us to put a virtual layer on top of our fleet of hardware in Virginia. Our own super computers in Virginia should give us even more computing power than we've seen in AWS, meaning faster index processing and more frequent releases for you guys!

Here are the metrics for this latest index:

  • 68,291,839,694 (68 billion) URLs
  • 512,802,814 (512 million) Subdomains
  • 96,918,414 (97 million) Root Domains
  • 771,699,931,943 (771 billion) Links
  • Followed vs. Nofollowed
    • 2.24% of all links found were nofollowed
    • 56.32% of nofollowed links are internal
    • 43.68% are external
  • Rel Canonical – 11.39% of all pages now employ a rel=canonical tag
  • The average page has 61 links on it
    •  51.68 internal links on average
    •  8.77 external links on average

And the following correlations with Google's US search results:

  • Page Authority – 0.36
  • Domain Authority – 0.19
  • MozRank – 0.24
  • Linking Root Domains – 0.30
  • Total Links – 0.25
  • External Links – 0.29

Crawl histogram for the January Mozscape index

This index is a little bit smaller than the previous index, but fairly fresh with the oldest data being crawled late November and the freshest from January 1st. As you can see from the histogram, a pretty big portion was crawled mid- to late-December!

We always love to hear your thoughts! And remember, if you're ever curious about when Mozscape is updating, you can check the calendar here. We also maintain a list of previous index updates with metrics here.

Sign up for The Moz Top 10, a semimonthly mailer updating you on the top ten hottest pieces of SEO news, tips, and rad links uncovered by the Moz team. Think of it as your exclusive digest of stuff you don’t have time to hunt down but want to read!


SEOmoz Daily SEO Blog