Robots.txt blocking 90% of Google Indexing
Coffeehound
Registered Users Posts: 3 Beginner grinner
Without any intervention on my part, Google has deleted about 90 percent of my site's pages from the index. It appears to coincide with a sudden change in our robots.txt file, which declared most of our pages as blocked URLs. The robots.txt file seems to be back to normal, but we continue to lose pages. I've seen similar reports but no indication whether it's been figured out or when it will be fixed. Please let me know what's happening.
Thanks
Comments
SmugMug Support Hero
I went to a free sitemap generator site - http://www.freesitemapgenerator.com/quick-sitemap.html - just to see what would happen. I have over 15 galleries on my site, a blog, and other pages. Here is the sitemap generated by that site. It found my home page, plus one other page. That means every other page on my site isn't being indexed, right?
I view this as an extremely serious problem. I'm glad it appears that SM has 3 engineers on it. I hope you guys resolve this quickly!
Instagram Twitter Facebook
Did you upload your own sitemap? If so how?
Images in the Backcountry
My SmugMug Customizations | Adding CSS to Your Site | SEO for the Photographer | Locate Your Page/Widget Number | SmugMug Help Desk
That site doesn't require that you upload your own. It simply crawls your home page and creates a sitemap from that.
Instagram Twitter Facebook
I understand that it crawls your home page and creates a sitemap. I thought you actually uploaded that sitemap.
Got me thinking though. I can't FTP my own sitemap using FileZilla Client, but I can upload a sitemap via my cPanel.
Images in the Backcountry
My SmugMug Customizations | Adding CSS to Your Site | SEO for the Photographer | Locate Your Page/Widget Number | SmugMug Help Desk
.....I'm speechless
photosbygerry.smugmug.com
Right you are. I found another one here -- http://www.web-site-map.com/xml_sitemap.php -- and it created a sitemap with 35 pages, which is what it should have.
Instagram Twitter Facebook
Could this explain my issue do you think?
I have to agree. This is simply not acceptable. I can tell you firsthand this is costing me business. Hello smugmug?
Blog 1 - Blog 2 - Blog 3
Tomoscott, I don't see any issue on your site. It has a very current sitemap with lots of details.
Coffeehound, the graph you included in your first post shows almost 0 issues with robots.txt for many months. While there have been recent indexing issues, they weren't robots.txt related. Please read through the thread I linked to above for details and what we've done to address the matter. In fact, there have been first reports on that thread that the changes have already taken effect for some.
Star Path Images, what you quote from Coffeehound is something that's not actually the case. For details, please see the post I linked to above (and if you wish for more details, you can read the thread as well).
SmugMug Support Hero
Other sitemap generators might be blocked by robots.txt unless they identify themselves as "googlebot" or another allowed user agent.
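For anyone who wants to check this for themselves, here is a rough sketch using Python's standard urllib.robotparser. The robots.txt text and the URL below are made up for illustration (they are not SmugMug's actual rules), and "SomeSitemapGenerator" is a hypothetical crawler name. It only shows how the same page can be allowed for one user agent and blocked for another.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, for illustration only.
# This is NOT a copy of any real site's file.
SAMPLE_ROBOTS = """\
User-agent: Googlebot
Allow: /

User-agent: *
Disallow: /
"""


def is_allowed(robots_text: str, user_agent: str, url: str) -> bool:
    """Return True if robots_text lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() takes the file as a list of lines, so no network access is needed.
    parser.parse(robots_text.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.smugmug.com/Galleries"  # placeholder URL
    for agent in ("Googlebot", "SomeSitemapGenerator"):
        print(agent, "allowed:", is_allowed(SAMPLE_ROBOTS, agent, page))
```

To test a live site instead, RobotFileParser also has set_url() and read(), which download the robots.txt file before you call can_fetch().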
Append
/sitemap-base.xml
/sitemap-galleryimages.xml
to your URL, or download them zipped:
/sitemap-base.xml.gz
/sitemap-galleryimages.xml.gz
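
If you would rather script this than type the addresses by hand, something along these lines might help. It is only a sketch: the site address and the sample XML are placeholders, the helper names are my own, and I am assuming the files follow the standard sitemaps.org format with <loc> entries. It builds the sitemap addresses from the paths listed above and counts the pages in a sitemap you have already downloaded.

```python
import xml.etree.ElementTree as ET

# Sitemap paths as given in the post above.
SITEMAP_PATHS = ("/sitemap-base.xml", "/sitemap-galleryimages.xml")
# Namespace of the standard sitemap protocol (assumed to apply here).
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def sitemap_urls(site_url: str, zipped: bool = False) -> list[str]:
    """Append each sitemap path (optionally the .gz variant) to the site URL."""
    base = site_url.rstrip("/")
    suffix = ".gz" if zipped else ""
    return [base + path + suffix for path in SITEMAP_PATHS]


def listed_pages(sitemap_xml: str) -> list[str]:
    """Return every <loc> address found in an already downloaded sitemap."""
    root = ET.fromstring(sitemap_xml)
    return [loc.text.strip() for loc in root.iter(SITEMAP_NS + "loc") if loc.text]


# Made-up sitemap content, for illustration only.
SAMPLE_SITEMAP = """\
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.smugmug.com/</loc></url>
  <url><loc>https://example.smugmug.com/Galleries</loc></url>
</urlset>
"""

if __name__ == "__main__":
    for url in sitemap_urls("https://example.smugmug.com/"):
        print(url)
    print(len(listed_pages(SAMPLE_SITEMAP)), "pages listed in the sample")
```

Comparing that page count with the number of galleries and pages you expect is a quick way to tell whether a sitemap is complete.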