robots.txt file
justawebbie
Registered Users Posts: 34 Big grins
How can I get the robots.txt file opened so our Google search appliance can index our smugmug site?
http://photomgr.smugmug.com
I sent an email to SmugMug support but have not heard back from them at all.
Comments
Yup, here it is from December 9th:
Can you please check your spam folders at work? Thanks!
The answer is, http://photomgr.smugmug.com/robots.txt
Portfolio • Workshops • Facebook • Twitter
I'm really not sure. I do know that you cannot modify the robots.txt file, but Google itself can certainly crawl the site. I'm not sure why your Google appliance can't, though.
Thank you though Andy.
Google itself may be allowed, but perhaps not our GSA (search.alaska.edu). Do you know how to get this changed? I am a pro user.
Maybe you could edit the user agent used for your GSA service? This page seems to suggest that changing the name is possible:
http://code.google.com/apis/searchappliance/documentation/50/help_gsa/crawl_headers.html
You could try using a user agent that's listed in our robots.txt file, such as Googlebot.
If you'd like to suggest support for another search user agent, you could add your suggestion here:
http://feedback.smugmug.com/forums/17723-smugmug
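To see why the user agent name matters, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt content below is a made-up policy for illustration, not SmugMug's actual file, and `gsa-crawler` is assumed to be the name your appliance currently sends; `is_allowed` is just a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical policy for illustration only; NOT SmugMug's actual robots.txt.
SAMPLE_ROBOTS_TXT = """\
User-agent: Googlebot
Disallow:

User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "http://photomgr.smugmug.com/"
    # "gsa-crawler" is an assumed appliance user agent name.
    for agent in ("Googlebot", "gsa-crawler"):
        print(agent, is_allowed(SAMPLE_ROBOTS_TXT, agent, url))
```

Under a policy like this, a crawler is only let in if its name matches a listed user agent, which is why renaming the appliance's user agent might help.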
SmugMug Support Hero
User-Agent: MJ12bot
Disallow:
Thanks. Greetings from Canada's Far East.
Brian Carey
http://www.briancareyphotography.com/
I received the message below from Google AdSense.
I'm wondering whether the robots.txt file can be modified as they suggest, if this has not been done already.
thanks very much,
Paul Lantz
Large number of failed ad crawls: 76 failed crawls last week
We noticed that the AdSense ad crawler is having some issues accessing your site on smugmug.com. This issue is caused by a misconfigured robots.txt file, which is blocking the ad crawler from viewing certain sections of your site.
Last week, we detected 76 failed crawl requests. Because of this, your AdSense ads are less targeted and are generating less revenue.
To fix this, you’ll need to edit your robots.txt file to allow our AdSense crawler by adding these two lines to the very top:
User-agent: Mediapartners-Google
Disallow:
This will allow the AdSense ad crawler to access pages on your site that already have AdSense code on them. As a result, you and your users will benefit from more targeted, relevant ads.
It's important to note that making this change will not impact your Google search or SEO rankings. Adding these two lines to your robots.txt file will simply deliver better, more relevant ads to pages with AdSense code already on them. Pages that don’t have AdSense ad code will not be affected.
For more in-depth information, make sure that you read our blog post about this on the Inside AdSense blog.
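For anyone wondering what those two lines actually change, here is a rough sketch with Python's standard `urllib.robotparser`. The "existing rules" are only an assumption about what might be blocking the ad crawler (the real SmugMug robots.txt may differ), the URL is a shortened, made-up search link, and `can_crawl` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# The two lines quoted in the AdSense message, placed at the very top.
ADSENSE_RULE = "User-agent: Mediapartners-Google\nDisallow:\n"

# Hypothetical existing rules; the real SmugMug robots.txt may differ.
EXISTING_RULES = "User-agent: *\nDisallow: /search/\n"


def can_crawl(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Shortened, illustrative search URL on the poster's site.
url = "http://paullantz.smugmug.com/search/?searchWords=baby"

# Without the AdSense lines, the catch-all rule applies to the ad crawler.
before = can_crawl(EXISTING_RULES, "Mediapartners-Google", url)

# With the AdSense lines on top, the ad crawler gets its own empty Disallow.
combined = ADSENSE_RULE + "\n" + EXISTING_RULES
after = can_crawl(combined, "Mediapartners-Google", url)

# Other crawlers still fall under the catch-all rule.
other = can_crawl(combined, "SomeOtherBot", url)

if __name__ == "__main__":
    print("before:", before, "after:", after, "other bot:", other)
```

An empty `Disallow:` means "nothing is disallowed" for that user agent, so only the AdSense crawler is affected and every other crawler keeps the same rules as before.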
If you don't want to post the links in the public forum, email our HelpDesk with details.
http://paullantz.smugmug.com/search/?searchWords=baby&searchType=InUser&NickName=paullantz&x=0&y=0
http://paullantz.smugmug.com/search/?searchWords=Fort+Albany&searchType=InAlbum&AlbumID=25782496&x=0&y=0
http://paullantz.smugmug.com/search/?searchWordsShort=October&searchType=InAlbum&AlbumID=25782496&x=0&y=0
http://paullantz.smugmug.com/search/?searchWordsShort=moosonee&searchType=InAlbum&AlbumID=861351&x=0&y=0
I very much wish I understood this better.
Paul
Homepage • Popular
JFriend's javascript customizations • Secrets for getting fast answers on Dgrin
Always include a link to your site when posting a question
nickname.smugmug.com/robots.txt
(replace nickname with your own site nickname)