Why is Facebook crawling through SmugMug?
W.W. Webster
Registered Users Posts: 3,204 Major grins
StatCounter has reported Facebook crawling through my SmugMug site on a number of occasions over recent weeks, most recently in the past few minutes.
What possible interest can (or should) they have? I'm perfectly happy for Google, Bing, Yahoo and numerous other search engines to index my images - which they do - but Facebook? They don't provide a search function, and they have no rights to my images.
Does SmugMug facilitate or encourage Facebook in this? I don't trust those b*st*rds! :huh
Comments
Sorry about this, but I'm not really sure. Do you know if anyone has linked to your site on Facebook? If images were posted or linked that could be what's showing up. Like this one:
http://www.facebook.com/WellingtonPhotographicSociety/posts/213828098689791
I'd suggest trying to contact Facebook to see if they have more insight about why they'd be showing up as crawling your pages. That's not really helpful of me though, sorry!
I don't see why if I, or anyone else, for that matter, links to a SmugMug image on Facebook, that Facebook needs to come nosing around my site! They've got the link, what else are they looking for?
I seldom see Google or Bing spooking around, but I imagine that's because SmugMug pushes keywords, etc. to them. Some Asian search sites come looking, which is fine, but Facebook has no need for this data, as I see it.
Homepage • Popular
JFriend's javascript customizations • Secrets for getting fast answers on Dgrin
Always include a link to your site when posting a question
But more to the point, this suggests SmugMug facilitates Facebook access, or can they crawl around unassisted? Surely this can't happen without SmugMug allowing it?
Legitimate crawlers are supposed to follow the rules that SmugMug puts in the robots.txt file, but that is voluntary; there is no absolute enforcement.
Facebook could also be looking at your page when somebody else on the internet posts a link to your site on their FB page. If it's a crawler looking at your page, then StatCounter would probably not see it, because crawlers don't cause JavaScript to get executed, which StatCounter generally needs.
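For anyone curious how the robots.txt rules work in practice, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt content, the `is_allowed` helper, and the `example.com` URLs are all made up for illustration; this is not SmugMug's actual file. `facebookexternalhit` is the user-agent token Facebook's link-preview fetcher identifies itself with. Keep in mind this only shows what a well-behaved crawler would decide for itself; nothing here blocks a crawler that ignores the rules.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, for illustration only
# (NOT SmugMug's real file).
ROBOTS_TXT = """\
User-agent: facebookexternalhit
Disallow: /

User-agent: *
Disallow: /private/
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt permits user_agent to fetch url.

    This mirrors the check a polite crawler performs before requesting
    a page. Compliance is voluntary on the crawler's side.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.com/gallery/1"  # hypothetical URL
    for agent in ("facebookexternalhit/1.1", "Googlebot"):
        print(agent, "allowed:", is_allowed(agent, page))
```

With the made-up rules above, the Facebook agent is refused everywhere, while other crawlers are only kept out of `/private/`.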
We don't do anything special for Facebook. Like John says, they, like other sites, can crawl your site (think: Google, Bing, Yahoo).
Portfolio • Workshops • Facebook • Twitter
That's possible, I suppose, but again - why?
I understand, as whenever (rarely) Google, Yahoo, or one of the Chinese or Korean search engines, etc. come looking, StatCounter never reports specific page URLs being accessed, which is entirely understandable (and acceptable).
What I don't understand (nor appreciate) is that Facebook, which is not a search engine and which shows no regard for photographers' copyright in its terms of service, should be crawling about in SmugMug users' sites - even if (as you suggested previously), "they're just try(ing) to make a decent user experience for people that try to post a link to (my) page".
We have an expression in this country for claims and assertions as dubious as this - "yeah right"!
I'm just trying to understand what's going on and, as always, you've been very helpful!
Thanks!
Are you sure about that?
https://www.google.com/search?num=100&hl=en&q="does+facebook+crawl"
In this podcast, around the 35-minute mark, they talk about how and why Facebook crawls a site. They say that if your site has a FB Connect or Like button, FB crawls your site once every 24 hours.
They "say" the search results come from Bing, but I bet they rank sites that have FB Connect or Like buttons higher than non-FB-connected sites.
jc
"Chance favors the prepared mind." ~ Ansel Adams
"Light thinks it travels faster than anything but it is wrong. No matter how fast light travels, it finds the darkness has always got there first, and is waiting for it." ~ Terry Pratchett