Previous :: Next Topic |
Author |
Message |
neek Member
Joined: 12 Sep 2011 Posts: 2338 | TRs | Pics Location: Seattle, WA |
|
neek
Member
|
Sun Jan 07, 2024 10:28 am
|
|
|
This is weird, I used to add "site:nwhikers.net" to my google searches all the time to search this site exclusively, but it no longer returns results (or sometimes just one). Did we get blacklisted? I'll try again later, maybe google is just having a moment.
|
Back to top |
|
|
dave allyn Member
Joined: 05 Apr 2011 Posts: 428 | TRs | Pics
|
Try it with The Angry Hiker
|
Back to top |
|
|
neek Member
Joined: 12 Sep 2011 Posts: 2338 | TRs | Pics Location: Seattle, WA |
|
neek
Member
|
Sun Jan 07, 2024 11:29 am
|
|
|
Try it with whatever you like. Are you getting different results? For me even w/o cookies it's just crickets.
|
Back to top |
|
|
seawallrunner dilettante
Joined: 27 Apr 2005 Posts: 3308 | TRs | Pics Location: Lotusland |
Same. A few weeks ago I tried searching for previous posts with REI in the title or body, and the search returned empty.
|
Back to top |
|
|
Tom Admin
Joined: 15 Dec 2001 Posts: 17857 | TRs | Pics
|
|
Tom
Admin
|
Sun Jan 07, 2024 2:30 pm
|
|
|
This summer MCaver needed to add some stuff to block a rogue bot that was hammering the site. I'm guessing it might have led to unintended side effects. Nothing else has changed.
|
Back to top |
|
|
huron Member
Joined: 13 Sep 2004 Posts: 1039 | TRs | Pics
|
|
huron
Member
|
Sun Jan 07, 2024 5:05 pm
|
|
|
|
Back to top |
|
|
zimmertr TJ Zimmerman
Joined: 24 Jun 2018 Posts: 1228 | TRs | Pics Location: Issaquah |
|
zimmertr
TJ Zimmerman
|
Sun Jan 07, 2024 6:51 pm
|
|
|
|
Back to top |
|
|
Tom Admin
Joined: 15 Dec 2001 Posts: 17857 | TRs | Pics
|
|
Tom
Admin
|
Mon Jan 08, 2024 2:45 pm
|
|
|
Seems as if MCaver unintentionally blocked all bots, I've tweaked the robots.txt to allow all bots and block rogue bots via .htaccess instead. Hopefully the google indexing comes back.
FWIW, this is how bad that rogue bot was hammering the site this summer, caused several hundred dollars of bandwidth overage charges. If you google Bytespider you'll see it continues to hammer other sites with webmasters trying to figure out how to block it. Apparently it doesn't honor robots.txt so maybe we just got lucky that it stopped hammering us, at the cost of putting in something that only served to block well behaved bots.
|
Back to top |
|
|
zimmertr TJ Zimmerman
Joined: 24 Jun 2018 Posts: 1228 | TRs | Pics Location: Issaquah |
|
zimmertr
TJ Zimmerman
|
Mon Jan 08, 2024 3:00 pm
|
|
|
IIRC, ByteSpider advertises itself by user agent. I don't really know what the CDN/LB looks like for this website, but you might be able to drop ingress traffic when the user agent matches Bytespider. For example, using CloudFlare WAF:
|
Back to top |
|
|
zimmertr TJ Zimmerman
Joined: 24 Jun 2018 Posts: 1228 | TRs | Pics Location: Issaquah |
|
zimmertr
TJ Zimmerman
|
Mon Jan 08, 2024 3:02 pm
|
|
|
PS, ByteSpider is owned/operated by ByteDance (TikTok) and used to train AI models. Maybe soon we'll be able to ask AI where we should go hiking :P
|
Back to top |
|
|
Tom Admin
Joined: 15 Dec 2001 Posts: 17857 | TRs | Pics
|
|
Tom
Admin
|
Mon Jan 08, 2024 3:14 pm
|
|
|
zimmertr
|
Back to top |
|
|
zimmertr TJ Zimmerman
Joined: 24 Jun 2018 Posts: 1228 | TRs | Pics Location: Issaquah |
|
zimmertr
TJ Zimmerman
|
Mon Jan 08, 2024 3:36 pm
|
|
|
It works!
|
Back to top |
|
|
Tom Admin
Joined: 15 Dec 2001 Posts: 17857 | TRs | Pics
|
|
Tom
Admin
|
Mon Jan 29, 2024 10:52 pm
|
|
|
Seems like google is crawling NWH again. Not sure how extensive it will be but at least it's a fresh index. Plenty of hits for "The Angry Hiker" which is a good sign.
|
Back to top |
|
|
|