Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Collapse
Brand Logo
UDS UDS: $1.86
24h: 8.65%
Trade UDS
Gate.io
Gate.io
UDS / USDT
MEXC
MEXC
UDS / USDT
WEEX
WEEX
UDS / USDT
COINSTORE
COINSTORE
UDS / USDT
Biconomy.com
Biconomy.com
UDS / USDT
BingX
BingX
UDS / USDT
XT.COM
XT.COM
UDS / USDT
Uniswap v3
Uniswap v3
UDS / USDT
PancakeSwap v3
PancakeSwap v3
UDS / USDT

Earn up to 50 UDS per post

Post in Forum to earn rewards!

Learn more
UDS Right

Spin your Wheel of Fortune!

Earn or purchase spins to test your luck. Spin the Wheel of Fortune and win amazing prizes!

Spin now
Wheel of Fortune
selector
wheel
Spin

Paired Staking

Stake $UDS
APR icon Earn up to 50% APR
NFT icon Boost earnings with NFTs
Earn icon Play, HODL & earn more
Stake $UDS
Stake $UDS
UDS Left

Buy UDS!

Buy UDS with popular exchanges! Make purchases and claim rewards!

Buy UDS
UDS Right

Post in Forum to earn rewards!

UDS Rewards
Rewards for UDS holders
Rewards for UDS holders (per post)*
  • 100 - 999 UDS: 0.05 UDS
  • 1000 - 2499 UDS: 0.10 UDS
  • 2500 - 4999 UDS: 0.5 UDS
  • 5000 - 9999 UDS: 1.5 UDS
  • 10000 - 24999 UDS: 5 UDS
  • 25000 - 49999 UDS: 10 UDS
  • 50000 - 99 999 UDS: 25 UDS
  • 100 000 UDS or more: 50 UDS
*

Rewards are credited at the end of the day. Limited to 5 payable posts per day, 50 K holders - 3 posts per day, 100K holders - 2 posts per day. Staked UDS gives additional coefficient up to X1.5

  1. Home
  2. Beyond Blockchain
  3. 🚨 Cloudflare: Perplexity bots are scraping websites despite owner restrictions

🚨 Cloudflare: Perplexity bots are scraping websites despite owner restrictions

Scheduled Pinned Locked Moved Beyond Blockchain
3 Posts 3 Posters 15 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
This topic has been deleted. Only users with topic management privileges can see it.
  • cryptoenthusiastC Offline
    cryptoenthusiastC Offline
    cryptoenthusiast
    wrote on last edited by
    #1

    leonardo.osnova.webp
    Cloudflare has reported that web crawlers from the AI search startup Perplexity are bypassing website restrictions — even when explicitly blocked by site owners.

    🕵️‍♂️ Here’s what’s happening:

    Since July 1, 2025, Cloudflare began automatically blocking AI crawlers on customer websites. But many site admins noticed that Perplexity bots were still getting through, despite being denied access via robots.txt and Web Application Firewall (WAF) settings.

    Upon investigation, Cloudflare discovered that:

    🔍 Perplexity disguises its bots as real users — by spoofing browser headers like Chrome on macOS.
    
    📶 They rotate IP addresses and ASN identifiers, allowing them to operate outside known ranges.
    
    🐢 When disguised, the bots slow down crawling speed — from 20–25 million requests per day to 3–6 million — making them harder to detect.
    
    🧩 If blocked completely, Perplexity tries to reconstruct page data from third-party sources, even if those sources are outdated or inaccurate.
    

    🛡️ The good news:

    Cloudflare has rolled out new protections against stealth crawlers — even on free-tier plans. Users just need to enable the feature in their dashboard.

    ✅ Also worth noting: ChatGPT bots from OpenAI were found to respect website rules and don’t violate crawling policies, Cloudflare confirmed.

    📣 Cloudflare reminds AI crawler operators:

    Stay transparent, ethical, and responsible.
    🔒 Don’t overload websites
    🙅‍♂️ Don’t harvest personal data
    🏷️ Always identify your bot clearly.
    

    📌 Bottom line: If you're running a site and want to protect your content from unauthorized AI scraping, now’s the time to double-check your Cloudflare settings. The AI web crawler war is heating up — and staying one step ahead is key.

    1 Reply Last reply
    0
    • J Offline
      J Offline
      jacson4
      wrote on last edited by
      #2

      This is a huge red flag for web transparency and control. If site owners are explicitly blocking crawlers via robots.txt or other means — and those instructions are still being bypassed — that’s not just a tech glitch, it’s a trust issue.Cloudflare’s involvement makes it even more complex, because they’re often seen as the protectors of web infrastructure. If platforms like Perplexity are getting through, intentionally or not, it raises serious questions about consent, enforcement, and the future of content ownership in the age of AI. 🛡️🧠📉

      1 Reply Last reply
      0
      • N Offline
        N Offline
        Nahid10
        wrote on last edited by
        #3

        Scraping the web is nothing new, but ignoring explicit opt-outs is where it crosses a line. If bots are bypassing standard blocks, that’s not “indexing” — that’s digital trespassing.The irony is that these AI models depend on the open web, yet risk poisoning that same ecosystem by overreaching. Platforms need to be held accountable before this becomes the norm, not the exception. Respect to the post for calling this out — these are the conversations we need to have now, not later. ⚖️🌐🚨

        1 Reply Last reply
        0


        Powered by NodeBB Contributors
        • First post
          Last post
        0
        • Categories
        • Recent
        • Tags
        • Popular
        • World
        • Users
        • Groups