
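How to analyse search engine spider logs
Search engine spider logs are a powerful resource that site owners rarely use to the full. Analysing them shows how each search engine crawls your site's content and how its spiders behave over a period of time.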
IP address (3) | Server name | Country |
---|---|---|
34.86.212.119 | 119.212.86.34.bc.googleusercontent.com | US |
35.222.190.7 | 7.190.222.35.bc.googleusercontent.com | US |
34.174.192.151 | 151.192.174.34.bc.googleusercontent.com | US |
104.198.200.51 | 51.200.198.104.bc.googleusercontent.com | US |
34.71.214.216 | 216.214.71.34.bc.googleusercontent.com | US |
34.174.198.74 | 74.198.174.34.bc.googleusercontent.com | US |
34.125.230.24 | 24.230.125.34.bc.googleusercontent.com | US |
34.171.160.166 | 166.160.171.34.bc.googleusercontent.com | US |
35.239.141.230 | 230.141.239.35.bc.googleusercontent.com | US |
34.125.4.31 | 31.4.125.34.bc.googleusercontent.com | US |
35.245.37.253 | 253.37.245.35.bc.googleusercontent.com | US |
34.48.173.62 | 62.173.48.34.bc.googleusercontent.com | US |
34.125.116.87 | 87.116.125.34.bc.googleusercontent.com | US |
31.145.16.12 | yoncu.bilisim.cozumleri | TR |
35.193.85.39 | 39.85.193.35.bc.googleusercontent.com | US |
IP address (27) | Server name | Country |
---|---|---|
35.243.23.3 | 3.23.243.35.bc.googleusercontent.com | ? |
35.243.23.31 | 31.23.243.35.bc.googleusercontent.com | ? |
107.178.194.227 | 227.194.178.107.gae.googleusercontent.com | US |
107.178.194.229 | 229.194.178.107.gae.googleusercontent.com | US |
107.178.194.231 | 231.194.178.107.gae.googleusercontent.com | US |
34.98.143.50 | 34.98.143.50 | US |
35.187.132.232 | 35.187.132.232 | US |
35.187.132.239 | 35.187.132.239 | US |
35.187.132.241 | 35.187.132.241 | US |
34.98.143.52 | 34.98.143.52 | US |
IP address (213) | Server name | Country |
---|---|---|
35.243.23.31 | 31.23.243.35.bc.googleusercontent.com | US |
35.243.23.7 | 7.23.243.35.bc.googleusercontent.com | US |
107.178.194.227 | 227.194.178.107.gae.googleusercontent.com | US |
107.178.194.229 | 229.194.178.107.gae.googleusercontent.com | US |
107.178.194.231 | 231.194.178.107.gae.googleusercontent.com | US |
34.98.143.48 | 34.98.143.48 | US |
35.187.132.230 | 35.187.132.230 | US |
35.187.132.239 | 35.187.132.239 | US |
35.187.132.241 | 35.187.132.241 | US |
34.98.143.52 | 34.98.143.52 | US |
217.25.223.244 | pppoe244.net223.omkc.ru | RU |
35.243.23.3 | 3.23.243.35.bc.googleusercontent.com | US |
34.98.143.50 | 34.98.143.50 | US |
35.187.132.232 | 35.187.132.232 | US |
35.187.132.120 | 35.187.132.120 | US |
35.243.23.18 | 18.23.243.35.bc.googleusercontent.com | US |
35.243.23.20 | 20.23.243.35.bc.googleusercontent.com | US |
35.187.132.122 | 35.187.132.122 | US |
34.98.143.14 | 34.98.143.14 | US |
34.98.143.10 | 34.98.143.10 | US |
35.187.132.118 | 35.187.132.118 | US |
35.243.23.180 | 180.23.243.35.bc.googleusercontent.com | US |
35.187.132.103 | 35.187.132.103 | US |
107.178.194.103 | 103.194.178.107.gae.googleusercontent.com | US |
107.178.194.105 | 105.194.178.107.gae.googleusercontent.com | US |
107.178.194.177 | 177.194.178.107.gae.googleusercontent.com | US |
107.178.194.178 | 178.194.178.107.gae.googleusercontent.com | US |
107.178.194.179 | 179.194.178.107.gae.googleusercontent.com | US |
35.243.23.168 | 168.23.243.35.bc.googleusercontent.com | US |
35.243.23.169 | 169.23.243.35.bc.googleusercontent.com | US |
35.243.23.167 | 167.23.243.35.bc.googleusercontent.com | US |
35.243.23.160 | 160.23.243.35.bc.googleusercontent.com | US |
35.243.23.174 | 174.23.243.35.bc.googleusercontent.com | US |
35.187.132.199 | 35.187.132.199 | US |
35.187.132.200 | 35.187.132.200 | US |
107.178.194.203 | 203.194.178.107.gae.googleusercontent.com | US |
107.178.194.201 | 201.194.178.107.gae.googleusercontent.com | US |
107.178.194.202 | 202.194.178.107.gae.googleusercontent.com | US |
35.243.23.165 | 165.23.243.35.bc.googleusercontent.com | US |
35.243.23.163 | 163.23.243.35.bc.googleusercontent.com | US |
35.243.23.136 | 136.23.243.35.bc.googleusercontent.com | US |
107.178.194.194 | 194.194.178.107.gae.googleusercontent.com | US |
107.178.194.163 | 163.194.178.107.gae.googleusercontent.com | US |
35.187.132.201 | 35.187.132.201 | US |
35.187.132.172 | 35.187.132.172 | US |
35.187.132.171 | 35.187.132.171 | US |
35.187.132.170 | 35.187.132.170 | US |
34.98.143.194 | 34.98.143.194 | US |
34.98.143.195 | 34.98.143.195 | US |
34.98.143.196 | 34.98.143.196 | US |
35.243.23.108 | ? | US |
35.243.23.110 | ? | US |
35.243.23.109 | ? | US |
35.243.23.5 | ? | US |
35.243.23.4 | ? | US |
35.243.23.142 | 142.23.243.35.bc.googleusercontent.com | US |
35.243.23.128 | 128.23.243.35.bc.googleusercontent.com | US |
34.98.143.193 | 34.98.143.193 | US |
34.98.143.197 | 34.98.143.197 | US |
107.178.194.141 | 141.194.178.107.gae.googleusercontent.com | US |
35.243.23.37 | 37.23.243.35.bc.googleusercontent.com | US |
35.243.23.36 | 36.23.243.35.bc.googleusercontent.com | US |
107.178.194.35 | 35.194.178.107.gae.googleusercontent.com | US |
35.243.23.35 | 35.23.243.35.bc.googleusercontent.com | US |
107.178.194.136 | 136.194.178.107.gae.googleusercontent.com | US |
107.178.194.135 | 135.194.178.107.gae.googleusercontent.com | US |
107.178.194.162 | 162.194.178.107.gae.googleusercontent.com | US |
107.178.194.161 | 161.194.178.107.gae.googleusercontent.com | US |
107.178.194.137 | 137.194.178.107.gae.googleusercontent.com | US |
107.178.194.169 | 169.194.178.107.gae.googleusercontent.com | US |
107.178.194.167 | 167.194.178.107.gae.googleusercontent.com | US |
107.178.194.168 | 168.194.178.107.gae.googleusercontent.com | US |
107.178.194.34 | 34.194.178.107.gae.googleusercontent.com | US |
107.178.194.33 | 33.194.178.107.gae.googleusercontent.com | US |
35.243.23.132 | 132.23.243.35.bc.googleusercontent.com | US |
35.243.23.131 | 131.23.243.35.bc.googleusercontent.com | US |
35.243.23.133 | 133.23.243.35.bc.googleusercontent.com | US |
IP address (213) | Server name | Country |
---|---|---|
35.187.132.120 | 35.187.132.120 | US |
35.187.132.239 | 35.187.132.239 | US |
35.243.23.18 | 18.23.243.35.bc.googleusercontent.com | US |
35.243.23.20 | 20.23.243.35.bc.googleusercontent.com | US |
35.187.132.122 | 35.187.132.122 | US |
34.98.143.14 | 34.98.143.14 | US |
34.98.143.10 | 34.98.143.10 | US |
35.187.132.118 | 35.187.132.118 | US |
35.243.23.180 | 180.23.243.35.bc.googleusercontent.com | US |
35.187.132.103 | 35.187.132.103 | US |
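The server names in the tables above follow a few patterns: names ending in `bc.googleusercontent.com` or `gae.googleusercontent.com`, entries where the name is just the IP address or `?`, and a couple of unrelated hosts. The following is a minimal Python sketch of grouping rows by those patterns, using a handful of rows copied from the tables. The function name `suffix_group` and the group labels are illustrative assumptions, not part of any tool mentioned on this page.

```python
from collections import Counter

# A few (IP address, server name) rows copied from the tables above.
ROWS = [
    ("34.86.212.119", "119.212.86.34.bc.googleusercontent.com"),
    ("107.178.194.227", "227.194.178.107.gae.googleusercontent.com"),
    ("34.98.143.50", "34.98.143.50"),
    ("31.145.16.12", "yoncu.bilisim.cozumleri"),
    ("217.25.223.244", "pppoe244.net223.omkc.ru"),
    ("35.243.23.108", "?"),
]

# Hostname suffixes that appear repeatedly in the tables.
KNOWN_SUFFIXES = (".bc.googleusercontent.com", ".gae.googleusercontent.com")


def suffix_group(ip: str, hostname: str) -> str:
    """Return a coarse group label for one table row (hypothetical helper)."""
    # The tables show either "?" or the bare IP when no name was resolved.
    if hostname in ("?", ip):
        return "unresolved"
    for suffix in KNOWN_SUFFIXES:
        if hostname.endswith(suffix):
            return suffix.lstrip(".")
    return "other"


counts = Counter(suffix_group(ip, host) for ip, host in ROWS)
for group, total in sorted(counts.items()):
    print(f"{group}: {total}")
```

Rows that land in the "other" group, such as the TR and RU hosts in the tables, are the ones worth a closer look, since a user-agent string alone can be spoofed.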
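In general, do not block it. This kind of crawler usually appears only after the site owner has submitted a scan request, and if it is blocked the requested scan cannot run.
You can block virustotal, or restrict what it may access, by setting user-agent rules in your site's robots.txt. We recommend installing the Spider Analyser plugin to check whether the crawler actually follows these rules.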
    # robots.txt
    # The following rules will block this agent in most cases
    User-agent: virustotal
    Disallow: /
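Before deploying rules like these, it can help to confirm that they parse as intended. The sketch below uses Python's standard `urllib.robotparser` module; the helper name `is_allowed` and the `example.com` URL are placeholders for illustration. It only checks what the rules say, not whether a crawler actually obeys them.

```python
from urllib.robotparser import RobotFileParser

# The same rules as shown above, held in a string for a local check.
ROBOTS_TXT = """\
# robots.txt
User-agent: virustotal
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the rules permit user_agent to fetch url (hypothetical helper)."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# example.com is a placeholder; substitute a URL from your own site.
print(is_allowed(ROBOTS_TXT, "virustotal", "https://example.com/page"))  # False
print(is_allowed(ROBOTS_TXT, "Googlebot", "https://example.com/page"))   # True
```

Because no `User-agent: *` group is present, agents other than virustotal remain unrestricted under these rules.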
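You do not have to do this by hand: our WordPress plugin Spider Analyser can block unwanted spiders and crawlers for you.
VirusTotal.com is a free service for analysing viruses, worms, trojans, and other kinds of malware. It quickly checks suspicious files and URLs, and was originally maintained by Hispasec. Unlike traditional antivirus software, it scans each file with multiple antivirus engines, so users can compare the detection results from every engine to judge whether an uploaded file is malicious.
VirusTotal is a website created by the Spanish security company Hispasec Sistemas. It launched in June 2004 and was acquired by Google in September 2012.
VirusTotal aggregates many antivirus products and online scan engines, known as contributors. In November 2018, the Cyber National Mission Force, a unit under United States Cyber Command, became a contributor. The aggregated data from these contributors lets users check for viruses that their own antivirus software may have missed, or verify suspected false positives. Files of up to 650 MB can be uploaded to the website, or sent by email (up to 32 MB). Antivirus vendors can receive copies of files that other scanners flagged but their own engine passed, which helps them improve their software and, by extension, VirusTotal's own capabilities. Users can also scan suspicious URLs and search the VirusTotal dataset. VirusTotal uses the Cuckoo sandbox for dynamic analysis of malware. PC World named VirusTotal one of its 100 best products of 2007.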