
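How to analyze search-engine spider logs
Search-engine spider logs are a powerful resource that site owners rarely use to the full. Analyzing them shows how each search engine crawls a site's content and how its spiders behave over a given period.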
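As a rough illustration of this kind of log analysis, the sketch below counts requests per spider in an access log. It assumes the server writes the common "combined" log format; the `SPIDER_TOKENS` list, the sample lines, and the user-agent strings in them are made up for the example and are not taken from real logs.

```python
import re
from collections import Counter

# Assumed layout (combined log format):
# ip - - [time] "request" status size "referer" "user-agent"
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) \S+ '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)

# Hypothetical list of user-agent fragments to look for; adjust as needed.
SPIDER_TOKENS = ("googlebot", "bingbot", "baiduspider", "proximic")


def count_spider_hits(lines):
    """Return a Counter mapping spider token -> number of matching requests."""
    hits = Counter()
    for line in lines:
        match = LOG_LINE.match(line)
        if match is None:
            continue  # skip lines that do not follow the assumed format
        agent = match.group("agent").lower()
        for token in SPIDER_TOKENS:
            if token in agent:
                hits[token] += 1
                break  # count each request once
    return hits


# Illustrative sample lines (documentation IP range, invented user agents).
SAMPLE_LINES = [
    '203.0.113.5 - - [01/Jan/2024:00:00:01 +0000] "GET / HTTP/1.1" 200 512 "-" "ExampleAgent/1.0 (compatible; proximic)"',
    '203.0.113.6 - - [01/Jan/2024:00:00:02 +0000] "GET /a HTTP/1.1" 200 128 "-" "ExampleAgent/1.0 (compatible; proximic)"',
    '203.0.113.7 - - [01/Jan/2024:00:00:03 +0000] "GET /b HTTP/1.1" 404 0 "-" "ExampleAgent/2.0 (compatible; Googlebot)"',
    '203.0.113.8 - - [01/Jan/2024:00:00:04 +0000] "GET /c HTTP/1.1" 200 64 "-" "Mozilla/5.0 (ordinary browser)"',
    'this line is malformed and is ignored',
]

if __name__ == "__main__":
    for token, count in count_spider_hits(SAMPLE_LINES).most_common():
        print(token, count)
```

Matching on the user-agent string alone is easy to spoof, so a count like this is best read alongside the IP address and hostname of each request.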
IP address (1779) | Server name | Country |
---|---|---|
54.237.205.143 | ec2-54-237-205-143.compute-1.amazonaws.com | US |
18.232.112.107 | ec2-18-232-112-107.compute-1.amazonaws.com | US |
52.90.98.244 | ec2-52-90-98-244.compute-1.amazonaws.com | US |
54.172.73.90 | ec2-54-172-73-90.compute-1.amazonaws.com | US |
107.21.82.20 | ec2-107-21-82-20.compute-1.amazonaws.com | US |
3.239.42.210 | ec2-3-239-42-210.compute-1.amazonaws.com | US |
3.239.111.62 | ec2-3-239-111-62.compute-1.amazonaws.com | US |
3.237.61.237 | ec2-3-237-61-237.compute-1.amazonaws.com | US |
3.238.152.149 | ec2-3-238-152-149.compute-1.amazonaws.com | US |
44.192.51.162 | ec2-44-192-51-162.compute-1.amazonaws.com | US |
IP address (8) | Server name | Country |
---|---|---|
34.207.241.106 | ec2-34-207-241-106.compute-1.amazonaws.com | US |
54.80.182.130 | ec2-54-80-182-130.compute-1.amazonaws.com | US |
54.80.10.131 | ec2-54-80-10-131.compute-1.amazonaws.com | US |
18.212.104.154 | ec2-18-212-104-154.compute-1.amazonaws.com | US |
3.89.156.231 | ec2-3-89-156-231.compute-1.amazonaws.com | US |
54.87.176.14 | ec2-54-87-176-14.compute-1.amazonaws.com | US |
54.197.222.220 | ec2-54-197-222-220.compute-1.amazonaws.com | US |
54.81.231.180 | ec2-54-81-231-180.compute-1.amazonaws.com | US |
54.198.200.47 | ec2-54-198-200-47.compute-1.amazonaws.com | US |
54.236.253.7 | ec2-54-236-253-7.compute-1.amazonaws.com | US |
35.171.88.187 | ec2-35-171-88-187.compute-1.amazonaws.com | US |
54.235.13.36 | ec2-54-235-13-36.compute-1.amazonaws.com | US |
18.207.161.80 | ec2-18-207-161-80.compute-1.amazonaws.com | US |
54.83.95.126 | ec2-54-83-95-126.compute-1.amazonaws.com | US |
54.237.205.143 | ec2-54-237-205-143.compute-1.amazonaws.com | US |
18.232.112.107 | ec2-18-232-112-107.compute-1.amazonaws.com | US |
52.90.98.244 | ec2-52-90-98-244.compute-1.amazonaws.com | US |
54.172.73.90 | ec2-54-172-73-90.compute-1.amazonaws.com | US |
107.21.82.20 | ec2-107-21-82-20.compute-1.amazonaws.com | US |
3.239.111.62 | ec2-3-239-111-62.compute-1.amazonaws.com | US |
75.101.213.220 | ec2-75-101-213-220.compute-1.amazonaws.com | US |
107.20.40.240 | ec2-107-20-40-240.compute-1.amazonaws.com | US |
107.22.91.186 | ec2-107-22-91-186.compute-1.amazonaws.com | US |
213.239.214.213 | server2.triona.com | DE |
50.16.112.189 | ec2-50-16-112-189.compute-1.amazonaws.com | US |
50.17.75.103 | ec2-50-17-75-103.compute-1.amazonaws.com | US |
50.17.84.14 | ec2-50-17-84-14.compute-1.amazonaws.com | US |
54.242.140.12 | ec2-54-242-140-12.compute-1.amazonaws.com | US |
54.224.221.194 | ec2-54-224-221-194.compute-1.amazonaws.com | US |
52.54.103.215 | ec2-52-54-103-215.compute-1.amazonaws.com | US |
3.93.43.124 | ec2-3-93-43-124.compute-1.amazonaws.com | US |
34.207.134.141 | ec2-34-207-134-141.compute-1.amazonaws.com | US |
54.158.163.46 | ec2-54-158-163-46.compute-1.amazonaws.com | US |
52.203.176.224 | ec2-52-203-176-224.compute-1.amazonaws.com | US |
54.88.175.51 | ec2-54-88-175-51.compute-1.amazonaws.com | US |
54.224.21.69 | ec2-54-224-21-69.compute-1.amazonaws.com | US |
3.92.63.156 | ec2-3-92-63-156.compute-1.amazonaws.com | US |
52.91.66.86 | ec2-52-91-66-86.compute-1.amazonaws.com | US |
54.145.190.99 | ec2-54-145-190-99.compute-1.amazonaws.com | US |
34.201.216.158 | ec2-34-201-216-158.compute-1.amazonaws.com | US |
54.91.72.15 | ec2-54-91-72-15.compute-1.amazonaws.com | US |
52.20.59.186 | ec2-52-20-59-186.compute-1.amazonaws.com | US |
54.224.131.56 | ec2-54-224-131-56.compute-1.amazonaws.com | US |
185.241.208.206 | 185.241.208.206 | PL |
109.70.100.65 | tor-exit-anonymizer.appliedprivacy.net | AT |
IP address (8) | Server name | Country |
---|---|---|
50.16.112.189 | ec2-50-16-112-189.compute-1.amazonaws.com | US |
50.17.84.14 | ec2-50-17-84-14.compute-1.amazonaws.com | US |
54.242.140.12 | ec2-54-242-140-12.compute-1.amazonaws.com | US |
213.239.214.213 | server2.triona.com | DE |
107.20.40.240 | ec2-107-20-40-240.compute-1.amazonaws.com | US |
107.22.91.186 | ec2-107-22-91-186.compute-1.amazonaws.com | US |
75.101.213.220 | ec2-75-101-213-220.compute-1.amazonaws.com | US |
50.17.75.103 | ec2-50-17-75-103.compute-1.amazonaws.com | US |
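Blocking is generally unnecessary, especially if you yourself benefit from search-engine-optimization services. However, if you are concerned about server resource usage and you do not use these tools, you can of course choose to block them.
You can block proximic, or restrict its access, by setting user-agent rules in your site's robots.txt. We recommend installing the Spider Analyser plugin to check whether it actually obeys these rules.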
    # robots.txt
    # The following rules will generally block this user agent
    User-agent: proximic
    Disallow: /
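Before deploying a rule like the one above, it can help to confirm that it parses the way you expect. The following is a minimal sketch using Python's standard `urllib.robotparser`. The `is_allowed` helper and the `example.com` URL are hypothetical, and the check only shows how a compliant parser reads the rule, not whether the crawler actually obeys it.

```python
from urllib.robotparser import RobotFileParser

# The rule from the text, as it would appear in robots.txt.
ROBOTS_TXT = """\
User-agent: proximic
Disallow: /
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # example.com is a placeholder URL used only for illustration.
    print("proximic:", is_allowed(ROBOTS_TXT, "proximic", "https://example.com/page"))
    print("Googlebot:", is_allowed(ROBOTS_TXT, "Googlebot", "https://example.com/page"))
```

With this rule, a compliant parser denies proximic every path while other user agents remain unaffected. Whether the crawler really stays away can only be confirmed from the server's access logs.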
You do not need to do this by hand: our WordPress plugin Spider Analyser can block unwanted spiders and crawlers for you.