Does sese-engine fully comply with the robots protocol, and what does `爬虫的名字` (the crawler name) in 配置.py mean? #38

I would like to ask whether sese-engine fully complies with the robots protocol.
When I search for `bilibili` on https://sese.yyj.moe , I get the following result:
![image](https://user-images.githubusercontent.com/73788063/210332980-bc096b07-5396-44a7-8e64-2cc2529a7129.png)
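However, judging from https://www.bilibili.com/robots.txt , `loli_spider`, the default crawler name configured in sese-engine, is clearly not one of the allowed user agents.
So if sese-engine fully complied with the robots protocol, it would not have crawled https://www.bilibili.com .
Which is it, then: did https://sese.yyj.moe change `爬虫的名字` (the crawler name), or does sese-engine not fully comply with the robots protocol?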
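
For reference, the check I have in mind can be sketched with Python's standard `urllib.robotparser`. This is only a sketch under assumptions: `SAMPLE_ROBOTS` is a made-up allow-list-style file, not a copy of bilibili's actual robots.txt, and `is_allowed` is a hypothetical helper, not sese-engine's own code.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt in the "allow-list" style: a few named crawlers
# are allowed, everything else falls through to the wildcard group.
SAMPLE_ROBOTS = """\
User-agent: Googlebot
Allow: /

User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, crawler_name: str, url: str) -> bool:
    """Return True if robots_txt permits crawler_name to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file content as an iterable of lines.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(crawler_name, url)


if __name__ == "__main__":
    for name in ("loli_spider", "Googlebot"):
        verdict = is_allowed(SAMPLE_ROBOTS, name, "https://www.bilibili.com/")
        print(name, "allowed" if verdict else "blocked")
```

Under a file like this, a crawler that strictly follows the protocol and identifies itself as `loli_spider` should skip the site entirely.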

In addition,
most Chinese websites like to end their `robots.txt` with
```
User-agent: *
Disallow: /
```
So if I want to crawl the way a regular search engine does, do I need to change `爬虫的名字` (the crawler name)?
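
As I understand it, the crawler name only matters because it decides which `User-agent` group of the file applies. Here is a rough sketch of that selection step. `matching_group` and `EXAMPLE_ROBOTS` are hypothetical, and the matching is simplified to a case-insensitive substring test with the first named match winning, falling back to `*`; real parsers may differ in detail.

```python
from typing import Optional

# Hypothetical file: one named group, then the catch-all block at the end.
EXAMPLE_ROBOTS = """\
User-agent: Googlebot
Allow: /

User-agent: *
Disallow: /
"""


def matching_group(robots_txt: str, crawler_name: str) -> Optional[str]:
    """Return the User-agent token of the group that applies to crawler_name.

    Simplified rule: the first named token contained in the crawler name
    wins (case-insensitive); otherwise the "*" group, if present.
    """
    tokens = []
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue
        field, value = line.split(":", 1)
        if field.strip().lower() == "user-agent":
            tokens.append(value.strip())

    name = crawler_name.lower()
    for token in tokens:
        if token != "*" and token.lower() in name:
            return token
    return "*" if "*" in tokens else None


if __name__ == "__main__":
    print(matching_group(EXAMPLE_ROBOTS, "loli_spider"))  # falls back to "*"
    print(matching_group(EXAMPLE_ROBOTS, "Googlebot"))    # named group
```

With a trailing catch-all block like the one above, any name that has no group of its own ends up under `Disallow: /`.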
