As we observe, redundant data may occur, due to the following reasons:
- Sending search request too often
- weixin.sogou sometimes gives redundant response
To solve:
- use a statics package, to observe the redundant rate
- modify the crawler strategy in terms of redundant rate