web scraper——爬取知乎|微博用户数据模板【三】
2024-08-30 06:15:01
前言
在这里呢,我就只给模板,不写具体的教程啦,具体的可以参考我之前写的博文。
https://www.cnblogs.com/wangyang0210/p/10338574.html
模板
进入微博选择粉丝较多的博主
复制下面的模板导入站点即可
修改地址,编辑好名称,点击
Import Sitemap
即可
微博
{"_id":"weibo_chenglong","startUrl":["https://weibo.com/p/1006051234552257/follow?relate=fans&page=[1-5]"],"selectors":[{"id":"userinfo","type":"SelectorElement","parentSelectors":["_root"],"selector":"li.follow_item","multiple":true,"delay":6},{"id":"username","type":"SelectorText","parentSelectors":["userinfo"],"selector":"a.S_txt1","multiple":false,"regex":"","delay":0},{"id":"avatar","type":"SelectorImage","parentSelectors":["userinfo"],"selector":"img","multiple":false,"delay":0},{"id":"city","type":"SelectorText","parentSelectors":["userinfo"],"selector":"div.info_add span","multiple":false,"regex":"","delay":0}]}
知乎
{"_id":"zhihuranqiqigongzuoshi","startUrl":["https://www.zhihu.com/people/xie-ling-520/followers?page=[1-45]"],"selectors":[{"id":"list","type":"SelectorElement","parentSelectors":["_root"],"selector":"div.List-item","multiple":true,"delay":0},{"id":"username","type":"SelectorText","parentSelectors":["list"],"selector":"div.UserItem-title","multiple":false,"regex":"","delay":0},{"id":"avatar","type":"SelectorImage","parentSelectors":["list"],"selector":"img","multiple":false,"delay":0}]}
最新文章
- MVVM框架下 WPF隐藏DataGrid一列
- ios 控件代码transform学习笔记
- listView当中有嵌套了有onClickListener的控件时ListView自身的onItemClick无响应的解决方案
- Java你可能不知道的事(3)HashMap
- Java 线程通信
- UIlabel 属性text
- 16.检查是否为BST
- Linux makefile教程之书写命令四[转]
- git filename to long问题解决
- centos 7 安装mp3解码器
- MVC再次学习1
- require.js实践
- React文档翻译系列(二)Hello World
- fwrite文件写入数据
- Linux-Shell基础(变量,字符串,数组)
- oracle创建用户、创建表空间、授权、建表
- 浅谈java构建工具的选择
- jenkins构建时报错
- 常用命名_html
- 如何手动编译java文件