Scraping Web Page Elements with Python
2024-09-06 02:03:44
import urllib.request
from bs4 import BeautifulSoup

# Store listing page to scrape
url = "http://www.wal-martchina.com/walmart/store/14_hubei.htm"
# Send a browser-like User-Agent header with the request
user_agent = "Mozilla/5.0 (Windows NT 6.2; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/43.0.2357.134 Safari/537.36"
request = urllib.request.Request(url)
request.add_header("User-Agent", user_agent)
content = urllib.request.urlopen(request)
# Decode the page as GB18030; naming the parser explicitly avoids bs4's "no parser specified" warning
soup = BeautifulSoup(content, "html.parser", from_encoding="gb18030")
# Store names
shopname = soup.find_all('td', {"class": "xl714445"})
# Addresses
addresss = soup.find_all('td', {"class": "xl684445"})
# Phone numbers and transit routes (the cells alternate between the two)
phones = soup.find_all('td', {"class": "xl744445"})
for shop in shopname:
    print("Store name: " + shop.text.strip())
    print("----------------------------------------------")
for address in addresss:
    print("Store address: " + address.text.strip())
# Even-indexed cells hold the phone number, odd-indexed cells the transit route
for index, phone in enumerate(phones):
    if index % 2 == 0:
        print("Phone: " + phone.text.strip())
    else:
        print("Transit route: " + phone.text.strip())
        print('---------------------------------------------------')
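
The loops above print names, addresses, and phone/route cells as three separate runs, so the output never shows one store's details together. Below is a minimal sketch of one way to group them into a record per store. It assumes the three lists line up by position and that the phone cells strictly alternate between phone number and transit route, as the loop above implies. The helper name `build_records` and the sample values are hypothetical and are not part of the original script.

```python
from itertools import zip_longest


def build_records(names, addresses, phone_cells):
    """Combine parallel lists of cell text into one dict per store.

    Assumption: phone_cells alternates phone, route, phone, route, ...
    """
    phone_numbers = phone_cells[0::2]   # even positions: phone numbers
    routes = phone_cells[1::2]          # odd positions: transit routes
    records = []
    # zip_longest pads shorter lists with "" instead of dropping stores
    for name, address, phone, route in zip_longest(
        names, addresses, phone_numbers, routes, fillvalue=""
    ):
        records.append({
            "name": name.strip(),
            "address": address.strip(),
            "phone": phone.strip(),
            "route": route.strip(),
        })
    return records


# Hypothetical sample text standing in for the scraped cells
for record in build_records([" Store A "], ["1 Example Road"], ["000-0000", "Bus 1"]):
    print(record)
```

With the scraped results, the inputs would be the cell text of each list, for example `build_records([td.text for td in shopname], [td.text for td in addresss], [td.text for td in phones])`.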