Python main-content extraction algorithm
太阳: Just one.
niu: Awesome.
Originally posted by xspoco on 2011-4-6 23:44:
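Awesome.
This language is really exhausting...

    for div in divs:
        div_html = div.__str__()
        # re_chinese presumably matches the UTF-8 bytes of Chinese text
        chinese_utf8 = re_chinese.findall(div_html)
        # a Chinese character takes 3 bytes in UTF-8
        chinese_number = len(chinese_utf8) / 3
        if chinese_number ...

This part needs changing: deleting items while iterating over the list will cause problems.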
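To illustrate the point about deleting inside the loop, here is a rough sketch, not the original poster's code. It assumes Python 3 `str` input (so characters are counted directly instead of dividing a byte count by 3). The names `count_chinese`, `keep_text_heavy`, `remove_in_place_buggy` and the threshold `min_chinese=5` are made up for the example, since the original condition is cut off.

```python
import re

# Assumption: Python 3 str input, so each regex match is one character.
# The range below covers the basic CJK Unified Ideographs block only.
RE_CHINESE = re.compile(r"[\u4e00-\u9fff]")


def count_chinese(html):
    """Return how many Chinese characters appear in an HTML string."""
    return len(RE_CHINESE.findall(html))


def remove_in_place_buggy(divs, min_chinese=5):
    """Problematic pattern: removing from the list being iterated.

    After a removal the remaining items shift left, so the loop
    skips the element that follows each removed one.
    """
    for d in divs:
        if count_chinese(str(d)) < min_chinese:
            divs.remove(d)
    return divs


def keep_text_heavy(divs, min_chinese=5):
    """Safer pattern: build a new list and leave the input untouched."""
    return [d for d in divs if count_chinese(str(d)) >= min_chinese]


if __name__ == "__main__":
    # Hypothetical sample data; plain strings stand in for parsed div tags.
    sample = [
        "<div>ad</div>",
        "<div>nav</div>",
        "<div>这是一段中文正文内容</div>",
    ]
    print(remove_in_place_buggy(list(sample)))  # "nav" survives by mistake
    print(keep_text_heavy(sample))              # only the Chinese div is kept
```

If the list really has to be modified in place, looping over a copy (`for d in divs[:]`) should also avoid the skipped elements.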