Edit history

Edit by slovazap, 01.02.19 18:24 (current version):

Why strip anything out? Do the opposite: pull out just the numbers.

>>> import re
>>> html = '<th class="tableright">&pound;1.95&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</th> <th class="tableright">&pound;2.95&nbsp;&nbsp;</th>'
>>> re.findall('[0-9]+\\.[0-9][0-9]', html)
['1.95', '2.95']
>>> list(map(float, re.findall('[0-9]+\\.[0-9][0-9]', html)))
[1.95, 2.95]

If that stops being enough, better grab lxml and parse the HTML properly.
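For when the regex is no longer enough, here is a rough sketch of what the lxml route could look like. It assumes lxml is installed and that the prices sit in `<th class="tableright">` cells as in the snippet above. The `<table><tr>` wrapper in the sample markup and the `extract_prices` name are made up for illustration, so adjust the XPath to the real page.

```python
from lxml import html as lxml_html  # third-party: pip install lxml

# Sample markup modelled on the snippet above; wrapping the cells in
# <table><tr> is an assumption about what the real page looks like.
MARKUP = (
    '<table><tr>'
    '<th class="tableright">&pound;1.95&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</th> '
    '<th class="tableright">&pound;2.95&nbsp;&nbsp;</th>'
    '</tr></table>'
)


def extract_prices(markup):
    """Return the prices found in <th class="tableright"> cells as floats."""
    tree = lxml_html.fromstring(markup)
    prices = []
    for cell in tree.xpath('//th[@class="tableright"]'):
        # text_content() returns the cell text with entities already decoded,
        # i.e. a pound sign followed by the number and non-breaking spaces.
        text = cell.text_content()
        # str.strip() also removes non-breaking spaces; then drop the pound sign.
        prices.append(float(text.strip().lstrip('£')))
    return prices


print(extract_prices(MARKUP))
```

Unlike the regex, this only looks inside the cells you name, so stray numbers elsewhere in the page will not end up in the result.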

Edit by slovazap, 01.02.19 18:24:

Why strip anything out? Do the opposite: pull out just the numbers.

>>> import re
>>> re.findall('[0-9]+\\.[0-9][0-9]', '<th class="tableright">&pound;1.95&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</th> <th class="tableright">&pound;2.95&nbsp;&nbsp;</th>')
['1.95', '2.95']
>>> list(map(float, re.findall('[0-9]+\\.[0-9][0-9]', '<th class="tableright">&pound;1.95&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</th> <th class="tableright">&pound;2.95&nbsp;&nbsp;</th>')))
[1.95, 2.95]

If that stops being enough, better grab lxml and parse the HTML properly.

Edit by slovazap, 01.02.19 18:23:

Why strip anything out? Do the opposite: pull out just the numbers.

>>> import re
>>> re.findall('[0-9]+\\.[0-9][0-9]', '<th class="tableright">&pound;1.95&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</th> <th class="tableright">&pound;2.95&nbsp;&nbsp;</th>')
['1.95', '2.95']

If that stops being enough, better grab lxml and parse the HTML properly.

Original version by slovazap, 01.02.19 18:22:

Why strip anything out? Do the opposite: pull out just the numbers.

[code]
>>> import re
>>> re.findall(r'[0-9]+\.[0-9][0-9]', '&pound;1.95&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; &pound;2.95&nbsp;&nbsp;')
['1.95', '2.95']
[/code]

If that stops being enough, better grab lxml and parse the HTML properly.