Romain Bignon
dbd87d22ab
move handle_loaders into AbstractElement
2014-09-20 23:51:28 +02:00
smurail
99e797410d
JSVar: oops
2014-09-18 12:51:28 +02:00
smurail
1b4f750888
minor fix of JSVar
2014-09-17 16:06:03 +02:00
smurail
7f7d762aa5
add some javascript dedicated filters
2014-09-17 15:57:16 +02:00
smurail
e37df1417f
Regexp: let template be a callable
2014-09-17 15:54:21 +02:00
Romain Bignon
e3666afdfe
require python-concurrent.futures only when needed
2014-09-16 20:01:45 +02:00
smurail
902f26767d
add a way to asynchronously handle requests and pages
2014-09-12 18:58:56 +02:00
smurail
1d84d6aa39
add a way to customize sign handling for CleanDecimal
2014-09-11 19:46:11 +02:00
Romain Bignon
a96b738242
raise ParseError only with None/NotAvailable/NotLoaded values, not with empty strings
2014-09-06 11:44:41 +02:00
Romain Bignon
52899a7ca9
add __repr__ on NotAvailable, NotLoaded and _NO_DEFAULT constants to be more readable on doc
2014-09-03 07:21:23 +02:00
Laurent Bachelier
e741431aa1
Add a new "Type" filter
2014-09-02 12:15:54 +02:00
Laurent Bachelier
f7136045d1
Fix doctest
2014-09-02 12:15:54 +02:00
Laurent Bachelier
59095aa85e
CleanDecimal: do not crash with inputs like NotAvailable
2014-09-02 12:15:54 +02:00
Laurent Bachelier
1838f62e28
Allow more flexibility for the submit button parameter
2014-08-26 18:05:25 +02:00
Laurent Bachelier
a3c208dccf
Add support for forms with multiple "submit" elements
2014-08-26 18:05:25 +02:00
Laurent Bachelier
3728ba2e74
Update Firefox versions to latest ESR
2014-08-26 18:05:25 +02:00
Florent
09b0de545e
set replace_dots default value to False
2014-08-21 10:27:20 +02:00
Florent
ce133dcf8a
Fix documentation of nr parameter
2014-08-19 08:58:12 +02:00
Laurent Bachelier
8c3e92aa31
fix re flags usage for Python 2.6
...
fixes #1444
2014-08-13 19:49:24 +02:00
Laurent Bachelier
3a3e3d0438
Help debug Filter errors
2014-08-05 20:27:48 +02:00
Laurent Bachelier
94deb53352
Add support for a default in Env
2014-08-05 20:27:48 +02:00
Laurent Bachelier
a0559e539e
CleanText improvements
...
* \t is always in \s so no need to add it
* handle the non-breaking space thanks to the re.UNICODE flag
* add an option to keep (but normalize) newlines
* more tests
2014-08-05 20:27:48 +02:00
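Note on the CleanText bullets above: a minimal sketch of whitespace normalization along those lines. The helper name `clean_text` and its `keep_newlines` parameter are hypothetical and are not taken from the commit; the sketch uses Python 3, where `str` patterns are already Unicode-aware, so `re.UNICODE` is passed only to mirror the commit message.

```python
import re


def clean_text(text, keep_newlines=False):
    """Collapse runs of whitespace (hypothetical helper, not the real filter)."""
    if keep_newlines:
        # Normalize line endings first, so only "\n" remains as a line break.
        text = re.sub(r'\r\n?', '\n', text)
        # Collapse every whitespace run that contains no newline.
        text = re.sub(r'[^\S\n]+', ' ', text, flags=re.UNICODE)
        # Collapse runs of newlines (and the spaces around them) into one.
        text = re.sub(r' ?\n[ \n]*', '\n', text)
    else:
        # \s already covers \t, and with Unicode matching also U+00A0.
        text = re.sub(r'\s+', ' ', text, flags=re.UNICODE)
    return text.strip()


print(repr(clean_text('a\t\u00a0 b')))
print(repr(clean_text(' a \t\r\n\n b\u00a0', keep_newlines=True)))
```

With `keep_newlines=True` the line structure survives, but each run of blank lines is reduced to a single newline.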
Laurent Bachelier
b6c6ed2306
Fix filters doctests and add them to the usual test run
2014-08-05 20:27:48 +02:00
Laurent Bachelier
d19e0637e4
CleanText: Always return unicode
2014-08-05 20:27:48 +02:00
Laurent Bachelier
819de1ace0
Do not crash if total_seconds() is not implemented
...
Which is the case with Python 2.6.
We could do the calculation ourselves, but this is not a very important
feature.
2014-08-05 20:27:48 +02:00
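Note on "we could do the calculation ourselves": a sketch of what that fallback might look like, using the equivalence given in the Python documentation for `timedelta.total_seconds()`. The function names are hypothetical; the commit itself chose not to implement this.

```python
from datetime import timedelta


def manual_total_seconds(td):
    """Compute the duration in seconds from the three stored timedelta fields."""
    # Dividing by a float keeps the result fractional on Python 2 as well.
    return (td.microseconds + (td.seconds + td.days * 24 * 3600) * 10**6) / 10.0**6


def total_seconds(td):
    """Use the built-in method when available, otherwise fall back."""
    if hasattr(td, 'total_seconds'):
        return td.total_seconds()
    return manual_total_seconds(td)


delta = timedelta(days=1, seconds=2, microseconds=500000)
print(manual_total_seconds(delta))
print(total_seconds(delta))
```

Negative durations work as well, because `timedelta` normalizes them to a negative `days` field with non-negative `seconds` and `microseconds`.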
Romain Bignon
1005197a92
add CsvPage
2014-07-11 15:24:24 +02:00
Romain Bignon
c90b5844e4
split filters into several files
2014-07-11 15:24:19 +02:00
smurail
8cb44a45a7
possibility to set custom separators for decimals
2014-07-09 19:02:23 +02:00
Florent
6759dec279
Add missing import
2014-07-09 14:58:49 +02:00
Florent
9ae7cc692f
Fix a regression: restore ListItem version
...
And move it to the same place as before to help the diff...
2014-07-09 14:51:34 +02:00
Florent
1daa866949
Move the import of html2text outside of misc
2014-07-09 11:43:14 +02:00
Florent
fb555c3079
Do not import lxml in headers of page.py
2014-07-09 10:23:24 +02:00
Florent
8a2a1ece5e
Do not always import module used only in debug mode
2014-07-09 10:23:24 +02:00
Florent
76cb004eb4
Move ItemListTable-Element outside of page.py
...
One of the goals is to avoid importing all the modules needed by filters
when loading the page file.
With the same goal, move the import of parsers into the class definition.
2014-07-09 10:23:20 +02:00
Romain Bignon
983ed221e2
ability to use filters as classes in chain (refs #1426 )
2014-07-05 20:22:45 +02:00
Romain Bignon
2268eb2ff1
ability to use Dict['a']['b']['c'] instead of Dict('a/b/c') (refs #1426 )
2014-07-05 20:22:39 +02:00
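Note on `Dict['a']['b']['c']`: subscripting the class itself needs a `__getitem__` on its metaclass. The sketch below shows one way this could be arranged; the names `_DictMeta` and `keys` are assumptions, and the Python 3 metaclass syntax is used here although the 2014 code base targeted Python 2.

```python
class _DictMeta(type):
    """Hypothetical metaclass so that Dict['a'] works on the class itself."""

    def __getitem__(cls, key):
        # Start from an empty path and delegate to the instance-level lookup.
        return cls()[key]


class Dict(metaclass=_DictMeta):
    """Hypothetical filter that walks nested mappings along a key path."""

    def __init__(self, path=''):
        self.keys = [k for k in path.split('/') if k]

    def __getitem__(self, key):
        # Return a new filter with one more key, leaving this one unchanged.
        extended = Dict()
        extended.keys = self.keys + [key]
        return extended

    def __call__(self, data):
        for key in self.keys:
            data = data[key]
        return data


doc = {'a': {'b': {'c': 1}}}
print(Dict('a/b/c')(doc), Dict['a']['b']['c'](doc))
```

Both spellings build the same key path, so they return the same value for the same input.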
Romain Bignon
8efd37e71d
overload & and | operators to chain filters (refs #1426 )
2014-07-05 20:00:04 +02:00
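Note on chaining with `&` and `|`: the commit message does not spell out the semantics, so the sketch below assumes that `&` feeds one filter's output into the next and that `|` attaches a fallback value. All class names are hypothetical, and the `_NO_DEFAULT` sentinel here is a local stand-in, not the project's constant.

```python
_NO_DEFAULT = object()  # local sentinel meaning "no fallback configured"


class Filter(object):
    """Hypothetical base filter supporting `&` (chain) and `|` (default)."""

    def __init__(self):
        self.parent = None
        self.default = _NO_DEFAULT

    def transform(self, value):
        return value

    def __and__(self, other):
        # Accept either a filter instance or a filter class.
        if isinstance(other, type) and issubclass(other, Filter):
            other = other()
        other.parent = self
        return other

    def __or__(self, default):
        self.default = default
        return self

    def __call__(self, value):
        try:
            if self.parent is not None:
                value = self.parent(value)
            return self.transform(value)
        except ValueError:
            if self.default is _NO_DEFAULT:
                raise
            return self.default


class Strip(Filter):
    def transform(self, value):
        return value.strip()


class ToInt(Filter):
    def transform(self, value):
        return int(value)


# `&` binds tighter than `|`, so this reads as (Strip() & ToInt) | 0.
to_number = Strip() & ToInt | 0
print(to_number(' 42 '), to_number('n/a'))
```

Because Python gives `&` higher precedence than `|`, the fallback applies to the whole chain without extra parentheses.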
Bezleputh
ac161104ea
[filter] manage basestring entry in CleanHTML filter
2014-07-05 14:24:37 +02:00
Bezleputh
96271b6de4
[filters] manage default in Dict filter
2014-07-05 14:24:14 +02:00
Laurent Bachelier
73cd8762f5
Allow for a default argument in MultiFilter
2014-07-05 14:23:29 +02:00
Laurent Bachelier
3f2d8ae185
Allow for a custom element finder
...
And end up with less duplicate code!
2014-07-05 14:23:27 +02:00
Romain Bignon
18c1f46922
ability to override the flush() method
2014-07-01 20:37:58 +02:00
Vincent Paredes
714a0e7617
matching content with url using is_here
2014-07-01 15:52:51 +02:00
Laurent Bachelier
b9c6176628
browser2: Allow setting query string params on build_url
...
The outcome is exactly the same as using requests with the "params"
parameter.
2014-06-20 17:58:51 +02:00
Laurent Bachelier
5dd0e9e0ec
Small style fixes
2014-06-20 17:58:51 +02:00
Laurent Bachelier
b013828ad0
browser2: Add a filter to change the base element used for selectors
2014-06-17 00:48:30 +02:00
Laurent Bachelier
04cec70e1f
browser2 filters: Force unicode, little style fixes
...
lxml for Python2 has the tendency to return str instead of
unicode when the contents are pure ASCII.
Try to fix the nonsense.
2014-06-04 00:58:35 +02:00
Laurent Bachelier
9619ddcaa2
browser2: Add RawText filter
...
Allows getting .text of elements without any alteration.
This is useful for at least textarea and pre tags.
Maybe the .join character should be configurable.
2014-06-04 00:49:00 +02:00
Laurent Bachelier
c69c5cf5ef
browser2: More specialized exceptions
...
and extend common exceptions
2014-06-03 22:28:21 +02:00
Laurent Bachelier
e01fda826c
filters: Properly handle defaults that are not datetimes
2014-05-27 17:41:51 +02:00