Commit graph

198 commits

Author SHA1 Message Date
Romain Bignon
dbd87d22ab move handle_loaders into AbstractElement 2014-09-20 23:51:28 +02:00
smurail
99e797410d JSVar: oops 2014-09-18 12:51:28 +02:00
smurail
1b4f750888 minor fix of JSVar 2014-09-17 16:06:03 +02:00
smurail
7f7d762aa5 add some javascript dedicated filters 2014-09-17 15:57:16 +02:00
smurail
e37df1417f Regexp: let template be a callable 2014-09-17 15:54:21 +02:00
Romain Bignon
e3666afdfe require python-concurrent.futures only when needed 2014-09-16 20:01:45 +02:00
smurail
902f26767d add a way to asynchronously handle requests and pages 2014-09-12 18:58:56 +02:00
smurail
1d84d6aa39 add a way to customize sign handling for CleanDecimal 2014-09-11 19:46:11 +02:00
Romain Bignon
a96b738242 raise ParseError only with None/NotAvailable/NotLoaded values, not with empty strings 2014-09-06 11:44:41 +02:00
Romain Bignon
52899a7ca9 add __repr__ on NotAvailable, NotLoaded and _NO_DEFAULT constants to be more readable on doc 2014-09-03 07:21:23 +02:00
Laurent Bachelier
e741431aa1 Add a new "Type" filter 2014-09-02 12:15:54 +02:00
Laurent Bachelier
f7136045d1 Fix doctest 2014-09-02 12:15:54 +02:00
Laurent Bachelier
59095aa85e CleanDecimal: do not crash with inputs like NotAvailable 2014-09-02 12:15:54 +02:00
Laurent Bachelier
1838f62e28 Allow more flexibility for the submit button parameter 2014-08-26 18:05:25 +02:00
Laurent Bachelier
a3c208dccf Add support for forms with multiple "submit" elements 2014-08-26 18:05:25 +02:00
Laurent Bachelier
3728ba2e74 Update Firefox versions to latest ESR 2014-08-26 18:05:25 +02:00
Florent
09b0de545e set replace_dots default value to False 2014-08-21 10:27:20 +02:00
Florent
ce133dcf8a Fix documentation of nr parameter 2014-08-19 08:58:12 +02:00
Laurent Bachelier
8c3e92aa31 fix re flags usage for Python 2.6
fixes #1444
2014-08-13 19:49:24 +02:00
Laurent Bachelier
3a3e3d0438 Help debug Filter errors 2014-08-05 20:27:48 +02:00
Laurent Bachelier
94deb53352 Add support for a default in Env 2014-08-05 20:27:48 +02:00
Laurent Bachelier
a0559e539e CleanText improvements
* \t is always in \s so no need to add it
* handle the non-breaking space thanks to the re.UNICODE flag
* add an option to keep (but normalize) newlines
* more tests
2014-08-05 20:27:48 +02:00
Laurent Bachelier
b6c6ed2306 Fix filters doctests and add them to the usual test run 2014-08-05 20:27:48 +02:00
Laurent Bachelier
d19e0637e4 CleanText: Always return unicode 2014-08-05 20:27:48 +02:00
Laurent Bachelier
819de1ace0 Do not crash if total_seconds() is not implemented
Which is the case with Python 2.6.
We could do the calculation ourselves, but this is not a very important
feature.
2014-08-05 20:27:48 +02:00
Romain Bignon
1005197a92 add CsvPage 2014-07-11 15:24:24 +02:00
Romain Bignon
c90b5844e4 split filters into several files 2014-07-11 15:24:19 +02:00
smurail
8cb44a45a7 possibility to set custom separators for decimals 2014-07-09 19:02:23 +02:00
Florent
6759dec279 Add missing import 2014-07-09 14:58:49 +02:00
Florent
9ae7cc692f Fix a regression: restore ListItem version
And move it in the same place than before to help the diff...
2014-07-09 14:51:34 +02:00
Florent
1daa866949 Move the import oh html2text outside of misc 2014-07-09 11:43:14 +02:00
Florent
fb555c3079 Do not import lxml in headers of page.py 2014-07-09 10:23:24 +02:00
Florent
8a2a1ece5e Do not always import module used only in debug mode 2014-07-09 10:23:24 +02:00
Florent
76cb004eb4 Move ItemListTable-Element outside of page.py
One of the goal is to not import all modules needed by filters by
loading the page file.

In the same goal, move the import of parsers in the class definition.
2014-07-09 10:23:20 +02:00
Romain Bignon
983ed221e2 ability to use filters as classes in chain (refs #1426) 2014-07-05 20:22:45 +02:00
Romain Bignon
2268eb2ff1 ability to use Dict['a']['b']['c'] instead of Dict('a/b/c') (refs #1426) 2014-07-05 20:22:39 +02:00
Romain Bignon
8efd37e71d overload & and | operators to chain filters (refs #1426) 2014-07-05 20:00:04 +02:00
Bezleputh
ac161104ea [filter] manage basestring entry in CleanHTML filter 2014-07-05 14:24:37 +02:00
Bezleputh
96271b6de4 [filters] manage default in Dict filter 2014-07-05 14:24:14 +02:00
Laurent Bachelier
73cd8762f5 Allow for a default argument in MultiFilter 2014-07-05 14:23:29 +02:00
Laurent Bachelier
3f2d8ae185 Allow for a custom element finder
And end up with less duplicate code!
2014-07-05 14:23:27 +02:00
Romain Bignon
18c1f46922 ability to override the flush() method 2014-07-01 20:37:58 +02:00
Vincent Paredes
714a0e7617 matching content with url using is_here 2014-07-01 15:52:51 +02:00
Laurent Bachelier
b9c6176628 browser2: Allow setting query string params on build_url
The outcome is exactly the same as using requests with the "params"
parameter.
2014-06-20 17:58:51 +02:00
Laurent Bachelier
5dd0e9e0ec Small style fixes 2014-06-20 17:58:51 +02:00
Laurent Bachelier
b013828ad0 browser2: Add a filter to change the base element used for selectors 2014-06-17 00:48:30 +02:00
Laurent Bachelier
04cec70e1f browser2 filters: Force unicode, little style fixes
lxml for Python2 has the tendency to return str instead of
unicode when the contents are pure ASCII.
Try to fix the nonsense.
2014-06-04 00:58:35 +02:00
Laurent Bachelier
9619ddcaa2 browser2: Add RawText filter
Allows getting .text of elements without any alteration.
This is useful for at least textarea and pre tags.

Maybe the .join character should be configurable.
2014-06-04 00:49:00 +02:00
Laurent Bachelier
c69c5cf5ef browser2: More specialized exceptions
and extend common exceptions
2014-06-03 22:28:21 +02:00
Laurent Bachelier
e01fda826c filters: Properly handle defaults that are not datetimes 2014-05-27 17:41:51 +02:00