Florent
0088013ae1
Rename BaseBrowser to Browser
2014-09-26 10:18:12 +02:00
Florent
444d36eee8
Rename BasePage to Page
2014-09-26 10:18:12 +02:00
smurail
26c929ff9d
JSVar: fix encoding issues and simplify code
2014-09-24 16:00:48 +02:00
smurail
128739aceb
JSVar: add cornercase
2014-09-23 11:12:19 +02:00
smurail
72852cee13
let JSVar inherits from parameters of Regexp (like nth)
2014-09-23 11:12:18 +02:00
smurail
eb335e66fc
add an nth parameter to Regexp filter
2014-09-23 11:12:17 +02:00
Vincent Paredes
c44c7853d0
backporting mergin_hook to support hook's requests in wheezy
2014-09-22 23:00:03 +02:00
Romain Bignon
dbd87d22ab
move handle_loaders into AbstractElement
2014-09-20 23:51:28 +02:00
smurail
99e797410d
JSVar: oops
2014-09-18 12:51:28 +02:00
smurail
1b4f750888
minor fix of JSVar
2014-09-17 16:06:03 +02:00
smurail
7f7d762aa5
add some javascript dedicated filters
2014-09-17 15:57:16 +02:00
smurail
e37df1417f
Regexp: let template be a callable
2014-09-17 15:54:21 +02:00
Romain Bignon
e3666afdfe
require python-concurrent.futures only when needed
2014-09-16 20:01:45 +02:00
smurail
902f26767d
add a way to asynchronously handle requests and pages
2014-09-12 18:58:56 +02:00
smurail
1d84d6aa39
add a way to customize sign handling for CleanDecimal
2014-09-11 19:46:11 +02:00
Romain Bignon
a96b738242
raise ParseError only with None/NotAvailable/NotLoaded values, not with empty strings
2014-09-06 11:44:41 +02:00
Romain Bignon
52899a7ca9
add __repr__ on NotAvailable, NotLoaded and _NO_DEFAULT constants to be more readable on doc
2014-09-03 07:21:23 +02:00
Laurent Bachelier
e741431aa1
Add a new "Type" filter
2014-09-02 12:15:54 +02:00
Laurent Bachelier
f7136045d1
Fix doctest
2014-09-02 12:15:54 +02:00
Laurent Bachelier
59095aa85e
CleanDecimal: do not crash with inputs like NotAvailable
2014-09-02 12:15:54 +02:00
Laurent Bachelier
1838f62e28
Allow more flexibility for the submit button parameter
2014-08-26 18:05:25 +02:00
Laurent Bachelier
a3c208dccf
Add support for forms with multiple "submit" elements
2014-08-26 18:05:25 +02:00
Laurent Bachelier
3728ba2e74
Update Firefox versions to latest ESR
2014-08-26 18:05:25 +02:00
Florent
09b0de545e
set replace_dots default value to False
2014-08-21 10:27:20 +02:00
Florent
ce133dcf8a
Fix documentation of nr parameter
2014-08-19 08:58:12 +02:00
Laurent Bachelier
8c3e92aa31
fix re flags usage for Python 2.6
...
fixes #1444
2014-08-13 19:49:24 +02:00
Laurent Bachelier
3a3e3d0438
Help debug Filter errors
2014-08-05 20:27:48 +02:00
Laurent Bachelier
94deb53352
Add support for a default in Env
2014-08-05 20:27:48 +02:00
Laurent Bachelier
a0559e539e
CleanText improvements
...
* \t is always in \s so no need to add it
* handle the non-breaking space thanks to the re.UNICODE flag
* add an option to keep (but normalize) newlines
* more tests
2014-08-05 20:27:48 +02:00
Laurent Bachelier
b6c6ed2306
Fix filters doctests and add them to the usual test run
2014-08-05 20:27:48 +02:00
Laurent Bachelier
d19e0637e4
CleanText: Always return unicode
2014-08-05 20:27:48 +02:00
Laurent Bachelier
819de1ace0
Do not crash if total_seconds() is not implemented
...
Which is the case with Python 2.6.
We could do the calculation ourselves, but this is not a very important
feature.
2014-08-05 20:27:48 +02:00
Romain Bignon
1005197a92
add CsvPage
2014-07-11 15:24:24 +02:00
Romain Bignon
c90b5844e4
split filters into several files
2014-07-11 15:24:19 +02:00
smurail
8cb44a45a7
possibility to set custom separators for decimals
2014-07-09 19:02:23 +02:00
Florent
6759dec279
Add missing import
2014-07-09 14:58:49 +02:00
Florent
9ae7cc692f
Fix a regression: restore ListItem version
...
And move it in the same place than before to help the diff...
2014-07-09 14:51:34 +02:00
Florent
1daa866949
Move the import oh html2text outside of misc
2014-07-09 11:43:14 +02:00
Florent
fb555c3079
Do not import lxml in headers of page.py
2014-07-09 10:23:24 +02:00
Florent
8a2a1ece5e
Do not always import module used only in debug mode
2014-07-09 10:23:24 +02:00
Florent
76cb004eb4
Move ItemListTable-Element outside of page.py
...
One of the goal is to not import all modules needed by filters by
loading the page file.
In the same goal, move the import of parsers in the class definition.
2014-07-09 10:23:20 +02:00
Romain Bignon
983ed221e2
ability to use filters as classes in chain (refs #1426 )
2014-07-05 20:22:45 +02:00
Romain Bignon
2268eb2ff1
ability to use Dict['a']['b']['c'] instead of Dict('a/b/c') (refs #1426 )
2014-07-05 20:22:39 +02:00
Romain Bignon
8efd37e71d
overload & and | operators to chain filters (refs #1426 )
2014-07-05 20:00:04 +02:00
Bezleputh
ac161104ea
[filter] manage basestring entry in CleanHTML filter
2014-07-05 14:24:37 +02:00
Bezleputh
96271b6de4
[filters] manage default in Dict filter
2014-07-05 14:24:14 +02:00
Laurent Bachelier
73cd8762f5
Allow for a default argument in MultiFilter
2014-07-05 14:23:29 +02:00
Laurent Bachelier
3f2d8ae185
Allow for a custom element finder
...
And end up with less duplicate code!
2014-07-05 14:23:27 +02:00
Romain Bignon
18c1f46922
ability to override the flush() method
2014-07-01 20:37:58 +02:00
Vincent Paredes
714a0e7617
matching content with url using is_here
2014-07-01 15:52:51 +02:00