Commit graph

170 commits

Author SHA1 Message Date
Florent
6759dec279 Add missing import 2014-07-09 14:58:49 +02:00
Florent
9ae7cc692f Fix a regression: restore ListItem version
And move it in the same place than before to help the diff...
2014-07-09 14:51:34 +02:00
Florent
1daa866949 Move the import oh html2text outside of misc 2014-07-09 11:43:14 +02:00
Florent
fb555c3079 Do not import lxml in headers of page.py 2014-07-09 10:23:24 +02:00
Florent
8a2a1ece5e Do not always import module used only in debug mode 2014-07-09 10:23:24 +02:00
Florent
76cb004eb4 Move ItemListTable-Element outside of page.py
One of the goal is to not import all modules needed by filters by
loading the page file.

In the same goal, move the import of parsers in the class definition.
2014-07-09 10:23:20 +02:00
Romain Bignon
983ed221e2 ability to use filters as classes in chain (refs #1426) 2014-07-05 20:22:45 +02:00
Romain Bignon
2268eb2ff1 ability to use Dict['a']['b']['c'] instead of Dict('a/b/c') (refs #1426) 2014-07-05 20:22:39 +02:00
Romain Bignon
8efd37e71d overload & and | operators to chain filters (refs #1426) 2014-07-05 20:00:04 +02:00
Bezleputh
ac161104ea [filter] manage basestring entry in CleanHTML filter 2014-07-05 14:24:37 +02:00
Bezleputh
96271b6de4 [filters] manage default in Dict filter 2014-07-05 14:24:14 +02:00
Laurent Bachelier
73cd8762f5 Allow for a default argument in MultiFilter 2014-07-05 14:23:29 +02:00
Laurent Bachelier
3f2d8ae185 Allow for a custom element finder
And end up with less duplicate code!
2014-07-05 14:23:27 +02:00
Romain Bignon
18c1f46922 ability to override the flush() method 2014-07-01 20:37:58 +02:00
Vincent Paredes
714a0e7617 matching content with url using is_here 2014-07-01 15:52:51 +02:00
Laurent Bachelier
b9c6176628 browser2: Allow setting query string params on build_url
The outcome is exactly the same as using requests with the "params"
parameter.
2014-06-20 17:58:51 +02:00
Laurent Bachelier
5dd0e9e0ec Small style fixes 2014-06-20 17:58:51 +02:00
Laurent Bachelier
b013828ad0 browser2: Add a filter to change the base element used for selectors 2014-06-17 00:48:30 +02:00
Laurent Bachelier
04cec70e1f browser2 filters: Force unicode, little style fixes
lxml for Python2 has the tendency to return str instead of
unicode when the contents are pure ASCII.
Try to fix the nonsense.
2014-06-04 00:58:35 +02:00
Laurent Bachelier
9619ddcaa2 browser2: Add RawText filter
Allows getting .text of elements without any alteration.
This is useful for at least textarea and pre tags.

Maybe the .join character should be configurable.
2014-06-04 00:49:00 +02:00
Laurent Bachelier
c69c5cf5ef browser2: More specialized exceptions
and extend common exceptions
2014-06-03 22:28:21 +02:00
Laurent Bachelier
e01fda826c filters: Properly handle defaults that are not datetimes 2014-05-27 17:41:51 +02:00
Romain Bignon
6d451e5f34 Date filter: use default value for empty input 2014-05-27 12:21:24 +02:00
Bezleputh
3c4f8d35e0 [Filters] convert date in english in DateTime 2014-05-21 17:54:55 +02:00
Romain Bignon
a5f95183a7 fix syntax error 2014-05-19 07:50:45 +02:00
Romain Bignon
c409675e6c fix compatibility with python2.6 2014-05-19 07:23:12 +02:00
Laurent Bachelier
14b1b56914 browser2: Add an option to convert POST data to the proper encoding
And autodetect it on forms. There is no other way to know what is the
expected encoding.
2014-05-19 01:01:25 +02:00
Romain Bignon
3e1dec519e move ParseError into weboob.tools.exceptions 2014-05-17 14:27:55 +02:00
Romain Bignon
6fcac89dd5 first step in python3 support 2014-05-17 14:27:55 +02:00
Romain Bignon
686a3b77e8 fix URL.id2url (give the browser instance to URL.build()) 2014-05-17 14:27:34 +02:00
Laurent Bachelier
82f47bff88 Allow forcing a Page content encoding 2014-05-16 15:37:24 +02:00
Laurent Bachelier
e01b39c8d2 Also ignore URLs where all kwargs were not used 2014-05-16 11:47:25 +02:00
Laurent Bachelier
6e9910ae9a Only use full-name substitutions, to allow % in URLs 2014-05-16 11:47:25 +02:00
Roger Philibert
6031ff1ef9 Form.submit can take extra parameters given to location() 2014-05-09 22:59:36 +02:00
Bezleputh
fadd88dafc [browser2] Add a Dict filter 2014-05-06 22:32:41 +02:00
Romain Bignon
ab710e0f74 support GET forms 2014-04-29 22:00:49 +02:00
Romain Bignon
61bc712068 Revert "Detect duplicate objects with id "0""
This reverts commit 6cae2cd0a5.
2014-04-26 12:07:22 +02:00
Florent
6cae2cd0a5 Detect duplicate objects with id "0" 2014-04-24 16:18:19 +02:00
Florent
fe5bfee255 Add ignore_duplicate option for crazy modules 2014-04-24 16:16:50 +02:00
Bezleputh
cdf2996d57 [browther2] Display error message instead of raising DataError when we find the same ids in ListElement 2014-04-23 12:37:36 +02:00
Romain bignon
b268fc043b Use regexp to validate urls 2014-04-18 09:55:31 +02:00
Romain Bignon
d291019ce9 fix TableElement to be really case insensitive 2014-04-15 09:15:12 +02:00
Romain Bignon
55c80097fe add BaseBrowser.prepare_request virtual method 2014-04-14 16:42:39 +02:00
Vincent Paredes
4ac38b89cf fixing DateGuessing filter 2014-04-14 16:42:39 +02:00
Vincent Paredes
f7c4bb39c5 remove case sensitive checking 2014-04-14 16:42:39 +02:00
Romain Bignon
1bef60b347 URL builds absolute urls from BASEURL 2014-04-13 12:32:02 +02:00
Romain Bignon
07f6507096 improve documentation of browser2 2014-04-12 15:02:34 +02:00
Romain Bignon
5f59c130b3 add decorator pagination 2014-04-12 12:48:14 +02:00
Bezleputh
5158aabe98 [browser2] add a textCleaner parameter to Join 2014-04-10 10:09:30 +02:00
Romain Bignon
690010a8d2 add DateGuesser filter 2014-04-07 07:43:22 +02:00