summary |
shortlog | log |
commit |
commitdiff |
tree
first ⋅ prev ⋅ next
Frederic Jolliton [Tue, 13 Sep 2005 06:09:05 +0000 (06:09 +0000)]
Updated HTMLParser to recognize XML element name and attribute name.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-35
Frederic Jolliton [Tue, 13 Sep 2005 05:57:27 +0000 (05:57 +0000)]
Added two iterators (iterChildOrTop and iterAttributeOrTop) for XSLT support.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-34
Frederic Jolliton [Mon, 12 Sep 2005 09:09:23 +0000 (09:09 +0000)]
Discard duplicate matches. (Also "tabified" source.)
* Updated Base.parse to discard duplicate matches.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-33
Frederic Jolliton [Mon, 12 Sep 2005 08:43:10 +0000 (08:43 +0000)]
Small enhancement to lispy function.
* lispy now drop the 'u' prefix of unicode strings representation.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-32
Frederic Jolliton [Mon, 12 Sep 2005 08:40:38 +0000 (08:40 +0000)]
Fixed several functions (see details.)
* Updated registerFast to check return value of functions in debug mode
(to test if they are correctly returning a Sequence.)
* Fixed register which was creating a wrapper that was not forwarding
parameters correctly.
* Updated op:union (small optimization.)
* Fixed op:intersect and op:except. They were returning wrong result
in some cases.
* Updated various functions to correctly return a Sequence instead of an
item.
* Removed some call to zeroOrMoreItem, since this was a no-op.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-31
Frederic Jolliton [Mon, 12 Sep 2005 08:03:55 +0000 (08:03 +0000)]
Updated XPath parser to allow spaces around unary + and -.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-30
Frederic Jolliton [Mon, 12 Sep 2005 08:01:50 +0000 (08:01 +0000)]
Catch AmbiguousMatch when printing syntax tree of an XPath expression.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-29
Frederic Jolliton [Mon, 12 Sep 2005 04:57:30 +0000 (04:57 +0000)]
Updated XPath parser. Fixed fn:number and op:minus. Added fn:document-uri, fn:data and op:plus.
* Update XPath parser to correctly handle op:plus (+X was same as X,
instead of ensuring X was cast to a number.) Added fn:plus accordingly.
* Fixed op:minus to atomize its argument as needed.
* Fixed fn:number to return a Sequence.
* Added fn:document-uri and fn:data.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-28
Frederic Jolliton [Mon, 12 Sep 2005 04:53:33 +0000 (04:53 +0000)]
Fixed Document.dmDocumentUri()
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-27
Frederic Jolliton [Mon, 12 Sep 2005 03:23:16 +0000 (03:23 +0000)]
Fixed one optimization.
* Fixed optimization to find descendant attributes with a predicate.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-26
Frederic Jolliton [Mon, 12 Sep 2005 03:14:19 +0000 (03:14 +0000)]
Commented out two optimizations that break predicate tests.
* Commented out two optimizations that were breaking predicate tests
based on position():
- descendant-or-self::node()/child::<TEST>[<PRED>] -> descendant::<TEST>[<PRED>]
- descendant-or-self::node()/attribute::<NAME>[<PRED>] -> ext:descendant-attribute(<NAME>)[PRED]
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-25
Frederic Jolliton [Mon, 12 Sep 2005 03:08:03 +0000 (03:08 +0000)]
Updated Node.__cmp__ to work with other types.
* Update Node.__cmp__ to work when compared with other type.
In such case, comparison is done according to type of both
object, instead of comparing values.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-24
Frederic Jolliton [Mon, 12 Sep 2005 03:05:25 +0000 (03:05 +0000)]
Updated fn:contains.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-23
Frederic Jolliton [Mon, 12 Sep 2005 02:59:09 +0000 (02:59 +0000)]
Added sort function. Added '_' variable to keep last result.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-22
Frederic Jolliton [Sun, 11 Sep 2005 06:13:41 +0000 (06:13 +0000)]
Added more XPath functions.
* Added support for special floats (nan, -inf, +inf)
* Added optimization to prevent building full string representation of a
node (using string iterator instead of dmStringValue.)
* Several new functions.
* Updated some functions to follow standard more closely.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-21
Frederic Jolliton [Sun, 11 Sep 2005 05:49:25 +0000 (05:49 +0000)]
Added functions for special floats numbers. Added functions for working on string iterator.
* Added functions to test special floats numbers (NaN, +inf, -inf)
(isNotANumber, isInfinity, isPositiveInfinity, isNegativeInfinity,
isSpecialFloat)
* Added some functions that can work on string and also on iterator of
strings to:
- compute the total length,
- test if a string starts with another string,
- test if two strings are equals.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-20
Frederic Jolliton [Sun, 11 Sep 2005 05:43:30 +0000 (05:43 +0000)]
Clone nodes as needed. Added __cmp__ to Node. Removed dead function _getEncoder.
* Nodes are cloned when inserted in a tree if they're already part of
another tree.
* Added __cmp__ operator, which compare nodes based on string values.
* Removed dead function _getEncoder.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-19
Frederic Jolliton [Sun, 11 Sep 2005 05:39:30 +0000 (05:39 +0000)]
Better function arity checking. Fixed a raise statement.
* Enhanced functionArity to return None as maximum number of parameters
in case where the function is declared with a *args style argument.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-18
Frederic Jolliton [Sat, 10 Sep 2005 23:41:19 +0000 (23:41 +0000)]
Missing Decimal import.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-17
Frederic Jolliton [Sat, 10 Sep 2005 23:40:22 +0000 (23:40 +0000)]
Sequence can now be compared with a single item.
* Update Sequence's __eq__ operator to allow comparison with a single
item (in which case, True is returned if Sequence is of length 1 and
its only item is equal to the specified item.)
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-16
Frederic Jolliton [Sat, 10 Sep 2005 23:28:27 +0000 (23:28 +0000)]
Added 'parse' function to XPath prompt.
* Added 'parse' function that can parse a HTML (or XML) document and
return the corresponding document.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-15
Frederic Jolliton [Sat, 10 Sep 2005 23:26:48 +0000 (23:26 +0000)]
Tags now allow nested tuple/list, including iterators, to create nodes.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-14
Frederic Jolliton [Thu, 8 Sep 2005 03:16:59 +0000 (03:16 +0000)]
Enhanced encoding detection when parsing HTML (or XML) document.
* Enchanced encoding detection algorithm, to check BOM, XML declaration
and "http-equiv" meta element. Moved to misc module.
* Fixed bug in guessXmlCharacterEncoding when passing an Unicode string
(skip offset was missing from the return value.)
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-13
Frederic Jolliton [Wed, 7 Sep 2005 23:25:14 +0000 (23:25 +0000)]
Ensure wrappers function name are set with a non-unicode string.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-12
Frederic Jolliton [Wed, 7 Sep 2005 23:22:47 +0000 (23:22 +0000)]
Fixed prompt to correctly handle 8bits input.
* Fixed prompt to correctly handle 8bits input, such as when using
Unicode terminal.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-11
Frederic Jolliton [Wed, 7 Sep 2005 23:20:23 +0000 (23:20 +0000)]
Added fn:contains and fn:ends-with. Optimized fn:starts-with.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-10
Frederic Jolliton [Wed, 7 Sep 2005 21:45:23 +0000 (21:45 +0000)]
Added fn:starts-with function.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-9
Frederic Jolliton [Wed, 7 Sep 2005 21:44:20 +0000 (21:44 +0000)]
Fixed attribute parsing to tolerate more bad stuff (from Google..)
* Fixed attribute parsing to tolerate some bad formated value
(such as those who contains '=' character without even being
quoted properly.)
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-8
Frederic Jolliton [Wed, 7 Sep 2005 16:31:10 +0000 (16:31 +0000)]
Handle ^C from prompt.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-7
Frederic Jolliton [Wed, 7 Sep 2005 16:27:51 +0000 (16:27 +0000)]
Fixed <script>/<style> element parsing.
* Fixed HTML parser to wait correct ending tag for <script> and <style>
element (previously we were stopping at the first "</" found.)
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-6
Frederic Jolliton [Wed, 7 Sep 2005 15:53:11 +0000 (15:53 +0000)]
tx-prompt: Fake UserAgent by default + allow to change UserAgent at runtime.
* Default UserAgent (so that testing with Google and some other sites
works.)
* Allow to change UserAgent at runtime by updating $ua variable.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-5
Frederic Jolliton [Wed, 7 Sep 2005 15:46:30 +0000 (15:46 +0000)]
Updated install script to generate symbolic link.
* Updated install script to generate symbolic link when specified in
install_conf.py.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-4
Frederic Jolliton [Wed, 7 Sep 2005 15:45:19 +0000 (15:45 +0000)]
Fixed HTML parsing when several attributes have the same name.
* Fixed HTML parsing by allowing several attributes with same name.
Only the first of such set is kept.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-3
Frederic Jolliton [Wed, 7 Sep 2005 15:43:05 +0000 (15:43 +0000)]
Fix attributes with no value following '='.
* Fixed attributes parsing by allowing empty value after =.
We return '' (empty string) instead of None previously.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-2
Frederic Jolliton [Wed, 7 Sep 2005 13:52:06 +0000 (13:52 +0000)]
Fix optimization when looking for attributes with any name ('*').
* Fixed extDescendantAttribute function to handle correctly attribute
name '*'.
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--patch-1
Frederic Jolliton [Wed, 7 Sep 2005 13:23:34 +0000 (13:23 +0000)]
initial import
(automatically generated log message)
git-archimport-id: frederic@jolliton.com--2005-main/tx--main--0.1--base-0