Using tohtml to translate &
Kenneth Brody
kenbrody at spamcop.net
Tue Jun 28 09:05:09 PDT 2011
On 6/24/2011 5:09 PM, Mike Schwartz wrote:
>>> I have to read some incoming XML files that are encoded with&,
>>> >, etcetera. filePro 5.0.14 doesn't have a "FROMHTML()" function
>> available.
>
> Ok, I *ALMOST* have the following edit working to un-unicode a field.
> I just have a problem with how to quote the< and> character:
>
> ununi °{ {!" "!} @ | !&!<&> | !<!<"<"> | !>!<"<"> | *}
>
> This string works OK for 1 or more "&" strings. It removes the
> "&", replaces it with an "&" and closes up the extra spaces.
>
> However, with a "test< test" string, it gives leaves me with a
> string looking like "test< test" and with "test> test" it gives me a
> syntax error.
>
> The following string seems to work:
>
> ununi °{ {!" "!} @ | !&!<&> | !<!<z> | !>!<q> | *}
>
> any suggestions?
You appear to have hit an "undocumented restriction" :-) in filePro,
regarding the apparent-but-not-really nesting of punctuation such as "<"
within an edit. (ie: you have a quoted "<" within the "<...>" item.)
If that "undocumented restriction" :-) were eliminated, the following would
work:
unhtml { @ | (!"&"!unhtml2!";"!) | * }
unhtml2 !amp!<"&"> | !eq!<"="> | !lt!<"<"> | !gt!<">">
Note, too, that your construct:
{!" "!} @
is redundant, as the "@" will eat all trailing spaces.
--
Kenneth Brody
More information about the Filepro-list
mailing list