HTML Parser
Posted: Wed Jan 14, 2009 10:16 pm
I'm writing a conversion program to convert HTML to RTF. I'm looking for an HTML parser so that I can replace all HTML tags with RTF internal codes.
I have written several sample program but it's very slow when processing 30,000+ lines of HTML file. I used two different methods. First, I loaded all possible HTML tags in an array as a lookup table; Second, I used ISAM database as a lookup table but both methods are very slow.
Others have suggested using REGULAR EXPRESSION but most of the articles online about this topic are just an in depth explanation of what it is but it never show you the source code.
Please...simple explanation only that a 1st grader can understand...straight to the point. Using big words only conveys to people that one has no clue what the question is. LOL
HELP!!!
I have written several sample program but it's very slow when processing 30,000+ lines of HTML file. I used two different methods. First, I loaded all possible HTML tags in an array as a lookup table; Second, I used ISAM database as a lookup table but both methods are very slow.
Others have suggested using REGULAR EXPRESSION but most of the articles online about this topic are just an in depth explanation of what it is but it never show you the source code.
Please...simple explanation only that a 1st grader can understand...straight to the point. Using big words only conveys to people that one has no clue what the question is. LOL
HELP!!!