Page 1 of 1

Can not get the HTML correctly

Posted: 2002-10-24 09:33:34
by changjunk
Well, I'm in Korea and we use http://films.hitel.net/ for Movie data base.

The trouble is happened due to that site use '&lt' for '<' and '&gt' for '>'.
The strang thing is happended when I try to get page of below URL.

http://films.hitel.net/Search/index.htm ... =Notorious

There are some more text after '&lt' However HTML page is trancated after '&lt'

More strange thing is when I get page of following URL, everything is O.K. including '&lt' and '&gt'

http://films.hitel.net/film.php?film=A9041

My Moviecatalog.exe is v3.4.10 and AMCExchang.dll is v1.0.1.1

Would you fix it? :( :(

Posted: 2002-10-24 10:54:00
by antp
It is a problem from the site, I cannot correct that.
Actually on the page there is a character that has the value "0" (ASCII value 0) which should not be there, just before the &lt
In C and in some case in Pascal (what I use) the end of text string is a character that has the value 0. That's why the page is cut : it is received as a text string, and while converting this string to a Pascal-style string (which does not need a 0 to find end of string) a 0 is found, so the program thinks it's the end of the string. I cannot correct that, except by getting the page as a file instead of a text string, but I do not really want to do that...
Does the problem happen with all pages or only with this one ?

Yes, they use a lot of &lt

Posted: 2002-10-24 12:43:30
by changjunk
I believe that they make trouble when I heard from you. And they like to show '<' '>' charater much as to identify AKA(As Known As) for every movie title. I believe that you can can image how many AKA for movie.

Could you make some special tool for Korean? :cry:

Posted: 2002-10-24 12:56:46
by antp
The problem is not related to the < > symbols, there is not reason for that... I guess that's a problem in the way they "build" the page.
I will try to make tests.
Actually I may correct the problem in a future version, should not be so difficult.

Hopefully soon...

Posted: 2002-10-24 13:10:30
by changjunk
I hope that you can find a way soon. :wink:

Best Wishes,
Changjunk