Can not get the HTML correctly

If you need help on how to use the program
Post Reply
changjunk

Can not get the HTML correctly

Post by changjunk »

Well, I'm in Korea and we use http://films.hitel.net/ for Movie data base.

The trouble is happened due to that site use '&lt' for '<' and '&gt' for '>'.
The strang thing is happended when I try to get page of below URL.

http://films.hitel.net/Search/index.htm ... =Notorious

There are some more text after '&lt' However HTML page is trancated after '&lt'

More strange thing is when I get page of following URL, everything is O.K. including '&lt' and '&gt'

http://films.hitel.net/film.php?film=A9041

My Moviecatalog.exe is v3.4.10 and AMCExchang.dll is v1.0.1.1

Would you fix it? :( :(
antp
Site Admin
Posts: 9668
Joined: 2002-05-30 10:13:07
Location: Brussels
Contact:

Post by antp »

It is a problem from the site, I cannot correct that.
Actually on the page there is a character that has the value "0" (ASCII value 0) which should not be there, just before the &lt
In C and in some case in Pascal (what I use) the end of text string is a character that has the value 0. That's why the page is cut : it is received as a text string, and while converting this string to a Pascal-style string (which does not need a 0 to find end of string) a 0 is found, so the program thinks it's the end of the string. I cannot correct that, except by getting the page as a file instead of a text string, but I do not really want to do that...
Does the problem happen with all pages or only with this one ?
changjunk

Yes, they use a lot of &lt

Post by changjunk »

I believe that they make trouble when I heard from you. And they like to show '<' '>' charater much as to identify AKA(As Known As) for every movie title. I believe that you can can image how many AKA for movie.

Could you make some special tool for Korean? :cry:
antp
Site Admin
Posts: 9668
Joined: 2002-05-30 10:13:07
Location: Brussels
Contact:

Post by antp »

The problem is not related to the < > symbols, there is not reason for that... I guess that's a problem in the way they "build" the page.
I will try to make tests.
Actually I may correct the problem in a future version, should not be so difficult.
changjunk

Hopefully soon...

Post by changjunk »

I hope that you can find a way soon. :wink:

Best Wishes,
Changjunk
Post Reply