IMDb doesn't always work
IMDb doesn't always work
I've tried importing a few movies from IMDb. I notice that the "Description" field doesn't get downloaded anymore and for many sites I get the error "Read Timeout".
Is this because of something that www.imdb.com is doing? I've started trying to debug the script. I've had a little success in changing the URL to http://us.imdb.com/find?s=all;q=...
Some examples where I have problems include:
- Reeker (after changing to ?tt=1; to ?s=all;, on line 737 it worked)
- Date Movie (change above doesn't matter).
When I enter the URL directly in firefox for example it works. Is the backend dependent on the version of IE I'm running? It's IE7.
The error appears on line 18 in the script:
PageText := GetPage(Address);
The strange thing is, when I type the URL defined by "Address" into Firefox, it works correctly. e.g. "http://us.imdb.com/find?s=all;q=DATE%20MOVIE". It get's redirected to "http://us.imdb.com/title/tt0466342/".
Any ideas how I can continue? I'm a C-Coder for embedded, not really a Pascal Scripter... And I've never used Delphi before....
Is this because of something that www.imdb.com is doing? I've started trying to debug the script. I've had a little success in changing the URL to http://us.imdb.com/find?s=all;q=...
Some examples where I have problems include:
- Reeker (after changing to ?tt=1; to ?s=all;, on line 737 it worked)
- Date Movie (change above doesn't matter).
When I enter the URL directly in firefox for example it works. Is the backend dependent on the version of IE I'm running? It's IE7.
The error appears on line 18 in the script:
PageText := GetPage(Address);
The strange thing is, when I type the URL defined by "Address" into Firefox, it works correctly. e.g. "http://us.imdb.com/find?s=all;q=DATE%20MOVIE". It get's redirected to "http://us.imdb.com/title/tt0466342/".
Any ideas how I can continue? I'm a C-Coder for embedded, not really a Pascal Scripter... And I've never used Delphi before....
Any ideas how I can investigate this further? I've had this for quite some time now (few months). I've got some freetime at last. Unfortunately, I can't go deeper than the line "GetPage(Address)" to find out what's going wrong. The error just says "Read Timeout" and it comes up pretty fast (within 1-2s) after downloading a chunk of at least 32768 bytes.
Taking the URL from the script debugger and putting this directly into Firefox, or IE7 (running XP SP2) brings up the page correctly.
I love this program, but right now I'm copying stuff over manually from the IMDB site. Changing us.imdb.com to www.imdb.com doesn't make a difference.
Just checked with a packet sniffer, and the data is being requested by AMC, the response is coming back. I'm now trying with the title "MY NAME IS MODESTY". I'm getting the same thing and can't download from IMDB, although the page is being requested and is also being downloaded. Let me know if you want what happened.
Taking the URL from the script debugger and putting this directly into Firefox, or IE7 (running XP SP2) brings up the page correctly.
I love this program, but right now I'm copying stuff over manually from the IMDB site. Changing us.imdb.com to www.imdb.com doesn't make a difference.
Just checked with a packet sniffer, and the data is being requested by AMC, the response is coming back. I'm now trying with the title "MY NAME IS MODESTY". I'm getting the same thing and can't download from IMDB, although the page is being requested and is also being downloaded. Let me know if you want what happened.
Last edited by jcurl on 2006-12-09 16:42:07, edited 1 time in total.
At a time some similar problems were solved by adding a pause of 1 or 2 seconds in the script (using the Sleep function if I remember well).
But that does not seem to happen to all users and in all cases. Maybe that when you do too many connexions in some time, IMDB blocks the IP address for few seconds.
But that does not seem to happen to all users and in all cases. Maybe that when you do too many connexions in some time, IMDB blocks the IP address for few seconds.
Just got a packet sniff when AMC does it's stuff. I can send this on if it helps.
I see:
GET /find?s=all;q=....
HTTP/1.1 302 Found
GET /title/tt0347591/?fr=...;fc=1;ft=20
User-Agent: Mozilla/5.0 (compatible; Ant Movie Catalog using Indy Library)
and I see the rest of the data being downloaded. Maybe the problem is how the page is handled by the Indy library?
I can add delays to the script, but the "Read Timeout" error is occurring on the first GetPage(Address) instruction in the script at line 18.
I see:
GET /find?s=all;q=....
HTTP/1.1 302 Found
GET /title/tt0347591/?fr=...;fc=1;ft=20
User-Agent: Mozilla/5.0 (compatible; Ant Movie Catalog using Indy Library)
and I see the rest of the data being downloaded. Maybe the problem is how the page is handled by the Indy library?
I can add delays to the script, but the "Read Timeout" error is occurring on the first GetPage(Address) instruction in the script at line 18.
Could there be a limit while downloading of 32768 bytes? The counter is stuck at 32k just before it says there is the "Read Timeout". I doubt it though, half of the other's work. So this problem doesn't occur all the time.
I guess, I don't have Delphi, so I won't be able to debug the software to at least figure out what's going on. Any further ideas? Any libraries to check? I noticed I can download a trial for 30 days of the Personal Version. Will this work?
I guess, I don't have Delphi, so I won't be able to debug the software to at least figure out what's going on. Any further ideas? Any libraries to check? I noticed I can download a trial for 30 days of the Personal Version. Will this work?
With the personal edition it may be possible to compile the program if you remove all code related to charts in the statistics window, though I am not sure sure that it will be possible to use all the additional components in a trial version (I guess that yes, but I am not sure).
So the error occurs even if you download info for one movie, and even on the first fetch (i.e. when fetching list of result) ?
So the error occurs even if you download info for one movie, and even on the first fetch (i.e. when fetching list of result) ?
The error occurs even if I download info for one movie and on the first fetch. The interesting thing for the movie "Reeker", it didn't work for the URL tt=1; but it did for s=all;. For the second case, it brought up a list of choices of which then it was downloadable.
However, for the movie "My Name Is Modesty" neither worked. What appears to be interesting, is when the GetPage(Address) results in something that is a list of matches, instead of going to the movie directly, I get the "Read Timeout" error. For example "Saw II" worked for me (producing instead a Read Timeout when trying to get the description of the movie).
Could there be a problem with redirects?
However, for the movie "My Name Is Modesty" neither worked. What appears to be interesting, is when the GetPage(Address) results in something that is a list of matches, instead of going to the movie directly, I get the "Read Timeout" error. For example "Saw II" worked for me (producing instead a Read Timeout when trying to get the description of the movie).
Could there be a problem with redirects?
-
- Posts: 1
- Joined: 2006-12-11 15:07:45
This works with me
I had your problem and i found a temporary solution for it.
Solution: Try to disable any Firewalls or traffic analyzers, and test the program
I'm using Kaspersky Antivirus which analyze any HTTP traffic, when i disable it every thing works fine.
Try and tell me.
Solution: Try to disable any Firewalls or traffic analyzers, and test the program
I'm using Kaspersky Antivirus which analyze any HTTP traffic, when i disable it every thing works fine.
Try and tell me.
Re: This works with me
Thankyou very much. I killed Kaspersky completely, and everything is working great. It's a temporary solution. If I ever get my hands on Delphi 7 (actually, I only found the trial Key from Borland's site, not a download), I'll look into it. Something is definitely weird going on.