robinturner: (Default)
[personal profile] robinturner
Well this is puzzling. I'm working on a quick dirty Perl hack for downloading my journal (comments and all). It goes like this:

#!/usr/bin/perl
use LWP::Simple;
for ($count=579; $count<600; $count++) {
$head="http://www.livejournal.com/talkread.bml?journal=solri&itemid=";
$url=$head . $count;
$content = get($url);
print "$content \n";
}

Of course the last bit will be changed to append to a file, rather than fill the terminal with HTML. The problem with this method is that most itemids aren't used (so you download zillions of error pages), and I can't see a pattern for the one's which are used. I mean, can anyone see anything meaningful in this sequence?

76946
77116
77555
77741

OK, the numbers get bigger, but that's not much help. Of course I could include a search string for "No such entry" and not print that to the file, but I'd still waste time downloading a few hundred error messages for each journal entry.
If you don't have an account you can create one now.
HTML doesn't work in the subject.
More info about formatting

If you are unable to use this captcha for any reason, please contact us by email at support@dreamwidth.org

Profile

robinturner: (Default)
Robin Turner

June 2014

M T W T F S S
      1
2345678
9101112131415
16171819202122
232425 26272829
30      

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags