Finally got it working where it does what I most need it to do: I can upload a Zipped folder of Internet shortcut files (.URL), and it unzips the file into a secure location and parses the .URL files, grabbing the first .jpg file out of the page and returns it with a proper href tag back to the originating webpage. Also presents a list of pages that didn't have any found .jpgs to allow quick and easy search for those missing images.

Limitations: only recognizes .jpgs now. Reason: too many .gif files in the LJ layout. Need to restrict the search to just inside the "content" HTML comment. Also need to allow for image files without extensions, but theoretically the first image link inside the content field should be close enough for government work.

Also doesn't yet parse f-locked posts (as about 10% of the links I get are retroactively locked). Also doesn't know how to handle non-.URL files in the archive, but this is sooooo small potatoes. I wanna have this up-n-running by the end of the week, but don't mark your calendars just yet.

And now, the real reason you're here. Light load; these are leftovers that the first automated post choked on.


Note: de-hotlinked.

















Tags:
.

Profile

sigma7: Sims (Default)
sigma7

Most Popular Tags

Powered by Dreamwidth Studios

Style Credit

Expand Cut Tags

No cut tags