i believe they've shared data with norganna. not sure if they're into sharing with others or what. since they get scraped anyways, you'd think a low bandwidth version of their data would save everybody a ton of trouble.
i'm wondering if my problem is related to them trying to detect my browser so they can format appropriately (like for a phone or something). the data i'm not getting is the tabbed columns of things like milling data (view a pigment and then collect which herbs it comes from).
i don't know if libpt's data scanner serves its own purposes, but the code used to get data from wowhead doesn't work for me. it used to.
basically, i know squat about how one goes about collecting data from a web site. so i just copied the basic code from the libpt miner. i'm basically calling "getpage()" which takes a url and gives you a big text string representation of the html page located at that address.
but the results from getpage() don't contain that info. it USED to, but i'm thinking maybe there's some token being fed to the server that indicates the browser i'm using to assist in formatting. that's the only reason i can figure why one means of collecting the data fails where the other succeeds.
i'll see about setting up irc, but i'm not really "at my computer" at the moment.
edit: maybe i'll try it on my mac. i'm thinking it might be related to different installed libs and interactions with the web...