Thanks, so far I notice most solution people using are loading some kind of browser module in order to simulate the web browsing in order to take the data. At least in Perl, there is a module which simulate firefox to achieve this.
This is doable, but really takes more effort and need furtfurther study
The problem of using simple wget is the content of the page are generated with AJAX javascript. So when I use the simple wget, it gets the html structure but nothing of the content which I need to parse.
use wget and curl to get the message of ajax. you know ajax is a method of communication right? you can easily find the url of the data source, it is easier than crawling the web page, as the contents are always formatted in json or xml.