./ask testdata/Our_Team/Spanish_language_raw.txt 6
./answer testdata/Our_Team/Spanish_language_raw.txt testdata/Our_Team/Spanish_language_question.txt
02/20/2016 - wenyanh
-
I have uploaded four wiki raw files we once used for our questions asking & answering in txt format.
-
The wikiPretreament.py is used for pretreatment the wiki file into an operable txt file for your further steps. It is in the command line format for your file input and output. And I have produced 4 data files (data1-4) for you to use. Contact me directly if there is any problem.
-
I tried to fetch the main information from the website directly but haven't found a good way for the data cleaning. Now the code is in the txtFetch.py. It would be better if you could give me some advice. Or, need we complete this function?
02/22/2016 - xc2
- something have to do: easy question answer: no
- Proper Noun