I finally found a text file of Spanish words in CSV format (with the English translations too, of course).
I intended to clean it up a bit, but unfortunately the program imported 4880 items into the deck I already use, so now I'm cleaning it up as I go along. There are people's names, place names, and a few other things that I don't need at all, and a few terms may just be completely wrong, but I am checking terms more or less randomly and it seems pretty good.
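For what it's worth, here is a rough sketch of the kind of cleanup pass that could be run before importing. It is only an illustration under some assumptions: a two-column file with the Spanish term first and the English second, no header row, and the guess that people's names and place names are the entries starting with a capital letter (Spanish doesn't capitalize ordinary nouns). The function name `drop_proper_nouns` and the sample data are made up for the example.

```python
import csv
import io

def drop_proper_nouns(rows):
    """Keep rows whose Spanish term does not start with an uppercase letter.

    Assumes each row is [spanish, english]; this layout is a guess,
    not something confirmed about the real file.
    """
    kept = []
    for row in rows:
        if len(row) < 2:
            continue  # skip blank or malformed lines
        spanish = row[0].strip()
        if not spanish or spanish[0].isupper():
            continue  # likely a person's name or a place name
        kept.append([spanish, row[1].strip()])
    return kept

# Tiny made-up sample standing in for the real CSV file.
sample = "casa,house\nMadrid,Madrid\nPedro,Peter\nárbol,tree\n"
rows = list(csv.reader(io.StringIO(sample)))
cleaned = drop_proper_nouns(rows)
print(cleaned)
```

This wouldn't catch terms that are simply wrong, and it would also throw out any legitimate entry that happens to be capitalized, so spot-checking would still be needed.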
I also have a text file with almost 20,000 words, but I'll be extra careful with that one. I don't need it mixing with my current deck quite yet.
I have this little wish, when I start projects like this, that I could get everything nice and clean. The 5000 most frequently used words, alphabetized and perhaps with some semantic info. But life never seems to happen like that. It's all messy, but I am still seeing improvement, so I am not too worried about it.
Anyway, it helps to have something to do when I'm restless, which seems to be a lot these days.