Latent Semantic Indexing (LSI): What are the capacity planning (Disk, type of DB, processing) issues to roll out a system based on LSA?
-
a). Assume 10,000 domain-specific documents. b) Incremental and periodic addition of say 100+ documents. c) What type of database to use? d) What are the expected pre-processing tasks and how much processing capacity needed for these pre-processing tasks e) how about cache and processing needs for latency Radim gives an example for English Wikipedia here: http://radimrehurek.com/gensim/wiki.html
-
Answer:
We use PostgrisSQL for Theme Zoom's Krakken. Pre-processing is extensive. http://www.themezoom.com - Russell Wright and the Theme Zoom team
Russell Wright at Quora Visit the source
Related Q & A:
- What Is Man Power Planning?Best solution by Stack Overflow
- What is the difference between Generic type and Wildcard type?Best solution by stackoverflow.com
- What type of digital camera would be best for a serious beginner?Best solution by Yahoo! Answers
- I have a system in my car. What kind of speakers are better?Best solution by Yahoo! Answers
- What is operational capacity?Best solution by en.wikipedia.org
Just Added Q & A:
- How many active mobile subscribers are there in China?Best solution by Quora
- How to find the right vacation?Best solution by bookit.com
- How To Make Your Own Primer?Best solution by thekrazycouponlady.com
- How do you get the domain & range?Best solution by ChaCha
- How do you open pop up blockers?Best solution by Yahoo! Answers
For every problem there is a solution! Proved by Solucija.
-
Got an issue and looking for advice?
-
Ask Solucija to search every corner of the Web for help.
-
Get workable solutions and helpful tips in a moment.
Just ask Solucija about an issue you face and immediately get a list of ready solutions, answers and tips from other Internet users. We always provide the most suitable and complete answer to your question at the top, along with a few good alternatives below.