Work to be done:
modify the code for Linux environment
There are view improvements that can be made to improve the support for Arabic words and to speed up word search. Here are our plans:
Function name: ArbStem (arbWord as Unicode string)
Returns Arabic word in its stem format.
Arabic words have suffix and prefix, so we need a function to remove suffix and prefix for Arabic words. Then store the word in the dictionary and use it in its functions. The function should read from two files that have list of suffix and prefix.
You do not need to know Arabic. Just get the information about Arabic coding in Unicode table, and treat the Arabic string as a sequence of characters.
Integrate the code with OpenOffice