Commit Graph

482 Commits

Author SHA1 Message Date
kelson42 1fd0314f1e + small fix to avoid system error by "killing" the indexing 2012-04-07 20:27:56 +00:00
kelson42 6e5d8f717e + clucene fix for the new indexer 2012-04-07 19:31:11 +00:00
kelson42 5aacc01d65 + imp. of the new indexing process 2012-04-07 15:00:15 +00:00
kelson42 65b4015f03 + new indexer 2012-04-07 14:09:47 +00:00
kelson42 62daa9ffe5 + new indexer code 2012-04-06 21:48:43 +00:00
kelson42 735d9afd3a + imp. of the new indexer 2012-04-03 10:55:49 +00:00
kelson42 9ead81ca0c + compute "default" title if not specified in the ZIM file metadata 2012-04-02 19:52:19 +00:00
kelson42 2ae7112a32 + fix small bug in the new indexer 2012-04-02 19:49:16 +00:00
kelson42 9e8d6f3c25 + first working version of the multithreaded indexer 2012-04-02 18:25:10 +00:00
kelson42 6e66fd176d + further dev of the multithreader indexer 2012-03-29 13:33:15 +00:00
kelson42 e930f65297 + further multithreaded indexer dev. 2012-03-29 09:42:35 +00:00
kelson42 20daa92b65 + start of the work on the multithreading indexer 2012-03-28 14:59:57 +00:00
kelson42 81424fedf3 + new function def. for the indexer 2012-03-15 19:18:48 +00:00
kelson42 e6299aaeab + stub of new functions 2012-03-15 18:41:46 +00:00
kelson42 1455c1bfbc FIXED: Library filtering by language with ISO639-1 language ZIM metadata 2012-03-08 21:13:09 +00:00
kelson42 288b0018fa + exception include 2012-03-07 17:25:33 +00:00
kelson42 b17f6a6d01 FIXED: Kiwix-serve sporadic invisible search box with non-latin contents 2012-02-27 17:57:26 +00:00
kelson42 8f89edf17c + fix bug in regex insertion (string was inserted in wrong place) 2012-02-27 17:25:01 +00:00
kelson42 bc8a400fd1 FIXED: Number of articles to high in the library (ID: 3396763) 2012-02-27 11:32:06 +00:00
kelson42 0b50729e03 + fixed compilation with clucene 2.3 2012-02-07 23:41:03 +00:00
kelson42 1fb5194906 + new getCurrentBook() method 2012-01-24 12:45:22 +00:00
reg_ 19c355241c fixed OSX indexing never ending 2011-12-22 14:07:28 +00:00
kelson42 d0d08c4f96 + better default page loading 2011-12-22 11:54:19 +00:00
kelson42 fb1d0893e3 + remove useless comment 2011-12-22 11:25:59 +00:00
kelson42 72e62fab6b + move appendToFirstOccurence() to regexTools.cpp and use now ICU as regex engine 2011-12-22 11:25:24 +00:00
kelson42 73217d5e49 + small bug fix for windows 2011-12-20 20:11:56 +00:00
kelson42 784f87e715 + port to windows 2011-12-17 14:59:43 +00:00
kelson42 4b545b8ae2 2011-12-12 14:49:49 +00:00
kelson42 faff227365 + port makeDirectory to WIN32 2011-12-10 18:52:11 +00:00
kelson42 8df9c53878 + fix the implementation of the arrows in the search restult page 2011-12-04 15:11:38 +00:00
kelson42 aa9ebdb87e + * Add "First" and "Last" arrows in the search result page (ID: 3442313) 2011-12-04 13:40:22 +00:00
kelson42 d851d065e8 + small modification in the behaviour of addBook() 2011-11-28 10:50:57 +00:00
kelson42 74caed4811 Fix a wrong behaviour in case of two portable kiwix consecutive runs (ID: 3442083) 2011-11-26 09:32:02 +00:00
kelson42 fb12ac5eb1 + small improvment of getMainPageUrl() 2011-11-26 07:14:09 +00:00
kelson42 9d4fd28ba6 * In the library, pressing <ENTER> in your local library directly open the file
* Greying "Load random article" menu entry if no content is loaded (ID: 3439112)
* Add the support of the "publisher" ZIM Metadata
2011-11-18 14:25:21 +00:00
kelson42 97548d4798 + better deal by SetIndexBook() and SetBookPath() with relative paths.
+ Add addBookFromPathAndGetId() to know which book id has the added book
2011-11-01 15:07:30 +00:00
kelson42 bf58cf6b76 + small fix to be able to save relative path in addBookFromPath() 2011-11-01 12:58:34 +00:00
kelson42 79d00c2b36 + small fix to avoid duplicate slashes in paths 2011-11-01 12:58:00 +00:00
kelson42 5922808e12 + copyFile() & getLastPathElement() 2011-11-01 12:25:33 +00:00
kelson42 f2280a6b02 + bool makeDirectory(const string &path); 2011-11-01 10:57:51 +00:00
kelson42 cfa00c33f2 + fix bug in path computation in computeAbsolutePath() 2011-11-01 10:04:16 +00:00
kelson42 9d371f295c + use libicu in regexTools 2011-10-31 09:55:34 +00:00
kelson42 9c9b32f897 + fixed a regression I added one week ago 2011-10-31 09:46:38 +00:00
kelson42 5bd84c8db7 + add the fileExists() function 2011-10-28 16:13:44 +00:00
kelson42 400cf57343 + remove a few warnings with better parentheses and a cast 2011-10-28 16:06:19 +00:00
kelson42 98bf706297 + better factorisation of the code (moved from manager.cpp to pathTools.cpp) -> creation of functions getFileSize() and getFileSizeAsString() 2011-10-28 15:54:31 +00:00
kelson42 9182f4e529 + reorganisation of the code: creation of componentTools.[h|cpp] with two methods for charset manipulation (only used in contentManager XPCOM) and move of 3 path manipulations functions from manager.cpp to pathTools.cpp 2011-10-28 15:40:17 +00:00
kelson42 7e00a3339d + introduction of ternary operators 2011-10-24 13:26:53 +00:00
kelson42 8dce6d645e + remove useless string init. (string foobar="") 2011-10-24 13:13:50 +00:00
kelson42 c40a87837e + less if, more efficient 2011-10-24 12:55:12 +00:00
kelson42 1e9bfa168d Better kiwix-serve default loading if no welcome page is defined in ZIM (ID: 3417227) 2011-10-24 12:50:27 +00:00
kelson42 72b8527654 + add regex cache 2011-10-23 14:16:03 +00:00
kelson42 a1dfb3c9fd + New content manager search/filter feature (ID: 3390515) 2011-10-13 11:31:20 +00:00
kelson42 e33b448380 + fix the generation of the lastopen list of files 2011-10-13 10:35:56 +00:00
kelson42 aadce46adf + improve filter 2011-10-11 22:22:14 +00:00
kelson42 5cd20c8fa7 + code to pupulate publisher list in the content manager 2011-10-11 20:33:08 +00:00
kelson42 8b5559af2b + code to populate the language filter combobox in the content manager 2011-10-11 18:47:08 +00:00
kelson42 0f3c1e2888 + filter by size 2011-10-10 17:05:36 +00:00
kelson42 d11f027fbe + content manager sortBy 2011-10-10 14:24:32 +00:00
kelson42 8f43bcc954 * Fixing a bug responsible of masking ongoing downloads while deleting an other content 2011-09-18 14:13:19 +00:00
kelson42 cf07bf827b + better deal with accented paths 2011-09-05 19:21:58 +00:00
kelson42 6959085e3c + better deal with accented paths 2011-09-05 18:53:02 +00:00
kelson42 1c2037e5fa + better deal with accented paths 2011-09-05 18:24:21 +00:00
kelson42 1475646cf3 + pathTools 2011-09-05 18:18:41 +00:00
kelson42 a1c38f8d74 + fix to deal with relative path on Windows 2011-09-04 13:48:26 +00:00
kelson42 4fbf78164a + fix a bug with index relative paths 2011-09-04 13:07:04 +00:00
kelson42 e06af8e68f + improvement of isRelativePath() 2011-09-03 14:50:11 +00:00
kelson42 39c44c578c + first version to deal with relative paths 2011-08-31 05:58:00 +00:00
kelson42 215300f206 + load content manager per default if not local content detected (first start use case) 2011-08-25 06:39:53 +00:00
kelson42 a596427a5e + display start=n+1 in search result pat if offset=n (people start to count at 1, not 0) 2011-08-16 19:28:11 +00:00
synhershko b7984c5138 Updating CLucene search and index routines 2011-08-03 07:48:31 +00:00
synhershko b146d87158 Updating clucene searcher 2011-08-02 16:14:53 +00:00
kelson42 4b6caa48ce + few additional check to avoid segfaults if ZIM file is not there 2011-07-27 19:25:25 +00:00
kelson42 97881fbbb6 + avoid the listing of ZIM in content manager as remote if url attribute is empty (obvious) 2011-07-21 19:47:12 +00:00
kelson42 8569f67c84 <base64.h> -> "../base64.h" 2011-07-21 18:45:34 +00:00
kelson42 a2e4ecc9f5 + kiwix-manage is not able anymore to add ZIM file without title || date || language metadata 2011-07-21 12:13:44 +00:00
kelson42 86dae55c18 + add alternative favicon urls 2011-07-07 11:54:43 +00:00
kelson42 5b679f6b1b + add "remove" button in the download manager 2011-06-28 18:52:20 +00:00
kelson42 2b50a1d505 + remove unistd.h 2011-06-08 19:39:20 +00:00
kelson42 2bcd992049 + remove debug code 2011-06-04 10:04:27 +00:00
kelson42 bf9bed56d7 + last improvement in the download manager 2011-06-02 15:43:46 +00:00
kelson42 acf0294e78 + favicon UI integration finished 2011-05-14 10:59:05 +00:00
kelson42 6ed6fbe171 + add support of favicon in library manager
+ fix a bug in the snippet lenght computation in the search engin indexer
2011-05-14 10:29:55 +00:00
kelson42 67d0340d77 + last content mgmt. imp. 2011-05-09 17:18:44 +00:00
kelson42 49dd576a8a + last imp. 2011-05-08 19:45:09 +00:00
kelson42 de807c8e78 + last imp. in new content mgmt. 2011-05-08 19:00:53 +00:00
kelson42 e6ff96ecb4 + content mgmt. last dev. 2011-05-08 18:45:26 +00:00
kelson42 804889d5dc + last imp. of the content manager 2011-05-08 16:44:58 +00:00
kelson42 35b4a1402f + content manager last imp. 2011-05-08 09:47:27 +00:00
kelson42 526aec41d5 + content mgmt last imp. : we have now something working as good as before but with C++ code 2011-05-07 20:07:24 +00:00
kelson42 e05dd65111 + last imp. in the new content manager 2011-05-07 15:33:33 +00:00
kelson42 ca1713d609 + last imp. of contentManager 2011-05-06 19:45:25 +00:00
kelson42 c7fbc52e7a + new core content mgmt imp. 2011-05-05 05:39:09 +00:00
kelson42 6cfc716c84 + new dev in content manager core 2011-05-04 15:27:08 +00:00
kelson42 6b8112b88b + imp. of kiwix-manage 2011-04-20 19:20:51 +00:00
kelson42 355ca7057e + imp. of kiwix-manage 2011-04-20 18:47:01 +00:00
kelson42 08bcb43afb + kiwix-manage new devs. 2011-04-20 06:16:41 +00:00
kelson42 93d5daeb94 + next dev. of kiwix-manage 2011-04-17 17:12:49 +00:00
kelson42 2fae9e3b71 + readXML() 2011-04-15 17:43:08 +00:00
kelson42 97d38cd5ad + stub of kiwix-manage 2011-04-15 17:16:27 +00:00
kelson42 4b5126ab75 + template adaptation for using in kiwix-serve 2011-04-08 14:12:39 +00:00
kelson42 bfd78ed71c + possible now to navigate trhough all the results 2011-04-03 11:32:51 +00:00
kelson42 91dfb75b47 + selected only the current page in the page list at the bottom of each result page 2011-04-03 11:08:35 +00:00
kelson42 a52d290739 + replace "." by "," for numbers over 999 2011-04-02 18:20:33 +00:00
kelson42 709f32a872 + better able to deal with indexes without article size & wordCount 2011-04-02 17:50:54 +00:00
kelson42 db145fb4a7 + in getHtml(), fix the range end value if end > estimatedCount 2011-03-30 21:10:20 +00:00
kelson42 0f1bfed262 + fix a bug in searcher::getPageUrlFromTitle() 2011-03-29 15:16:18 +00:00
kelson42 91212bdb13 + last dev. on the search feature before beta1 2011-03-26 15:29:56 +00:00
kelson42 f7e842db3a + remove a compilation warning 2011-03-24 12:03:36 +00:00
kelson42 71c3bda0cc + remove a few compile warnings 2011-03-24 11:26:58 +00:00
kelson42 bd5205688a + better fix the the multibyte character issue. 2011-03-24 07:24:23 +00:00
kelson42 939155a9cb + fix a bug occuring if cutting the snippet in the middle of a multibyte character 2011-03-24 07:15:17 +00:00
kelson42 41217c22d8 + search page navigation 2011-03-22 21:04:41 +00:00
kelson42 8b291cd219 + beautifyInteger() 2011-03-22 18:27:20 +00:00
kelson42 ef3202dc02 + better search results 2011-03-20 18:47:14 +00:00
kelson42 5056c59747 + imp. of the search results display 2011-03-20 17:59:31 +00:00
kelson42 51b605906a + load the result template automaticaly from the skin directory 2011-03-19 10:48:23 +00:00
kelson42 3f2f79d40f + introduction of estimatedResultCount in template 2011-03-18 15:13:57 +00:00
kelson42 8878f4e8f3 + getEstimatedResultCount() 2011-03-18 13:51:41 +00:00
kelson42 5e162d6f47 + removing of the result sidebar 2011-03-18 07:07:44 +00:00
kelson42 9e8ebdaf6e + stub template for the result page 2011-03-15 21:23:12 +00:00
kelson42 fb570a6a9d + ctpp2 integration continues... it starts to work 2011-03-15 20:59:59 +00:00
kelson42 cebae645e6 + new improvements in ctpp2 integration 2011-03-14 22:50:50 +00:00
kelson42 519eebccd2 + continue the implementation of ctpp2 2011-03-14 21:39:45 +00:00
kelson42 045717117e + first steps for ctpp2 usage 2011-03-14 21:09:06 +00:00
kelson42 8f11852627 + GPLv3 headers 2011-03-12 13:19:36 +00:00
kelson42 1edd7ebd01 + add checksum checks methods in the kiwix::reader and in the XPOM zimAccessor component 2011-03-04 14:05:59 +00:00
kelson42 ddea078734 + if no title is found in the HTML, get the one of the ZIM file 2011-02-15 16:43:48 +00:00
kelson42 e19e53e373 Fix a small bug in the indexing code to avoid forgetting around 100 articler per ZIM file 2011-01-29 20:59:30 +00:00
kelson42 fa0add82ac * Fix regressions responsible for lower case titles in result list (ID: 3147594) 2010-12-29 16:12:51 +00:00
kelson42 4deacdd923 + additional work to the clucene backend... it still does not work :( 2010-11-17 20:58:19 +00:00
kelson42 6eb3da53cc + include "time.h" 2010-11-16 17:58:59 +00:00
kelson42 e3afa2339b + Fix the initialisation of the random article url generator (ID: 3109548) 2010-11-15 16:23:39 +00:00
kelson42 3d10562634 + add the "verbose" arg. to searchInIndex() 2010-11-07 10:42:50 +00:00
kelson42 34953af21b + small test version of the CluceneReader 2010-11-06 16:44:05 +00:00
kelson42 e7923b75e9 + stub of Clucene search code (still does not work) 2010-11-06 16:03:32 +00:00
kelson42 92aec4e612 + factorization of the Searcher class to avoid multiple backends (preparation work for CluceneSearcher class and CluceneAccessor XPCOM) 2010-11-06 11:47:13 +00:00
kelson42 e48d5b5a24 + introduce standart boost factor for title & keywords during the index process 2010-11-05 14:51:36 +00:00
kelson42 4484623e67 + better indentation 2010-11-02 19:13:21 +00:00
kelson42 b4ab94e4c0 + first version working of the clucene indexer class 2010-11-02 19:00:37 +00:00
kelson42 dd4913a77c + fix a memory leak (caused by a regression during the code refactoring) 2010-11-02 16:19:40 +00:00
kelson42 69586116ed + small fix in #include to avoid compilation error 2010-10-31 07:53:32 +00:00
kelson42 db246168b1 + stub of the cluceneindexer class 2010-10-30 21:30:29 +00:00
kelson42 c003035a5e + better factori. of the indexer code 2010-10-30 21:26:14 +00:00
kelson42 ef2423b1a7 + factorizing of the kiwix::Indexer class in kiwix::Indexer (abstract) and kiwix::XapianIndexer 2010-10-27 19:23:05 +00:00
kelson42 d52c86bcac * Fix an URL parsing issue impacting title containing a '/' (ID: 3065286) 2010-09-13 19:59:51 +00:00
kelson42 459e6550b5 + fix for the wikipedia-de 2010-07-16 19:29:58 +00:00
kelson42 d968ac7114 + add a stub HTML header/footer if the content only have the <body>
.--Cette ligne, et les suivantes ci-dessous, seront ignorées--

M    reader.cpp
2010-07-16 18:14:32 +00:00
kelson42 c9a81bf79b + add the --verbose|-v option to kiwix-index 2010-07-15 19:11:43 +00:00
kelson42 5891b91433 + getMetatag*( 2010-07-04 18:11:25 +00:00
kelson42 3555e4f918 + tokenize the search query with Xapian::Query::parser_query() and not anymore with split() 2010-07-04 15:55:12 +00:00
kelson42 4178289614 + fix for malayalam 2010-06-27 16:31:48 +00:00
kelson42 3735f31946 + remove debug comments 2010-06-12 17:15:52 +00:00
kelson42 a69aac50bc + fix a few things for the 0.9 alpha4 2010-06-12 17:13:57 +00:00
kelson42 19ac4741e6 + fix the problem with failing accented searches on windows 2010-05-25 17:11:05 +00:00
kelson42 624547d8ef + remove debug code 2010-05-16 15:42:57 +00:00
kelson42 eedce3b363 + new printStringInHexadecimal debug methof for UnicodestringS 2010-05-16 14:19:54 +00:00
kelson42 eb12d13f59 + printStringInHexadecimal() for debugging purpose 2010-05-16 12:52:14 +00:00
kelson42 e6f0e38677 + remove a compilation warning 2010-04-28 15:16:17 +00:00
kelson42 c1a31db217 + remove an include 2010-04-18 07:58:10 +00:00
kelson42 31dc311f2a + improved version of removeAccents(9 2010-04-16 13:33:50 +00:00
kelson42 27ee27bbe9 + replace libunac by libicu 2010-04-16 12:28:42 +00:00
kelson42 40042efe06 + small imp. 2010-04-14 16:27:52 +00:00
kelson42 9d92cec52d + small beautify 2010-04-14 16:27:11 +00:00
kelson42 cdd43db32d zim::Article currentArticle better local declared 2010-04-09 09:42:55 +00:00
kelson42 c22049ea8c + stopwords code stub 2010-04-06 08:21:26 +00:00
kelson42 fe1095082c + remove debug outputs 2010-03-25 08:49:20 +00:00
kelson42 83bcb034b8 + small refactoring 2010-03-24 20:28:36 +00:00
kelson42 22f701387e + reader::getId() returns a printable string 2010-01-28 14:42:58 +00:00
kelson42 d942d614e7 add double quote in the tokenizer separator character list (see ID: 2939690) 2010-01-26 11:21:10 +00:00
kelson42 0f4d6eb371 + lowercase the search partern in search.[cpp|h] 2010-01-25 11:28:32 +00:00
kelson42 dc9e92e1e6 + factorisation of the code, creation of the kiwix::Searcher class 2010-01-23 12:42:08 +00:00
kelson42 0eba48c200 + move of src files 2010-01-23 12:27:40 +00:00
kelson42 c9778c9097 + kiwix-server uses now kiwix::Reader 2010-01-20 20:01:44 +00:00
kelson42 fb2dfdd677 + factorization of the zim reader code 2010-01-20 19:46:13 +00:00
kelson42 793e51c86c + new trunk zimlib 2010-01-08 23:03:06 +00:00
kelson42 86597c856d + few imp. in the indexer code 2009-12-20 14:10:58 +00:00
kelson42 bc65a459d9 + move the html parser 2009-12-17 21:49:07 +00:00
kelson42 4464e611a9 + factorization of the indexing code 2009-12-17 21:48:16 +00:00
kelson42 c19ad61051 + small imp. 2009-11-15 17:26:17 +00:00
kelson42 8346cc7a23 + imp version 2009-11-14 16:07:01 +00:00
kelson42 482893ae39 + mutualization of unaccent code 2009-11-14 15:54:46 +00:00