Commit Graph

1660 Commits

Author SHA1 Message Date
kelson42 519eebccd2 + continue the implementation of ctpp2 2011-03-14 21:39:45 +00:00
kelson42 045717117e + first steps for ctpp2 usage 2011-03-14 21:09:06 +00:00
kelson42 8f11852627 + GPLv3 headers 2011-03-12 13:19:36 +00:00
kelson42 1edd7ebd01 + add checksum checks methods in the kiwix::reader and in the XPOM zimAccessor component 2011-03-04 14:05:59 +00:00
kelson42 ddea078734 + if no title is found in the HTML, get the one of the ZIM file 2011-02-15 16:43:48 +00:00
kelson42 e19e53e373 Fix a small bug in the indexing code to avoid forgetting around 100 articler per ZIM file 2011-01-29 20:59:30 +00:00
kelson42 fa0add82ac * Fix regressions responsible for lower case titles in result list (ID: 3147594) 2010-12-29 16:12:51 +00:00
kelson42 4deacdd923 + additional work to the clucene backend... it still does not work :( 2010-11-17 20:58:19 +00:00
kelson42 6eb3da53cc + include "time.h" 2010-11-16 17:58:59 +00:00
kelson42 e3afa2339b + Fix the initialisation of the random article url generator (ID: 3109548) 2010-11-15 16:23:39 +00:00
kelson42 3d10562634 + add the "verbose" arg. to searchInIndex() 2010-11-07 10:42:50 +00:00
kelson42 34953af21b + small test version of the CluceneReader 2010-11-06 16:44:05 +00:00
kelson42 e7923b75e9 + stub of Clucene search code (still does not work) 2010-11-06 16:03:32 +00:00
kelson42 92aec4e612 + factorization of the Searcher class to avoid multiple backends (preparation work for CluceneSearcher class and CluceneAccessor XPCOM) 2010-11-06 11:47:13 +00:00
kelson42 e48d5b5a24 + introduce standart boost factor for title & keywords during the index process 2010-11-05 14:51:36 +00:00
kelson42 4484623e67 + better indentation 2010-11-02 19:13:21 +00:00
kelson42 b4ab94e4c0 + first version working of the clucene indexer class 2010-11-02 19:00:37 +00:00
kelson42 dd4913a77c + fix a memory leak (caused by a regression during the code refactoring) 2010-11-02 16:19:40 +00:00
kelson42 69586116ed + small fix in #include to avoid compilation error 2010-10-31 07:53:32 +00:00
kelson42 db246168b1 + stub of the cluceneindexer class 2010-10-30 21:30:29 +00:00
kelson42 c003035a5e + better factori. of the indexer code 2010-10-30 21:26:14 +00:00
kelson42 ef2423b1a7 + factorizing of the kiwix::Indexer class in kiwix::Indexer (abstract) and kiwix::XapianIndexer 2010-10-27 19:23:05 +00:00
kelson42 d52c86bcac * Fix an URL parsing issue impacting title containing a '/' (ID: 3065286) 2010-09-13 19:59:51 +00:00
kelson42 459e6550b5 + fix for the wikipedia-de 2010-07-16 19:29:58 +00:00
kelson42 d968ac7114 + add a stub HTML header/footer if the content only have the <body>
.--Cette ligne, et les suivantes ci-dessous, seront ignorées--

M    reader.cpp
2010-07-16 18:14:32 +00:00
kelson42 c9a81bf79b + add the --verbose|-v option to kiwix-index 2010-07-15 19:11:43 +00:00
kelson42 5891b91433 + getMetatag*( 2010-07-04 18:11:25 +00:00
kelson42 3555e4f918 + tokenize the search query with Xapian::Query::parser_query() and not anymore with split() 2010-07-04 15:55:12 +00:00
kelson42 4178289614 + fix for malayalam 2010-06-27 16:31:48 +00:00
kelson42 3735f31946 + remove debug comments 2010-06-12 17:15:52 +00:00
kelson42 a69aac50bc + fix a few things for the 0.9 alpha4 2010-06-12 17:13:57 +00:00
kelson42 19ac4741e6 + fix the problem with failing accented searches on windows 2010-05-25 17:11:05 +00:00
kelson42 624547d8ef + remove debug code 2010-05-16 15:42:57 +00:00
kelson42 eedce3b363 + new printStringInHexadecimal debug methof for UnicodestringS 2010-05-16 14:19:54 +00:00
kelson42 eb12d13f59 + printStringInHexadecimal() for debugging purpose 2010-05-16 12:52:14 +00:00
kelson42 e6f0e38677 + remove a compilation warning 2010-04-28 15:16:17 +00:00
kelson42 c1a31db217 + remove an include 2010-04-18 07:58:10 +00:00
kelson42 31dc311f2a + improved version of removeAccents(9 2010-04-16 13:33:50 +00:00
kelson42 27ee27bbe9 + replace libunac by libicu 2010-04-16 12:28:42 +00:00
kelson42 40042efe06 + small imp. 2010-04-14 16:27:52 +00:00
kelson42 9d92cec52d + small beautify 2010-04-14 16:27:11 +00:00
kelson42 cdd43db32d zim::Article currentArticle better local declared 2010-04-09 09:42:55 +00:00
kelson42 c22049ea8c + stopwords code stub 2010-04-06 08:21:26 +00:00
kelson42 fe1095082c + remove debug outputs 2010-03-25 08:49:20 +00:00
kelson42 83bcb034b8 + small refactoring 2010-03-24 20:28:36 +00:00
kelson42 22f701387e + reader::getId() returns a printable string 2010-01-28 14:42:58 +00:00
kelson42 d942d614e7 add double quote in the tokenizer separator character list (see ID: 2939690) 2010-01-26 11:21:10 +00:00
kelson42 0f4d6eb371 + lowercase the search partern in search.[cpp|h] 2010-01-25 11:28:32 +00:00
kelson42 dc9e92e1e6 + factorisation of the code, creation of the kiwix::Searcher class 2010-01-23 12:42:08 +00:00
kelson42 0eba48c200 + move of src files 2010-01-23 12:27:40 +00:00