libkiwix

Commit Graph

Author	SHA1	Message	Date
Emmanuel Engelhart	05cc3d015f	Insert root link only if html content	2021-05-14 14:49:28 +02:00
Veloman Yunkan	68189de162	/catalog/search handles out-of-bounds pagination	2021-05-10 11:25:06 +02:00
Veloman Yunkan	41276341d0	Empty query acts as a match-all query After switching to Xapian-based search in the library/catalog, an empty query stopped acting as a match-all query. This commit restores the old behaviour in that regard.	2021-05-09 15:14:43 +02:00
Maneesh P M	be6b58c6ad	Revert "added 204 code for empty return of search" Returning status code 204 in case of empty results doesn't show the empty results page as described in #466. Reverting the changes in #396 fixes the issue.	2021-05-09 10:47:18 +05:30
Emmanuel Engelhart	950e742116	No metalink file on fs	2021-05-04 13:15:43 +02:00
Veloman Yunkan	3879b82112	const-correct kiwix::Library - Made most methods of kiwix::Library const. - Also added const versions of getBookById() and getBookByPath() methods.	2021-04-28 11:42:55 +04:00
Veloman Yunkan	63e9a09259	Cleaned up/beautified Library::updateBookDB()	2021-04-27 16:59:21 +04:00
Veloman Yunkan	4178c169dd	Xapian documents in book DB store only the book id	2021-04-27 16:59:21 +04:00
Veloman Yunkan	f751aff2fb	Full case/diacritics insensitivity in catalog filtering Catalog filtering should now be case/diacritics insensitive for all fields. However it is not validated for language, name and category fields, and is validated for tags, creator & publisher only for text supplied in the filter (but not for values read from the book).	2021-04-27 16:59:21 +04:00
Veloman Yunkan	87dc9d2723	Made catalog filtering by query diacritics insensitive Catalog filtering by titles/description was sensitive to diacritics present in the query string. Fixed that. Also enhanced the unit test to validate the insensitivity to diacritics present in either the title/description or the query string.	2021-04-27 16:59:21 +04:00
Veloman Yunkan	9c7366890d	Catalog filtering by tags works via Xapian	2021-04-27 16:59:21 +04:00
Veloman Yunkan	19e195cb7d	Filter::Tags typedef	2021-04-27 16:59:21 +04:00
Veloman Yunkan	3d5fd8f585	Catalog filtering by creator works via Xapian	2021-04-27 16:59:21 +04:00
Veloman Yunkan	d3d5abe14d	Handling of non-words in publisher query This change fixes the failure of the LibraryTest.filterByPublisher unit-test broken by the previous commit. The previous approach used in `publisherQuery()` for building a phrase query enforcing the specified prefix for all terms fails if 1. the input phrase contains a non-word term that Xapian's query parser doesn't like (e.g. a standalone ampersand character, 1/2, a#1, etc); 2. the input phrase contains at least three terms that Xapian's query parser has no issue with. Using the `quest` tool (coming with xapian-tools under Ubuntu) the issue can be demonstrated as follows: ``` $ quest -o phrase -d some_xapian_db "Energy & security" Parsed Query: Query((energy@1 PHRASE 11 Zsecur@2)) Exactly 0 matches MSet: $ quest -o phrase -d some_xapian_db "Energy & security act" UnimplementedError: OP_NEAR and OP_PHRASE only currently support leaf subqueries $ quest -o phrase -d some_xapian_db 'Energy 1/2 security act' UnimplementedError: OP_NEAR and OP_PHRASE only currently support leaf subqueries $ quest -o phrase -d some_xapian_db "Energy a#1 security act" UnimplementedError: OP_NEAR and OP_PHRASE only currently support leaf subqueries ``` The problem comes from parsing the query with the default operation set to `OP_PHRASE` (exemplified by the `-o phrase` option in above invocations of `quest`). A workaround is to parse the phrase with a default operation of `OP_OR` and then combine all the terms with `OP_PHRASE`. Besides stemming should be disabled in order to target an exact phrase match (save for the non-word terms, if any, that are ignored by the query parser).	2021-04-27 16:59:21 +04:00
Veloman Yunkan	a759ab989f	Catalog filtering by publisher works via Xapian	2021-04-27 16:59:21 +04:00
Veloman Yunkan	7ccd9ffcce	Catalog filtering by language works via Xapian	2021-04-27 16:59:21 +04:00
Veloman Yunkan	0c0a37073b	Catalog filtering by category works via Xapian	2021-04-27 16:59:21 +04:00
Veloman Yunkan	415c65cf03	Catalog filtering by book name works via Xapian	2021-04-27 16:59:21 +04:00
Veloman Yunkan	8287f351e7	Final logic of Library::filterViaBookDB() Moved the `filter.hasQuery()` check inside `buildXapianQuery()`. `Library::filterViaBookDB()` only cares if the query that is going to be run on the book DB would match all documents. The rest of changes related to enhancing the usage of Xapian for the catalog search will happen inside `buildXapianQuery()` and `updateBookDB()`.	2021-04-27 16:59:21 +04:00
Veloman Yunkan	ea779ac200	Extracted buildXapianQuery()	2021-04-27 16:59:21 +04:00
Veloman Yunkan	80cd1fc989	Renamed 2 functions in Filter and Library	2021-04-27 16:59:21 +04:00
Veloman Yunkan	2d76f8395e	Dropped unused functions from Filter's private API This should have been done back in PR #460	2021-04-27 16:59:21 +04:00
Manan Jethwani	965b9622c2	removed redirect to articles in search	2021-04-20 20:23:42 +05:30
Veloman Yunkan	9d4370403b	get_url() was renamed in zim::search_iterator	2021-04-16 13:30:36 +04:00
Vertigo	611146aa37	Added Search Link for bad bookName/articleName on 404	2021-04-12 21:31:47 +05:30
Veloman Yunkan	b54215f146	Manager::readOpds() doesn't modify its input	2021-04-12 15:14:12 +02:00
Veloman Yunkan	9033f2f28e	Manager::readXml() doesn't modify its input	2021-04-12 15:14:12 +02:00
Veloman Yunkan	ec9186b174	Library::removeBookById() updates the search DB This fix makes the `XmlLibraryTest.removeBookByIdUpdatesTheSearchDB` unit-test pass.	2021-04-09 17:06:45 +04:00
Veloman Yunkan	aaaa5a637e	Library::filter() doesn't create empty books This changes how the `XmlLibraryTest.removeBookByIdUpdatesTheSearchDB` unit-test fails.	2021-04-09 17:06:45 +04:00
Veloman Yunkan	24ed96a38c	Library.removeBookById() drops the reader too This fix makes the `XmlLibraryTest.removeBookByIdDropsTheReader` unit-test pass.	2021-04-09 17:05:56 +04:00
Manan Jethwani	5cb276a933	adding kind and path attributes to suggest response object and using it in autocomplete	2021-04-07 21:04:33 +05:30
Veloman Yunkan	aa2a031ba4	Xapian headers are not exposed through libkiwix	2021-04-07 18:24:33 +04:00
Manan Jethwani	7872734f44	changed method of injecting root link	2021-03-24 14:17:58 +05:30
Manan Jethwani	c557bb271b	injecting root link directly and renamed head_part to head_taskbar	2021-03-24 02:10:16 +05:30
Manan Jethwani	93264f7409	added root functionality for block external link feature	2021-03-23 03:17:14 +05:30
Veloman Yunkan	e214efecd4	Language code conversion via ICU Language code is converted from ISO 639-3 to ISO 639 (which is understood by Xapian) via ICU. The previous approach via an explicit map had its advantages since Xapian has more than one stemmer implementation for some languages (selectable via Xapian-specific identifiers). This commit relies on the defaults associated with the ISO 639 language codes.	2021-03-17 14:32:03 +01:00
Veloman Yunkan	09233bf4f3	Support for partial queries in catalog search The search text in the catalog query is interpreted as partial by default, but partial query mode can be disabled in C++. The latter possibility is not exposed via the /catalog/search kiwix-serve endpoint, though.	2021-03-17 14:32:03 +01:00
Veloman Yunkan	a599fb3892	Initial version of Xapian-based catalog search	2021-03-17 14:32:03 +01:00
Veloman Yunkan	a17fc0ef2d	Library::getBooksByTitleOrDescription()	2021-03-17 14:32:03 +01:00
Veloman Yunkan	db06b2c7ca	Library::BookIdCollection typedef	2021-03-17 14:32:03 +01:00
Veloman Yunkan	a20f9e2ce1	Library::filter() works in two stages 1. Get the subset of books matching the q (title/description) parameter of the search 2. Filter out books not matching the other parameters of the search. Stage 1. currently works in the old way, but will be replaced by Xapian based search in subsequent commits.	2021-03-17 14:32:03 +01:00
Veloman Yunkan	b7b0bdbdd8	Both Book::update() methods update the category	2021-03-17 14:10:57 +04:00
Veloman Yunkan	4abc4f8518	Support for book category attribute in library.xml	2021-03-17 14:10:57 +04:00
Veloman Yunkan	6b2067c236	Reading category element from OPDS stream	2021-03-17 14:10:57 +04:00
Veloman Yunkan	e55bf514e8	Dedicated 'category' parameter in catalog search	2021-03-17 14:10:57 +04:00
Veloman Yunkan	80d4f7e349	Extracted InternalServer::search_catalog()	2021-03-17 14:10:57 +04:00
Veloman Yunkan	58186ffb26	kiwix::Book::getCategory()	2021-03-17 14:09:48 +04:00
Veloman Yunkan	ae32ff40c0	Dropped an extra colon from book <updated> dates	2021-03-17 14:02:27 +04:00
Veloman Yunkan	26331b401e	Fixed the month in OPDS feed <updated> date `tm::tm_mon` varies in the [0, 11] range.	2021-03-17 14:02:27 +04:00
Matthieu Gautier	67caae6c32	Use the new libzim's getRandomEntry instead of implementing it ourselves.	2021-03-02 14:16:09 +01:00
Veloman Yunkan	839fc10a4f	Fixed the Windows build Opening ZIM archives by file descriptor (as well as embedded ZIM archives) is not supported under Windows.	2021-02-10 14:19:47 +01:00
Veloman Yunkan	5a8b825c70	Testing of JNIKiwixReader.getDirectAccessInformation()	2021-02-10 14:19:47 +01:00
Veloman Yunkan	7a465e66d7	Renamed org.kiwix.kiwixlib.{Pair->DirectAccessInfo}	2021-02-10 14:19:47 +01:00
Veloman Yunkan	5a99634dfd	Java wrapper test checks favicon.png too	2021-02-10 14:19:47 +01:00
Veloman Yunkan	e028bcbb04	Android's java.io.FileDescriptor is different	2021-02-10 14:19:47 +01:00
Veloman Yunkan	9cdf7a44c0	JNIKiwixReader can open an embedded ZIM archive	2021-02-10 14:19:47 +01:00
Veloman Yunkan	4d23e44de7	JNIKiwixReader ctor taking a file descriptor ... and a corresponding unit test	2021-02-10 14:19:47 +01:00
Veloman Yunkan	98d69ef59b	Added testReader unit-test for the java wrapper	2021-02-10 14:19:47 +01:00
Veloman Yunkan	e40827fbac	Renamed the java wrapper unit test runner script	2021-02-10 14:19:47 +01:00
Veloman Yunkan	a798e0c0a1	Made the java wrapper unit test run & pass The kiwixlib java wrapper unit test can be run manually via the src/wrapper/java/org/kiwix/testing/compile_test.sh script. The test ZIM files in src/wrapper/java/org/kiwix/testing were created using the create_test_zimfiles. They must be updated/re-generated and committed in git whenever their source data or the create_test_zimfiles script changes. Note: small.zim.embedded is not used at this point, it was created for testing the enhancement coming in a few commits.	2021-02-10 14:19:47 +01:00
Matthieu Gautier	24b2e6e585	Remove unnecessary include.	2021-01-26 17:53:25 +01:00
Matthieu Gautier	3fd1310008	Use c++11 std::thread instead of pthread.	2021-01-26 17:53:25 +01:00
Matthieu Gautier	4749656828	Do not crash if zim file has no `Counter` metadata.	2021-01-26 15:15:27 +01:00
Emmanuel Engelhart	84895c4036	Better </head> detection regex	2021-01-18 13:16:56 +01:00
Emmanuel Engelhart	a8bf9dd5b4	Better Kiwix Serve Taskbar insertion (after charset definition)	2021-01-18 11:18:53 +01:00
Emmanuel Engelhart	a61c94ef10	Add GPLv3 header	2021-01-18 10:54:33 +01:00
Emmanuel Engelhart	8c43fd8d36	Fix taskbar insertion in case of '<head>' attributes	2021-01-11 14:37:19 +01:00
Emmanuel Engelhart	3e2810dff4	Support 'video/*' 'audio/*' mimetypes in getMediaCount()	2021-01-07 12:32:32 +01:00
Emmanuel Engelhart	44c4aa931a	Better use kiwix::startsWith()	2021-01-03 15:17:03 +01:00
Emmanuel Engelhart	95b32b168d	More robust getMediaCount()	2021-01-01 17:05:32 +01:00
Matthieu Gautier	1002c15e0d	Remove unnecessary checks. `Reader` cannot be created with a null `zimArchive`. We don't have to check for zimArchive being not null.	2020-12-09 14:25:02 +01:00
Matthieu Gautier	d51000c4a9	Use new libzim method `hasFulltextIndex` to check for fulltext index.	2020-12-09 14:25:02 +01:00
Matthieu Gautier	ba302bed33	Use new libzim method `getFaviconEntry` to get the favicon.	2020-12-09 14:25:02 +01:00
Steve Wills	6900b4e506	fix build on FreeBSD Without this header, sockaddr_in and INADDR_ANY are not defined	2020-12-07 09:38:46 -05:00
Matthieu Gautier	1a5a2e7a8e	Adapt kiwix-lib to the new libzim api.	2020-12-02 12:16:48 +01:00
Matthieu Gautier	d87079ec13	Remove deprecated method in the reader.	2020-11-24 19:00:52 +01:00
Veloman Yunkan	0f8fe1f63f	Alternative implementation of parseMimetypeCounter()	2020-10-29 14:11:27 +04:00
Matthieu Gautier	08464f23bc	Better parsing of `M/Counter` A mimetype may contain parameters, in which case it looks like "text/html;foo=bar;foz=baz". It then contains `;` and `=`, which conflict with the separators we use between the items in our list. We have to use a more advanced algorithm which takes the context into account. Fix #416	2020-10-28 16:03:18 +01:00
Matthieu Gautier	ef42abea4b	Add some tests of `parseMimetypeCounter`	2020-10-28 14:44:23 +01:00
Matthieu Gautier	4407dd12bd	Move mimetypeCounter parsing in its own function.	2020-10-28 14:08:06 +01:00
Matthieu Gautier	632583ede2	Add missing include	2020-10-07 18:43:57 +02:00
Matthieu Gautier	61f9d4ab3a	Stop the internal server only if it exists.	2020-10-07 14:36:45 +02:00
Matthieu Gautier	470bfc3f1f	Better variable name for outStream.	2020-08-28 15:27:03 +02:00
Matthieu Gautier	ea3180cb8c	Better error printing.	2020-08-28 15:27:03 +02:00
Matthieu Gautier	72d3f8f8e2	Fix segmentation fault with curl requests. Use a heap allocated buffer (with lifetime of Aria2 class) instead of a stack allocated one. Original fix made by @ZaWertun. Kudos to him. Fix #kiwix/kiwix-desktop#123, kiwix/kiwix-desktop#513 and kiwix/kiwix-desktop#423	2020-08-26 12:42:16 +02:00
Matthieu Gautier	af9e03904c	Use std::mutex and std::unique_lock instead of pthread mutex/lock. It simplifies the code a bit and ensures that the mutex is correctly unlocked even in case of an exception.	2020-08-26 12:30:56 +02:00
Matthieu Gautier	39611cbd60	Wait for waitingThread to exit before destroying the subprocess memory. WaitingThread reads some memory shared with the SubProcess (`mutex`, `m_running`). When we destroy the SubProcess, we must be sure that WaitingThread has finished correctly; otherwise we may have invalid reads/writes on freed memory.	2020-08-26 12:26:04 +02:00
Matthieu Gautier	6f0d3003ac	Remove `m_compress` member.	2020-08-13 11:16:41 +02:00
Matthieu Gautier	ee17b0739a	Fix compilation on CI native dyn. On the CI, the native_dyn docker image is set up with a packaged version of libmicrohttpd for which `MHD_HTTP_RANGE_NOT_SATISFIABLE` is not defined. Once the CI is fixed, we can revert this commit.	2020-08-13 11:16:41 +02:00
Matthieu Gautier	47436f7bdd	Move some header setting into the response's constructors. This makes it easier to understand what is more or less constant and what depends on the context.	2020-08-13 11:16:41 +02:00
Matthieu Gautier	3352c95314	Remove the `RedirectResponse` and use a basic `Response` with header.	2020-08-13 11:16:41 +02:00
Matthieu Gautier	77123ac74c	Move the adding of 304 headers into the 304 factory. This avoids creating a ContentResponse just to have some correct headers.	2020-08-13 11:16:41 +02:00
Matthieu Gautier	9078f0ac6e	Remove `ResponseMode`.	2020-08-13 11:16:41 +02:00
Matthieu Gautier	8d6567d067	Create a utility builder for 416 response. Also add a map in the response to store specific headers.	2020-08-13 11:16:41 +02:00
Matthieu Gautier	6d5cddca12	Fix android compilation Android clang complains that it cannot move the `std::unique_ptr<ContentResponse>` into a `std::unique_ptr<Response>&&` (for the implicit `std::unique_ptr<Response>` constructor). Let's help it a bit.	2020-08-13 11:16:41 +02:00
Matthieu Gautier	a3939e9a05	Move all the content code in the ContentResponse.	2020-08-13 11:16:41 +02:00
Matthieu Gautier	eee621d15b	Move small utilities method to create response in Response class.	2020-08-13 11:16:41 +02:00
Matthieu Gautier	7b2ee37437	Move the entry response to its own class.	2020-08-13 11:16:41 +02:00
Matthieu Gautier	f014fb2895	Introduce a ContentResponse. This is only an "interface" for now, as other types of response (entry) may be "transformed" into a ContentResponse. We cannot move all the code into the class.	2020-08-13 11:16:41 +02:00
Matthieu Gautier	1011d1ff0b	Move the redirection response in its own class. The redirection is the easiest to move, let's start with this one.	2020-08-13 11:16:41 +02:00
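
Note on commit 08464f23bc (Better parsing of `M/Counter`): the message explains that a mimetype with parameters contains the same `;` and `=` characters that separate the list items, so a plain split on `;` is not enough. The following is a minimal sketch of one context-aware approach, not the libkiwix implementation. The names `endsWithCount` and `parseCounterSketch` are hypothetical, and the rule "an item is complete once it ends with `=<digits>`" is an assumption made for illustration.

```cpp
#include <cctype>
#include <cstddef>
#include <map>
#include <sstream>
#include <string>

// Hypothetical helper: true if `s` ends with '=' followed by one or more digits.
static bool endsWithCount(const std::string& s)
{
  const std::size_t eq = s.rfind('=');
  if (eq == std::string::npos || eq + 1 == s.size()) return false;
  for (std::size_t i = eq + 1; i < s.size(); ++i) {
    if (!std::isdigit(static_cast<unsigned char>(s[i]))) return false;
  }
  return true;
}

// Sketch of a parser for "mimetype=count;mimetype=count" where a mimetype
// may itself carry ";name=value" parameters.
// Known limitation of this assumed rule: a parameter whose value is purely
// numeric (e.g. ";q=1") is indistinguishable from a count.
std::map<std::string, unsigned int> parseCounterSketch(const std::string& data)
{
  std::map<std::string, unsigned int> counters;
  std::istringstream in(data);
  std::string segment;
  std::string item;
  while (std::getline(in, segment, ';')) {
    // Re-join pieces that were split inside a mimetype's parameter list.
    item = item.empty() ? segment : item + ";" + segment;
    if (!endsWithCount(item)) continue;  // item is not complete yet
    const std::size_t eq = item.rfind('=');
    // std::stoul throws std::out_of_range for absurdly long digit strings.
    counters[item.substr(0, eq)] =
        static_cast<unsigned int>(std::stoul(item.substr(eq + 1)));
    item.clear();
  }
  return counters;
}
```

With this sketch, the input `text/html;foo=bar;foz=baz=3;image/png=2` yields two entries: `text/html;foo=bar;foz=baz` with count 3 and `image/png` with count 2.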

923 Commits
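
Note on commit 26331b401e (Fixed the month in OPDS feed <updated> date): the message points out that `tm::tm_mon` is zero-based, so it has to be incremented before being printed as a calendar month. The sketch below illustrates that off-by-one only. The function name `formatUpdatedDate` and the exact timestamp layout are assumptions, not taken from the libkiwix sources.

```cpp
#include <cstdio>
#include <ctime>
#include <string>

// Hypothetical helper: formats a broken-down UTC time as
// "YYYY-MM-DDThh:mm:ssZ" (layout assumed for illustration).
std::string formatUpdatedDate(const std::tm& t)
{
  char buf[32];
  std::snprintf(buf, sizeof(buf), "%04d-%02d-%02dT%02d:%02d:%02dZ",
                t.tm_year + 1900,  // tm_year counts years since 1900
                t.tm_mon + 1,      // tm_mon is in [0, 11], so add 1
                t.tm_mday, t.tm_hour, t.tm_min, t.tm_sec);
  return buf;
}
```

For a `std::tm` with `tm_year = 121`, `tm_mon = 2` and `tm_mday = 17`, the date part of the result is `2021-03-17`; printing `tm_mon` unmodified would have produced February instead of March.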