libkiwix

Commit Graph

Author	SHA1	Message	Date
Veloman Yunkan	685e7f8ad4	Unconditional blocking of external links	2022-09-21 15:41:40 +04:00
Veloman Yunkan	0ce36e6246	Got rid of isHomePage in ContentResponse::build()	2022-09-21 15:41:40 +04:00
Veloman Yunkan	eb0a45b13e	Undefaulted bool params of ContentResponse::build() This resulted in compiler aided discovery of all call sites where the default values were used. For OPDS/catalog requests now passing true for the `raw` parameter, since XML content isn't supposed to undergo any transformations.	2022-09-21 15:41:40 +04:00
Veloman Yunkan	c988511561	Removed unused param from ContentResponse::build() Removed the isHomePage param from one of the variants of `ContentResponse::build()`. The other overload is dangerous since failing to review&update all of its call site may result in changed semantics. Will do it in a couple of separate commits.	2022-09-21 15:41:40 +04:00
Veloman Yunkan	c73e6f9a81	Dropped unused params from ContentResponse ctor	2022-09-21 15:41:40 +04:00
Veloman Yunkan	0cf4850a9b	Dropped TaskbarInfo	2022-09-21 15:41:40 +04:00
Veloman Yunkan	40c496d401	Removed old-style taskbar injection Double-toolbar in the viewer has gone. Some clean-up has to be performed after this change.	2022-09-21 15:41:40 +04:00
Veloman Yunkan	4db443eca6	Embryo of iframe-based viewer	2022-09-21 15:41:40 +04:00
Emmanuel Engelhart	1062bd73a3	It's libkiwix, not kiwixlib	2022-09-11 16:05:25 +02:00
Veloman Yunkan	e323dcf6c9	Redirecting /nonendpoint URLs to /content/nonendpoint	2022-08-11 18:04:05 +04:00
Veloman Yunkan	3b98987cb3	More robust handling of endpoint URLs The next goal is to redirect old-style /book/path/to/entry URLs to /content/book/path/to/entry, which seemed pretty trivial. However, given the current handling of some endpoint URLs, more work was required to ensure that invalid endpoint URLs (e.g. "/random/number" or "/suggest/fr") are not interpreted as content URLs. Previously, that was not a user-observable issue, since the result would be an immediate 404 error (except in certain edge cases, like handling the request for "/random/number" when there is a book with name "random" containing an article at path "/number"). With redirection of URLs that were assumed to refer to content a 404 error would be issued for the transformed URL ("/content/random/number") which may be confusing. Therefore this change is to ensure the correct routing of endpoint URL handling.	2022-08-11 18:04:05 +04:00
Veloman Yunkan	fd36d11ccf	Search results now use the /content URL scheme	2022-08-11 18:04:05 +04:00
Veloman Yunkan	1b1c1e352e	Introduced /content endpoint Book content is now served under /content/book/... The old access to book content via a top-level URL /book/... is so far preserved for backward compatibility. Redirects were changed to use the new URL scheme. Links in the search results still use the old scheme.	2022-08-11 18:04:05 +04:00
Veloman Yunkan	a4b18893aa	Moved handling of the "/" URL	2022-08-11 18:04:05 +04:00
Veloman Yunkan	cff143b4ec	Included tags in free text catalog search	2022-08-06 07:39:45 +02:00
Veloman Yunkan	111aab0c23	Illustration URL uses the book UUID If the server is initialized with a library.xml file, then the id specified in the XML file is used (rather than the UUID recorded in the ZIM file). Note that in test/data/library.xml the book ids are fake and different from the real ZIM IDs; that file was created for testing of the /catalog endpoint which doesn't access ZIM content, so the the same ZIM file zimfile.zim was added to library.xml three times as three different books (with unique human-friendly ids). This explains the diff in test/library_server.cpp.	2022-08-03 16:13:21 +02:00
Veloman Yunkan	28f8dbcf20	New unit-test stringTools.ICULanguageInfo	2022-07-07 16:13:49 +04:00
Matthieu Gautier	71e2df7406	Explicit std Removed headers were `using namespace std`. So we have to be explicit everywhere.	2022-07-02 16:33:32 +02:00
Matthieu Gautier	69931fb347	Remove libzim's wrapper. It is time to remove them. They are deprecated since 10.0.0	2022-07-02 16:33:32 +02:00
Veloman Yunkan	e3e4bfa533	Support for serving customized resources During work on the kiwix-serve front-end, the edit-save-test cycle is a multistep procedure: 1. build and install libkiwix 2. build kiwix-tools 3. run kiwix-serve 4. reload the web-page in the browser When making changes in static resources that are served by kiwix-serve unmodified, the steps 1-3 can be eliminated if kiwix-serve is capable of serving resources from the file-system. This commit adds such a functionality to kiwix-serve. Now, if during startup of kiwix-serve the environment variable `KIWIX_SERVE_CUSTOMIZED_RESOURCES` is defined it is assumed to point to a file where every line has the following format: URL MIMETYPE RESOURCE_FILE_PATH When a request is received by kiwix-serve and its URL matches any of the URLs read from the customized resource file, then the resource data is read from the respective file RESOURCE_FILE_PATH and served with mime-type MIMETYPE. Though this feature was introduced in order to facilitate the development of the iframe-based content viewer, it can also be useful to users who would like to customize the kiwix-serve front-end on their own (without re-building all of kiwix-serve). There is some overlap with a feature of the kiwix-compile-resources script that also allows to override resources. The differences are: 1. The new way of customizing front-end resources has all such resources listed in a text file and there is a single environment variable from which the path of that file is read. kiwix-compile-resources associates a separate environment variable with each resource. 2. The new way uses regular paths to identify a resource. The kiwix-compile-resources method encodes the resource path by replacing any non-alphanumeric characters (including the path separator) with underscores (so that the resulting resource identifier can be used to construct the name of the environment variable controlling that resource). 3. The new method allows adding new front-end resources. The old method only allows to modify existing resources. 4. The new method allows (actually requires) to specify the URL at which the overriden resource should be served (similarly, the MIME-type can/must be specified, too). The old method only allows to override the contents of a resource. 5. The new method only allows to override front-end resources that are served without any preprocessing by kiwix-serve at runtime. The old method allows to override template resources as well (note that internationalization/translation resources cannot be overriden using the old method, either).	2022-06-22 10:59:41 +02:00
Matthieu Gautier	b442e2371e	Do not use deprecated constructor for Reader. We have a specific private non deprecated constructor especially for that, let's use it.	2022-06-10 10:41:31 +02:00
Matthieu Gautier	70382d15e2	Windows compiler complains about the implicit cast from double to size_t.	2022-06-09 15:21:06 +02:00
Matthieu Gautier	01c384bb64	Remove the java wrapper. - The meson's `wrapper` option is removed. - New meson's option `static-linkage` is added to tell meson to link with static library.	2022-06-09 10:23:02 +02:00
Matthieu Gautier	bfcf317f09	Properly set "language" parameter in `opensearch::Query` tag.	2022-06-03 15:46:41 +02:00
Matthieu Gautier	7cb98f7f4e	Make opensearch start parameter 1 indexed.	2022-06-03 15:46:41 +02:00
Matthieu Gautier	cadd2a5cbb	Make the HTTPErrorHtmlResponse not Html only.	2022-06-03 15:46:41 +02:00
Matthieu Gautier	e51a5b9ebc	Introduce `get_requested_format` helper	2022-06-03 15:46:41 +02:00
Matthieu Gautier	5d6b0ea96a	Add searchdescription.xml endpoint	2022-06-03 15:46:41 +02:00
Matthieu Gautier	e5df5e936f	Render the search result using (opensearch/atom) xml format.	2022-06-03 15:46:41 +02:00
Matthieu Gautier	fbc7656b3f	Use proper argument order when building the SearchRenderer from a Searcher	2022-06-02 17:08:50 +02:00
Matthieu Gautier	d196496802	Make the Searcher owning the stored Reader If we keep a reference to a `Reader` it is better to (share) owning the reference. Else the reader may be deleted after we create the searcher. This is especially the case now we are creating the `Reader` at demand and we don't store it in the library's cache.	2022-06-02 17:08:17 +02:00
Matthieu Gautier	a7651d0e9b	Check early that provided bookIds are valid	2022-06-02 12:37:52 +02:00
Matthieu Gautier	3bca43344f	Correctly url encode querystring Fix tests with querystring needed url encoding (pattern=jazz&books.query.title=Ray%20Charles)	2022-06-02 12:37:52 +02:00
Matthieu Gautier	b857293cfd	Build the bookSelection query string when we parse the query. We have to reuse the query the user give us to generate the pagination links. At search result rendering step we don't have access to the query object. The best place to know which arguments are used to select books (and so which arguments to keep in the pagination links) is when we parse the query to select books. Fix tests (pagination links) with book selector other than "books.id=" (pattern=jazz&books.query.lang=eng)	2022-06-02 12:37:52 +02:00
Matthieu Gautier	b483a8e4e4	Make the request_context be able to generate a querystring for a subset. The request_context can now take a filter to select arguments to keep in the query string.	2022-06-02 12:37:52 +02:00
Matthieu Gautier	e2ab7fd62e	Add some more testing. Note that some tests are failing and will be fixed in next commits.	2022-06-02 12:37:52 +02:00
Matthieu Gautier	1514661c26	Protect search from multi threading race condition. libzim's search is not thread safe (mainly because xapian is not). So we must protect our search objects from multi thread calls. The best way to do this is to associate a mutex to the `zim::Searcher` and lock the searcher each time we access object derivated from the searcher (search, results, iterator, ...)	2022-06-02 12:37:52 +02:00
Matthieu Gautier	e5ea210d2c	Add a template specialization for ConcurrentCache storing shared_ptr When ConcurrentCache store a shared_ptr we may have shared_ptr in used while the ConcurrentCache has drop it. When we "recreate" a value to put in the cache, we don't want to recreate it, but copying the shared_ptr in use. To do so we use a (unlimited) store of weak_ptr (aka `WeakStore`) Every created shared_ptr added to the cache has a weak_ptr ref also stored in the WeakStore, and we check the WeakStore before creating the value.	2022-06-02 12:37:52 +02:00
Matthieu Gautier	2b38d2cf1b	Copy the lrucache test from libzim. - Adapt lrucache.cpp for rigth include path and use `kiwix::lru_cache` instead of `zim::lru_cache`. - Add missing `#include <set>` in lrucache.h	2022-06-02 12:37:52 +02:00
Matthieu Gautier	0081b4d8e7	Make the limit of zim files per search configurable. The default value is 0, which means no limit.	2022-06-02 12:37:52 +02:00
Matthieu Gautier	b74910b2af	Limit the number of zim in multizim fulltext search. We are currently limiting to 5 but it will be changed in next commit.	2022-06-02 12:37:50 +02:00
Matthieu Gautier	cf30233358	Prefix env variable name with `KIWIX_`	2022-06-02 12:23:43 +02:00
Matthieu Gautier	f0065fdd6f	Introduce Error exception to do i18n	2022-06-02 12:23:42 +02:00
Matthieu Gautier	c72132054d	Move i18n helper functions	2022-06-02 12:22:28 +02:00
Matthieu Gautier	077ceac5a5	Make the search_rendered handle multizim search. This introduce a intermediate mustache object to store information about the request made by the user.	2022-06-02 12:22:28 +02:00
Matthieu Gautier	39d0a56be8	Use selectBooks in handle_search	2022-06-02 12:22:28 +02:00
Matthieu Gautier	76d5fafb72	Introduce `selectBooks` `selectBooks` allow us to parse a query in a "standard" way to get the book(s) on which the user want to work.	2022-06-02 12:22:28 +02:00
Matthieu Gautier	4438106c2f	Add a prefix in get_search_filter The prefix will be used to parse a "query to select book" in different context. For now we have only one context : selecting books for the catalog search. But we will want to select books to do fulltext search on them (will be done in later commit)	2022-06-02 12:22:28 +02:00
Matthieu Gautier	76ebfd7ea4	Move get_search_filter and subrange.	2022-06-02 12:22:27 +02:00
Matthieu Gautier	22996e4a6b	Allow user to select multiple books when doing search.	2022-06-02 12:22:27 +02:00
Matthieu Gautier	98c54b2279	Handle multiple arguments in RequestContext.	2022-06-02 12:22:27 +02:00
Matthieu Gautier	854623618c	Use the newly introduced searcherCache for multizim searcher.	2022-06-02 12:22:25 +02:00
Matthieu Gautier	fd0edbba80	Use a set of id as key for a the searcher Cache. It will allow use to cache seacher for multiple zim files.	2022-05-24 14:55:48 +02:00
Matthieu Gautier	f5af0633ec	Move the searcher cache into the Library	2022-05-24 14:55:48 +02:00
Matthieu Gautier	740581c55c	Link the cache size to the book count. Unless explicitly set via user env variable.	2022-05-24 14:55:48 +02:00
Matthieu Gautier	582e3ec46d	Use a concurrent cache to store Archive cache.	2022-05-24 14:55:48 +02:00
Matthieu Gautier	28fb76bbc2	Remove m_readers in `Library::impl` It is a deprecated interface and it is a simple wrapper on Archive.	2022-05-24 14:55:48 +02:00
Matthieu Gautier	7c688a4acc	Move `getCacheLength` to a generic helper function `getEnvVar`	2022-05-24 14:55:48 +02:00
Matthieu Gautier	66b2449800	Remove unnecessary catch Catch of std::exception is already made in `handle_request`	2022-05-23 19:17:28 +02:00
Matthieu Gautier	aad95e3413	Introduce a results intermediate object in the template rendering. Url in href must not be html encoded. As we already url encode the path, it is ok to have `'` in the url.	2022-05-23 19:16:14 +02:00
Matthieu Gautier	f0dd34b6db	Introduce buildQueryData helper in SearchRenderer	2022-05-23 19:13:25 +02:00
Matthieu Gautier	bbdde93f49	Introduce a pagination object to render search result.	2022-05-23 19:12:17 +02:00
Matthieu Gautier	cb62da65c3	Raise a exception if something went wrong in the template rendering.	2022-05-23 10:56:39 +02:00
Matthieu Gautier	288b4ae7df	Fix count of remote books in `Library::Impl::getBookCount`	2022-05-23 10:56:39 +02:00
Matthieu Gautier	52c12b0c2f	Introduce `Library::Impl::getBookCount` We simply introduce a `getBookCount` which is not protected by a lock.	2022-05-23 10:56:39 +02:00
Matthieu Gautier	4695f47dd2	Introduce operator+= to simplify response creation.	2022-05-23 10:56:39 +02:00
Matthieu Gautier	f42f6a60df	Use extractFromString to parse request argument. On top of reusing code, it throw a exception if we cannot convert given argument in the type we want.	2022-05-23 10:56:39 +02:00
Matthieu Gautier	717c39f2ef	Better ExtractFromString - Throw a exception if we cannot extract from string. (We throw the same exception as `std::sto*`) - Add a specialization to extract string from string - Add some unit test	2022-05-23 10:56:39 +02:00
Matthieu Gautier	aa1f73472d	Remove unecessary BookDB helper class. It was needed to not expose Xapian in public header. Now we can remove it and directly use a Xapian db.	2022-05-23 10:56:39 +02:00
Matthieu Gautier	090c2fd31a	Move LibraryBase out of public API. We use composition instead of inheritance to implement Library.	2022-05-23 10:56:39 +02:00
Veloman Yunkan	84c68d4d7b	Search results pagination bugfix Search results pagination is disabled for a single page outcome too.	2022-05-18 12:45:47 +04:00
Veloman Yunkan	3b9f28b2b5	Applied cache-id to search_results.css The story of search_results.css static/skin/search_results.css was extracted from static/templates/no_search_result.html before the latter was dropped. static/templates/no_search_result.html in turn seems to be a copied and edited version of static/templates/search_result.html. In the context of exploratory work on the internationalization of kiwix-serve (PR #679) I noticed duplication of inline CSS across those two templates and intended to eliminated it. That goal was not fully accomplished (static/templates/search_result.html remained untouched) because by that time PR #679 grew too big and the efforts were diverted into splitting it into smaller ones. Thus search_results.css slipped into one of those small PRs, without making much sense because nothing really justifies preserving custom CSS in the "Fulltext search unavailable" error page. At the same time, it served as the only case where a link to a cacheable resource is generated in C++ code (rather than found in a template). This poses certain problems to the handling of cache-ids. A workaround is to expel the URL into a template so that it is processed by `kiwix-resources`. This commit merely demonstrates that solution. But whether it should be preserved (or rather the "Fulltext search unavailable" page should be deprived of CSS) is questionable.	2022-05-02 20:37:22 +04:00
Matthieu Gautier	fba0f09f4f	Do not compress content smaller than 1400 Bytes	2022-04-27 18:23:39 +02:00
Matthieu Gautier	0d294c50a5	[SERVER] Support gzip encoding instead of deflate. The `compress` function is copied from httplib	2022-04-27 18:23:38 +02:00
Veloman Yunkan	927c12574a	Preliminary support for Accept-Language: header In the absence of the "userlang" query parameter in the URL, the value of the "Accept-Language" header is used. However, it is assumed that "Accept-Language" specifies a single language (rather than a comma separated list of languages possibly weighted with quality values). Example: Accept-Language: fr // should work Accept-Language: fr-CH, fr;q=0.9, en;q=0.8, de;q=0.7, ;q=0.5 // The requested language will be considered to be // "fr-CH, fr;q=0.9, en;q=0.8, de;q=0.7, ;q=0.5". // The i18n code will fail to find resources for such a language // and will use the default "en" instead.	2022-04-13 16:40:20 +02:00
Veloman Yunkan	9987fbd488	Fixed CI build failure under android_arm*	2022-04-13 16:40:20 +02:00
Veloman Yunkan	a0d9a824e1	Internationalized searchbox tooltip	2022-04-13 16:40:20 +02:00
Veloman Yunkan	11be821c46	Internationalized "Go to a randomly selected page" At this point a potential issue has been revealed. Now we produce the final HTML via 2-level template expansion 1. Render parameterized messages 2. Render the HTML template In which templates we should use double mustache "{{}}" (HTML-escaping) tags and where we may use triple mustache "{{{}}}" (non-escaping) tags?	2022-04-13 16:40:20 +02:00
Veloman Yunkan	3da81a3d0f	Internationalized "Go to the main page" button	2022-04-13 16:40:20 +02:00
Veloman Yunkan	f73be3cde7	Initializing mustache data via initializer list	2022-04-13 16:40:20 +02:00
Veloman Yunkan	c2bfeb4030	"Go to welcome page" is internationalized	2022-04-13 16:40:20 +02:00
Veloman Yunkan	6f3db20078	Internationalized "Fulltext search unavailable" page	2022-04-13 16:40:20 +02:00
Veloman Yunkan	fbd23a8329	Fully internationalized 400, 404 & 500 error pages	2022-04-13 16:40:20 +02:00
Veloman Yunkan	d2c864b010	Internationalized raw-entry-not-found message	2022-04-13 16:40:20 +02:00
Veloman Yunkan	779382642b	Internationalized bad raw access datatype message	2022-04-13 16:40:20 +02:00
Veloman Yunkan	ca7e0fb4a0	Internationalized random article failure message	2022-04-13 16:40:20 +02:00
Veloman Yunkan	52d4f73e89	RIP searchSuggestionHTML() & English-only message	2022-04-13 16:40:20 +02:00
Veloman Yunkan	1ace16229d	Internationalized search suggestion message	2022-04-13 16:40:20 +02:00
Veloman Yunkan	cb5ae01fd8	Localized "No such book" 404 message for /random However the title and the heading of the 404 page are not localized yet.	2022-04-13 16:40:20 +02:00
Veloman Yunkan	387f977d6c	Enter ParameterizedMessage	2022-04-13 16:40:20 +02:00
Veloman Yunkan	202ec81d8b	URL-not-found message went into i18n JSON resource Yet, the URL-not-found message is not yet fully internationalized since its usage is hardcoded to English.	2022-04-13 16:40:20 +02:00
Veloman Yunkan	577b6e29f9	kiwix::i18n::expandParameterizedString()	2022-04-13 16:40:20 +02:00
Veloman Yunkan	e4a0a029ff	User language control via userlang query param This is a draft commit enabling the testing of the support for kiwix-serve internationalization.	2022-04-13 16:40:20 +02:00
Veloman Yunkan	507e111f34	i18n data is kept in and generated from JSON files Introduced a new resource compiler script kiwix-compile-i18n that processes i18n string data stored in JSON files and generates sorted C++ tables of string keys and values for all languages.	2022-04-13 16:40:20 +02:00
Veloman Yunkan	d029c2b8d5	Enter I18nStringDB	2022-04-13 16:40:20 +02:00
Veloman Yunkan	c574735f51	makeFulltextSearchSuggestion() works via mustache	2022-04-13 16:40:20 +02:00
Veloman Yunkan	a18dd82d82	Introduced makeFulltextSearchSuggestion() helper	2022-04-13 16:40:20 +02:00
Matthieu Gautier	85a9d35488	Correctly detect the number of article for zim version <= 6	2022-04-06 17:21:14 +02:00
Veloman Yunkan	ae1bf39023	Got rid of static/templates/no_search_result.html The "Fulltext search unavailable" error page is now generated using the static/templates/error.html template. Also added two test cases checking that error page.	2022-04-06 14:42:29 +02:00
Veloman Yunkan	dbcbdff275	Added an optional CSS link to error.html	2022-04-05 20:49:09 +04:00
Veloman Yunkan	2a20e87341	Got rid of Response::build_500() This change is not tested (mostly due to the difficulties of triggering an internal server error).	2022-04-04 18:35:20 +02:00
Veloman Yunkan	2028bf3a98	Fixed the CI build failure under android_arm*	2022-04-04 18:35:20 +02:00
Veloman Yunkan	545d409150	Reused HTTPErrorHtmlResponse in HTTP400HtmlResponse	2022-04-04 18:35:20 +02:00
Veloman Yunkan	89dc9afc28	Renamed 404.html to error.html 404.html no longer contains anything specific to the 404 error and will henceforth serve (with some enhancements) as a general purpose error page template.	2022-04-04 18:35:20 +02:00
Veloman Yunkan	647118dd5e	Enter HTTPErrorHtmlResponse In addition to serving as a base class for `HTTP404HtmlResponse`, `HTTPErrorHtmlResponse` is going to be used for a couple of other error pages.	2022-04-04 18:35:20 +02:00
Veloman Yunkan	d8a60db739	Preparing for a single error page template	2022-04-04 18:35:20 +02:00
Veloman Yunkan	f4059f3faf	Got rid of withTaskbarInfo()	2022-04-04 18:35:20 +02:00
Veloman Yunkan	800cc5b68a	Got rid of Response::build_404()	2022-04-04 18:35:19 +02:00
Matthieu Gautier	feb30d08aa	Correctly define the variable `urlNotFoundMsg` and `invalidUrlMsg`. As we must declare the two variables as `extern` in response.h, we must define it somewhere (and `response.cpp` is a good place).	2022-04-01 11:58:57 +02:00
Matthieu Gautier	311f783ea9	Always use the search pattern when searching in the server. There is no reason to not use the pattern if there is a geo_query. If both the pattern and the qeo_query are provided, we must use both.	2022-03-29 14:06:19 +02:00
Matthieu Gautier	3641dbf14d	Handle book without xapian index.	2022-03-29 14:05:45 +02:00
Matthieu Gautier	1962262f94	Correctly handle invalid book. If user request for a non existent book, we must return a 400 page. (This is done by throwing a `std::invalid_argument` and let the catch handle it)	2022-03-29 14:05:45 +02:00
Matthieu Gautier	7407f30790	Better cache usage. It is better to directly try to get the `Search` from the cache instead of getting the `Searcher` first which could be useless in Search already exist.	2022-03-29 14:05:45 +02:00
Matthieu Gautier	d740ffe465	Introduce SearchInfo. SearchInfo is a small helper structure to store information about the queried search. It regroup already existing information (`patternString`, geo query, ...) in one structure. It is also used as key in the cache instead of using a generated string.	2022-03-29 14:05:39 +02:00
Matthieu Gautier	e7293346be	Return http 400 error response when needed.	2022-03-28 17:37:41 +02:00
Matthieu Gautier	b1643e422e	Introduce HTTP400HtmlResponse. HTTP400HtmlResponse is build on the same design than HTTP404HtmlResponse.	2022-03-28 17:35:15 +02:00
Veloman Yunkan	ec2e10b40e	Moved taskbarInfo into ContentResponseBlueprint	2022-03-28 14:56:40 +02:00
Veloman Yunkan	2da8ea1650	Moved function definition to cpp	2022-03-28 14:56:40 +02:00
Veloman Yunkan	0eb8f09f79	One more victory of HTTP404HtmlResponse One more instance of `Response::build_404()` & `withTaskbarInfo()` was taken over by `HTTP404HtmlResponse`.	2022-03-28 14:56:40 +02:00
Veloman Yunkan	0ecbdbcf63	Enter TaskbarInfo After this change it's time to say thank you and good-bye to `withTaskbarInfo()`. But it will take a while.	2022-03-28 14:56:40 +02:00
Veloman Yunkan	9bc09a815c	noSuchBookErrorMsg()	2022-03-28 14:56:40 +02:00
Veloman Yunkan	48d377ca44	HTTP404HtmlResponse::operator+(const std::string&)	2022-03-28 14:56:40 +02:00
Veloman Yunkan	d5ae92e4e2	More uses of HTTP404HtmlResponse	2022-03-28 14:56:40 +02:00
Veloman Yunkan	1a5e2eda0f	HTTP404HtmlResponse::operator+(UrlNotFoundMsg)	2022-03-28 14:56:40 +02:00
Veloman Yunkan	89785a259a	Enter HTTP404HtmlResponse	2022-03-28 14:56:40 +02:00
Veloman Yunkan	668063205c	Enter UrlNotFoundMsg iomanipulator-like class	2022-03-28 14:56:40 +02:00
Veloman Yunkan	df98c58d07	Enter ContentResponseBlueprint	2022-03-28 14:56:40 +02:00
Veloman Yunkan	ff8da65c68	Separated make404ResponseData()	2022-03-28 14:56:40 +02:00
Veloman Yunkan	ae60ba806b	Made 404.html error template a little more generic The fact that an info message was moved into C++ code is temporary since it will be moved to a message resource file soon.	2022-03-28 14:56:40 +02:00
Veloman Yunkan	8cfcf2ea86	A new overload of Response::build_404()	2022-03-28 14:56:40 +02:00
Veloman Yunkan	26c16bb1b2	Renamed a variable	2022-03-28 14:56:40 +02:00
Veloman Yunkan	ca965d448f	Got rid of 2 parameters in Response::build_404() Instead of passing the `bookName` and `bookTitle` parameters to `Response::build_404()`, `withTaskbarInfo()` is applied to its result when needed. Note, that in `InternalServer::handle_raw()` `withTaskbarInfo()` was not utilized since the results of the `/raw` endpoint are not supposed to be decorated with a taskbar.	2022-03-28 14:56:40 +02:00
Veloman Yunkan	6d16d7386d	Changed the signature of ContentResponse::set_taskbar()	2022-03-28 14:56:40 +02:00
Veloman Yunkan	40e9a19c48	Introduced withTaskbarInfo() helper function This was done in preparation for removing the `bookName` and `bookTitle` parameters from `Response::build_404()`, but since the new function could already be put to some use in this commit that was done too.	2022-03-28 14:56:40 +02:00
Veloman Yunkan	d487c78ea4	Changed the return type of Response::build_404()	2022-03-28 14:56:40 +02:00
Veloman Yunkan	96cbd2bf26	kiwix::onlyAsNonEmptyMustacheValue()	2022-03-28 14:56:40 +02:00
Veloman Yunkan	e4a4b2f961	Extracted CSS out of no_search_results.html	2022-03-18 15:46:54 +04:00
Nikhil Tanwar	8136138492	use encoded URLs for searchSuggestionHtml Previously, the seachURL was not encoded. This resulted in an XSS vulnerability, a concept of proof is: start kiwix-serve visit - http://192.168.18.1:8081/"><svg onload="alert(1)"> This would display an alert message. This encodes the searchURL before passing it to searchSuggestionHtml	2022-03-09 06:31:24 +01:00
Maneesh P M	6523d9f563	Retrieve SuggestionSearcher from LRU Cache We create a cache for SuggestionSearcher very similar to that of FT searcher. User can specify a custom cache size using the environment variable SUGGESTION_SEARCHER_CACHE_SIZE. It has a default value of 10% of the number of books in the library.	2022-03-08 17:35:39 +01:00
Maneesh P M	7cb4c1361f	Retrieve Searcher and Search from LRU Cache We use the new cache template to implement two kind of cache. 1: The Searcher cache is more general in terms of its usage. A Searcher can be used for multiple searches without much change to itself. We try to retrieve the searcher and perform searches using it whenever possible, and if not we put a searcher into the cache. User can specify a custom cache length by manipulating the environment variable SEARCHER_CACHE_SIZE. It's default value is 10% of all the books available. 2: The search cache is much more restricted in terms of usage. It's main purpose is to avoid re-searching on the searcher during page changes to generate SearchResultSet of various ranges. User can specify a custom cache length using the environment variable SEARCH_CACHE_SIZE with a default value of 2;	2022-03-08 17:35:39 +01:00
Maneesh P M	a51f8d66a7	Introduce a LRU Cache and concurrent cache The cache is copied from libzim project : https://github.com/openzim/libzim The exact file as been copied from commit 27f5e70	2022-03-08 17:34:27 +01:00
Emmanuel Engelhart	4bd02f07eb	Beautify slightly the code	2022-03-05 16:59:15 +01:00
Nikhil Tanwar	9488842416	Add dagbani language in language map Adds dagbani (dag) language in iso639_3 language map	2022-03-05 16:51:59 +01:00
Nikhil Tanwar	34b50ba30e	Add mappings for languages not given by libicu Adds a std::map<std::string, std::string> with display names for language codes not given by libicu Fault language codes are taken from library.kiwix.org	2022-03-05 16:51:59 +01:00
Matthieu Gautier	422f4c7dd7	Reuse constructor when creating the SearchRenderer with basic constructor.	2022-03-04 17:08:59 +01:00
Matthieu Gautier	609bc24cbe	Small cleanups. - Remove unused `archive` - Replace tab by spaces	2022-02-25 15:46:13 +01:00
Matthieu Gautier	d9124ed40b	Set the book title only if we have a library.	2022-02-25 15:46:13 +01:00
Matthieu Gautier	921671eb4d	Do not use ostringstream to convert the uuid into string. `zim::Uuid` already have a string convertion operator. Let's use it.	2022-02-25 15:46:13 +01:00
Matthieu Gautier	ec18eb40ea	Readd a `SearchRenderer` constructor without `Library` argument. Adding the library argument breaks the API. It is better to add another constructor to not have to create another major version.	2022-02-25 15:46:13 +01:00
Veloman Yunkan	ae2d7d20dc	Handling of <dc:issued> in OPDS import	2022-02-23 14:20:49 +01:00
Veloman Yunkan	afb556bf64	Added <dc:issued> field to OPDS entries	2022-02-19 11:35:44 +04:00
Tristan Havelick	58be502f3f	add book titles to search results	2022-02-16 12:50:18 +01:00
Nikhil Tanwar	261adf0ef9	Add method to change MHD_OPTION_PER_IP_CONNECTION_LIMIT Adds new method setIpConnectionLimit() to server. Default is 0 (infinite)	2022-02-05 18:31:42 +05:30
Veloman Yunkan	b8328a78f6	/catalog/search?count=0 returns all entries	2022-01-21 19:31:46 +04:00
Matthieu Gautier	84587e7f03	Add a new private constructor not deprecated for Reader. As we still create a `Reader` in the deprecated code of `Library`, we need a way to create a reader without raising a deprecated warning. So we create a another constructor with a dummy argument and we use it.	2022-01-18 12:22:11 +01:00
Matthieu Gautier	fcd865bb81	Revert removing of deprecated methods used by android wrapper.	2022-01-14 12:28:50 +01:00
Matthieu Gautier	e5eeb08206	Remove old deprecated methods.	2022-01-13 14:23:29 +01:00
Matthieu Gautier	96e0d15ab4	Deprecate `Entry` creation. As the `Entry` is still created by `Reader` we need a way to create a entry without raising a deprecated warning. To do so we create a second constructor with a dummy argument. This second constructor is private and is not marked as deprecated so we can use it.	2022-01-13 14:23:29 +01:00
Matthieu Gautier	39732e2bcf	Deprecate methods on Book. - `update(const Reader& reader)` is replaced by `update(const zim::Archive& archive)` - `getFavicon()` is replaced by `getIllustration(48)->`	2022-01-12 18:07:46 +01:00
Matthieu Gautier	3052d0787a	UrlEncode the `content_id`. The HumanReadableId can contains special char (`&`/`=`/...) As it is used as to create a url in the opds template, we must url encode it. - We don't need to encode the book id as it is a uuid, it never contains special char. - We don't need to encode the book url as it is read from the library and the url must already be correctly encoded in the library.xml. (tests modified accordingly)	2022-01-11 17:53:29 +01:00
Matthieu Gautier	0112e6102d	Remove the meta endpoint in the server. Now we have `/raw` and `catalog/v2/illustration` endpoints we don't need to keep the meta endpoint.	2022-01-10 13:13:27 +01:00
Nikhil Tanwar	3dbcbe542b	Add tests for kiwix::fileExists and kiwix::fileReadable	2022-01-10 00:18:44 +05:30
Nikhil Tanwar	854058f842	Introduce kiwix::fileReadable kiwix::fileExists only checks for file existence now kiwix::fileReadable will check if the file is readable (implicitly checking for file existence also)	2022-01-05 20:16:38 +05:30
Matthieu Gautier	dc15a9a824	Add `raw` endpoint. As the name suggests it, this endpoint is not smart : It returns the content as it is and only if it is present (no compatibility or whatever). The only "smart" thing is to return a redirect if the entry is a redirect.	2022-01-05 15:12:41 +01:00
Matthieu Gautier	160a74f5f8	Extend ItemResponse and ContentResponse to return raw content.	2022-01-05 15:12:41 +01:00
Matthieu Gautier	6f1799db9f	Use the new endpoint in the OPDS stream.	2022-01-04 14:16:46 +01:00
Matthieu Gautier	e108fb0e47	Add `/catalog/v2/illustration` endpoint	2022-01-04 14:16:46 +01:00
Matthieu Gautier	9482bfb95b	Add a method to get the a book illustration for a specific size.	2022-01-04 14:16:46 +01:00
Matthieu Gautier	66c40817ee	Fix the OPDS stream to handle custom ROOT prefix As we render the entry's xml in a separated steps, we need to pass the rootLocation to all the internal rendering. Testing with and without root is not so easy. I've simply made all server tests using a ROOT prefix. We can assume that if the ROOT is present everywhere we need it, it will not when we don't need. (As long as we don't hardcode "ROOT" in the server.)	2022-01-04 11:15:18 +01:00
Matthieu Gautier	22e5327dcf	Do not create a dummy illustration if library.xml doesn't contain one. Fix #644	2022-01-04 11:12:32 +01:00
Nikhil Tanwar	8bdcb90818	Make aria2 secret a random value Apps using this service will not have a default aria secret (previously 'kiwixariarpc')	2022-01-03 09:35:04 +01:00
Emmanuel Engelhart	f36d8e9851	New kiwix::getVersions() and printVersions()	2022-01-02 12:22:11 +01:00
Matthieu Gautier	f1035fa472	Fix win32 compilation. WSASocket return a `INVALID_SOCKET` if something goes wrong, not SOCKET_ERROR.	2021-12-23 18:32:43 +01:00
Nikhil Tanwar	9554ab5db0	Make getNetworkInterfaces() and getBestPublicIp() available via tools.h Remove HTTP URL helper line - should be done in kiwix-serve Add getters at server level - getAddress and getPort	2021-12-22 22:38:16 +05:30
Nikhil Tanwar	4b563e567e	Provide HTTP URL for the server Added a line to display the IP (use best if nothing is provided) along with port.	2021-12-22 22:08:25 +05:30
Veloman Yunkan	ed2f914e10	Minor cleanup The code for obtaining the archive now looks the same for the /meta, /suggest, /search and /random endpoints.	2021-12-22 17:12:34 +01:00
Veloman Yunkan	872ddd9cb3	Cleaned up InternalServer::handle_suggest() As a result of this clean-up the /suggest endpoint too stopped generating confusing 404 Not Found errors (which, like in /meta's case is not that important). Another functional change is that the "term" parameter became optional.	2021-12-22 17:12:34 +01:00
Veloman Yunkan	20b5a2b971	Less confusing 404 errors from /meta endpoint Before this fix the /meta endpoint could return a 404 Not Found page saying The requested URL "/meta" was not found on this server. Error cases producing such a result were: - `/meta?content=NON-EXISTING-BOOK&name=metaname` - `/meta?content=book&name=BAD-META-NAME` Now a proper message is shown for each of those cases. This fix is being done just for consistency (the /meta endpoint is not a user-facing one and the scripts don't bother about error texts).	2021-12-22 17:12:34 +01:00
Veloman Yunkan	d8c525289b	Changed the signature of Response::build_404() Now Response::build_404() takes the URL instead of the entire RequestContext object. An empty url suppresses the The requested URL "url" was not found on this server. part of the error text.	2021-12-22 17:12:34 +01:00
Veloman Yunkan	f7b853373c	Less confusing 404 errors from /random endpoint Before this fix the /random endpoint could return a 404 Not Found page saying The requested URL "/random" was not found on this server. Error cases producing such a result were: - `/random?content=NON-EXISTING-BOOK` (can happen when a server is restarted or the library is reloaded and the current book is no longer available). - Failure of the libkiwix routine for picking a random article. Now a proper message is shown for each of those cases.	2021-12-22 17:12:34 +01:00
Veloman Yunkan	250f46c7f9	fixup! Searcher::add_reader() rejects duplicate readers	2021-12-16 16:51:03 +01:00
Veloman Yunkan	0be00b791f	Searcher::add_reader() rejects duplicate readers A O(N) linear search was added to `Searcher::add_reader()` deliberately. This doesn't seem to be an operation that may lead to performance problems.	2021-12-16 16:51:03 +01:00
Emmanuel Engelhart	9f3459f3f3	Better libkiwix version variable name	2021-12-13 18:22:40 +01:00
Veloman Yunkan	e1db9164c8	Fixed deadlock in Library::writeBookmarksToFile()	2021-12-05 20:31:21 +04:00
Veloman Yunkan	7161db8e2a	Manager::reload() also removes books from Library	2021-11-30 18:20:27 +04:00
Veloman Yunkan	262e13845c	Enter Library::removeBooksNotUpdatedSince()	2021-11-30 18:20:27 +04:00
Veloman Yunkan	1d5383435d	Noted a potential bug in Library::addBook()	2021-11-30 18:20:27 +04:00
Veloman Yunkan	ad2eb52553	Thread safe dumping of the OPDS feed	2021-11-30 18:20:27 +04:00
Veloman Yunkan	473d2d2a69	Introduced Library::getBookByIdThreadSafe()	2021-11-30 18:20:27 +04:00
Veloman Yunkan	02b9e32d18	Library became almost thread-safe Library became thread-safe with the exception of `getBookById()` and `getBookByPath()` methods - thread safety in those accessors is rendered meaningless by their return type (they return a reference to a book which can be removed any time later by another thread).	2021-11-30 18:20:27 +04:00
Veloman Yunkan	c2927ce6f7	Library got a yet unused mutex Introducing a mutex in `Library` necessitates manually implementing the move constructor and assignment operator. It's better to still delegate that work to the compiler to eliminate any possibility of bugs when new data members are added to `Library`. The trick is to move the data into an auxiliary class `LibraryBase` and derive `Library` from it.	2021-11-30 18:20:27 +04:00
Veloman Yunkan	b712c732f2	Dropped Library::getBookBy*() non-const functions	2021-11-30 18:20:27 +04:00
Veloman Yunkan	298247ca9b	Renamed NameMapperProxy -> UpdatableNameMapper	2021-11-30 18:20:27 +04:00
Veloman Yunkan	3aeeeeee76	Manager::reload()	2021-11-30 18:20:27 +04:00
Veloman Yunkan	226dac2604	LibraryManipulator is now merely a notifier Originally `LibraryManipulator` was an abstract class completely decoupled from `Library`. Its `addBookToLibrary()` and `addBookmarkToLibrary()` methods could be defined in an arbitrary way. Now `LibraryManipulator` has to be bound to a library object, those methods are no longer virtual, they always update the library and allow for some additional actions via virtual functions `bookWasAddedToLibrary()` and `bookmarkWasAddedToLibrary()`.	2021-11-30 18:20:27 +04:00
Veloman Yunkan	76a5e3a877	Library::addBook() updates the reader cache	2021-11-30 18:20:27 +04:00
Veloman Yunkan	6199c11505	NameMapperProxy respects the withAlias flag	2021-11-30 18:18:16 +04:00
Veloman Yunkan	8fffa59974	Added NameMapperProxy from kiwix/kiwix-desktop#714 The right place for NameMapperProxy introduced by kiwix/kiwix-desktop#714 is in libkiwix (so that it can be reused in kiwix-serve).	2021-11-30 18:18:16 +04:00
Veloman Yunkan	5f3c34ed93	NameMapper's API is now const	2021-11-22 21:06:27 +04:00
Veloman Yunkan	339f845fb0	Bugfix in Book::getHumanReadableIdFromPath()	2021-11-22 20:54:44 +04:00

... 2 3 4 5 6 ...

1313 Commits