Commit Graph

31 Commits

Author SHA1 Message Date
jygaulier
b4066c6362 PHRAS-3645_candidates-special-chars (#4046)
PHRAS-3645 : fix : thesaurus indexation _before_ fetch record (prevents double-candidates)
PHRAS-3645 : fix : <term (context)> parsing in php (fix first special char)
2022-06-03 12:31:21 +02:00
aynsix
09ec714009 PHRAS-3285 fix stock cterm 2022-01-18 18:40:51 +03:00
jygaulier
00979f2d6d PHRAS-3389_use-only-conceptpaths-from-selected-dbs_MASTER
fix : fixed [] search (thesaurus) caused by bad thesaurus hydration during record indexing
2021-04-12 15:57:44 +02:00
jygaulier
63bee24775 PHRAS-3389_use-only-conceptpaths-from-selected-dbs_MASTER
fix : search only concept-paths from the relevant thesaurus
fix : search only fields from the relevant databoxes (collections)
2021-04-08 18:15:27 +02:00
jygaulier
e36b0785ee - add some simple cache
- disable check of conf/searchengine/type
- faster construction of concept-paths
- pass strucure to es as factory (useless, to be reverted)
- add stopwatch (log into file, disabled for now)
2020-10-30 11:00:17 +01:00
Jean-Yves Gaulier
da532cc36a PHRAS-2748_bad-chars-cterms_4.1
fix : remove bad ascii chars (0...31 except tab; cr; lf) from cterms values
todo ? : also cleanup bad chars from prod/editing and api/editing ?
2019-11-05 17:59:03 +01:00
Thibaud Fabre
b5bbb1851f Merge branch 4.0 2016-12-05 15:26:48 +01:00
Thibaud Fabre
4acd20fa80 Revert "Revert "PHRAS-714_thesaurus_indexation"" 2016-10-18 21:17:10 +02:00
Thibaud Fabre
ba8be47423 Revert "PHRAS-714_thesaurus_indexation" 2016-10-18 20:15:23 +02:00
Jean-Yves Gaulier
cf40c99ea7 PHRAS-714_thesaurus_indexation
- new : if the thesaurus has been modified since it's last indexation, it will be re-indexed as soon a record is to be indexed.
 - nb : after thesaurus modification, the databox MUST be manually re-indexed (which will reindex the thesaurus before)
 - todo : remove useless dialog boxes from thesaurus app (... "this term was..."  "...add with/out reindexing ?..."). To be done in the future app using future routes.
2016-10-12 18:42:16 +02:00
Benoît Burnichon
b8a506d3b2 Improve readability of Thesaurus Helper 2016-03-15 17:31:01 +01:00
Jean-Yves Gaulier
2efa57e4be PHRAS-715 #time 6h
add databox_id to filter
2015-10-08 20:18:19 +02:00
Mathieu Darse
393e4c62c1 Test thesaurus Concept 2015-07-03 16:57:33 +02:00
Mathieu Darse
24bcdba635 Handle field root concepts (prefixes) on indexing 2015-04-29 20:42:51 +02:00
Mathieu Darse
f7eea7f63d Static method dispatch in Navigator 2015-04-28 12:08:45 +02:00
Mathieu Darse
c6075fcc1a Thesaurus prefixes in field structure
Also fixes candidates collected from all string fields
2015-04-22 20:46:37 +02:00
Mathieu Darse
470804659b Use a single transliterator to do ASCII folding
Move in string utilities too.
2015-03-30 14:46:43 +02:00
Nicolas Le Goff
0da5d16aeb Merge pull request #1294 from mdarse/candidates-empty
Handle case when cterms XML doesn't exist yet
2015-03-12 17:12:39 +01:00
Mathieu Darse
fc3c81f350 Handle case when cterms XML doesn't exist yet 2015-03-12 16:41:25 +01:00
Mathieu Darse
3974e24888 Fix indexing issue after first candidate term in a new field 2015-03-11 17:10:20 +01:00
Mathieu Darse
89c23ec4ae Resolve indexing performance issues with candidate terms 2015-03-10 14:53:32 +01:00
Mathieu Darse
ac42daa062 Fill candidate terms while indexing 2015-03-05 14:49:26 +01:00
Mathieu Darse
c917c7f952 Thesaurus matching while indexing records
- Add a new hydrator to ask query thesaurus on the fly
- Add a filtering system on thesaurus
- And a databox filter friend
2015-03-03 18:50:34 +01:00
Mathieu Darse
e420ad2997 Fix thesaurus indexing crash when XML contains spaces 2015-02-26 19:10:55 +01:00
Mathieu Darse
2ac931c66f Better context dump 2015-02-25 12:22:53 +01:00
Mathieu Darse
9dc653c543 Move bulk operation into indexer namespace 2015-02-18 12:19:10 +01:00
Mathieu Darse
9560628037 Prune narrow concepts while building query 2015-01-20 18:28:22 +01:00
Mathieu Darse
dc2c9f8c7f Refactor thesaurus query build
- Look for text nodes and infer the concepts behind term using thesaurus
- Use value objects for thesaurus terms and concepts
- Pass a QueryContext holding allowed fields and locales informations when building the Elasticsearch query
- Change type hinting and name of query building method on nodes
- Remove unused method Node#isFullTextOnly()
- Move getFieldsStructure from RecordIndexer to RecordHelper for reusing field structure in SearchEngine
2015-01-20 15:05:15 +01:00
Mathieu Darse
bc22dfcd6d WIP Thesaurus terms linking 2014-09-22 10:17:53 +02:00
Mathieu Darse
c3e2c25787 Thesaurus terms indexing 2014-09-16 16:13:17 +02:00
Mathieu Darse
e34e4ce255 Thesaurus flattening using hierarchical visitor 2014-09-15 19:11:45 +02:00