Commit Graph

313 Commits

Author SHA1 Message Date
d9a0985feb
Extend list of uncertainty prefixes for places with uppercase variants 2023-11-27 01:15:35 +01:00
b36a504277
Add blacklist for unwanted rewrites in consolidating place names 2023-11-26 23:55:43 +01:00
e610723107
Add functions for automatic rewriting of country names to brackets at
the end of place names based on lists
2023-11-26 00:54:14 +01:00
f6409322e5
Add classes for writing consolidating spellings of actor and place names 2023-11-25 22:42:07 +01:00
61e83022ae
Extend uncertainty indicator lists with Ukrainian terms
(thanks Ekaterina)
2023-11-25 13:33:50 +01:00
3d58ce3edf
Add Ukrainian terms 2023-11-24 12:52:20 +01:00
a33c354ad6
Conform to stricter typing rules 2023-11-21 22:01:28 +01:00
d6c514c208
Add functions to check for actor IDs by name while including their life dates 2023-11-20 13:43:05 +01:00
4496a35f5c
Rewrite incomplete time span spellings to extend parsable and splittable time names 2023-11-20 03:18:02 +01:00
78d5137b96
Add Ukrainian uncertainty indicators 2023-11-19 03:53:05 +01:00
a102758606
Extend tests to ensure "vermutl." is included in place uncertainty
indicators
2023-11-18 01:36:01 +01:00
700fefd28c
Add "wahrscheinlich" to list of uncertainty indicators for places 2023-11-17 18:32:41 +01:00
4582f6a697
Fix another edge case in time splitter 2023-11-14 03:32:17 +01:00
7ef05a55c5
Migrate PHPunit config to PHPUnit 10's requirements 2023-11-14 03:28:24 +01:00
54a30e683e
Add class for loading info from distinctly_typed_strings table 2023-11-13 00:11:56 +01:00
c9b0e7085f
Add coverage information to tests, fix coverage of NodaValidationHelper 2023-11-07 23:31:42 +01:00
1a7dbcd6f6
Fix edge cases in time splitter where inputs start with many digits but
are not dates
2023-11-07 00:27:20 +01:00
93c0ff3fa0
Set beStrictAboutOutputDuringTests=true in phpunit.xml 2023-11-06 23:50:12 +01:00
631debcfd8
Add autoloader for tests, phpunit config 2023-11-06 23:46:30 +01:00
53c645b132
Add "vermutl." to list of uncertainty indicators 2023-10-28 21:17:54 +02:00
95de1615ef
Identify, parse and remove some more uncertainty indicators 2023-10-27 19:06:08 +02:00
bbbc84015b
Fix handling of misassigned lcsh / loc links in NodaWikidataFetcher 2023-10-18 02:46:11 +02:00
d55361e29b
Add function to check if a time name is blacklisted 2023-10-18 01:54:40 +02:00
37715bc3e8
Support BCE / CE times 2023-10-15 19:20:16 +02:00
9942c58b12
Improve parsing of LOC / LCSH from Wikidata 2023-09-29 16:20:53 +02:00
0a18449e06
Re-enable infix length in search indexes 2023-09-17 10:59:01 +02:00
efc67b57d3
Remove infix length, increase memory consumed by search indexes 2023-09-17 00:06:42 +02:00
835da05c38
Use wikidata description as fallback if wikipedia description is not
parsable in Wikidata fetcher

Close #16
2023-09-01 12:43:24 +02:00
12a7937218
Comment out debugging lines in NodaWikidataFetcher 2023-08-31 16:11:37 +02:00
a68a03e628
Improve wikidata fetcher 2023-08-31 16:09:21 +02:00
107a4cd640
Improve NodaWikidataFetcher's loading of descriptions
Close #15
2023-08-31 15:38:12 +02:00
869e0f263d Merge branch 'master' of gitea:museum-digital/MDNodaHelpers 2023-08-31 03:29:23 +02:00
0b5d5bdd12
Add functions for getting main synonym in list of synonyms 2023-08-31 03:29:04 +02:00
98f89762ff
Add functions for getting main synonym in list of synonyms 2023-08-31 03:28:38 +02:00
05fb965d8c
Add class NodaLinkedEntityGetter for getting linked entries 2023-08-30 17:39:25 +02:00
2720adf9ed
Limit linking norm data repositories via NodaBatchInserter to those
applicable for a given target vocabulary
2023-08-29 20:14:15 +02:00
67f7bf9fab
Add new functions for linking norm data repositories in batch and use
them in Wikidata fetcher
2023-08-29 17:32:22 +02:00
f27d0900ae
Further modularize syncing of tags with fulltext search index 2023-08-15 15:55:48 +02:00
cb6d0d7b06
Add class NodaDbAdmin 2023-08-15 14:42:07 +02:00
831dbca091
Fix indentation in comment 2023-05-24 03:38:22 +02:00
5906ddd97a
Add additional disallowed time names 2023-04-27 17:44:34 +02:00
574c9cf005
Add "o. D." (with spaces) to list of banned time terms 2023-04-17 18:41:43 +02:00
b6a5b44103
Add "vermutlich um" to list of uncertainty prefixes for time 2023-04-17 00:45:06 +02:00
838a991256
Except new class MDNodaLink for parameter in NodaIDGetter 2023-04-16 02:08:14 +02:00
d63f811367
Add "Ca. " as an uncertainty prefix for times 2023-04-14 22:56:55 +02:00
c5a7a62eb0
Add "Vermutlich" as an uncertainty indicator for places 2023-03-24 16:03:27 +01:00
b6d229eed9
Add functions for logging to import log 2023-03-01 11:43:01 +01:00
446c5d26f4
Extend uncertainty helper with more terms 2023-02-01 15:01:24 +01:00
6d40ae4c83
Fix bug in generating Indonesian date names and add Ukrainian as a
language for autogenerating time translations
2022-12-11 17:18:12 +01:00
d0e11c323e
Further modularize fetching of translations, add new class
NodaBatchInserter for batch inserting translations
2022-11-18 00:26:23 +01:00