This website requires JavaScript.
3409ec7afe
Begin adding autotranslation language CRH / Crimean Tatar
master
Joshua Ramon Enslin
2025-02-18 17:51:36 +01:00
27ac3f255a
Minor typing improvements
Joshua Ramon Enslin
2025-02-15 13:36:50 +01:00
9d7d53a858
Disallow fetching from Wikidata disambiguation pages
Joshua Ramon Enslin
2025-02-13 22:37:17 +01:00
28f6db67ff
Disable XML error warnings when parsing unclean inputs from Wikidata
Joshua Ramon Enslin
2025-02-13 21:48:07 +01:00
2f3bc5f2fa
Prefer wikipedia page titles over wikidata labels
Joshua Ramon Enslin
2025-02-13 21:38:13 +01:00
39362f537a
Merge branch 'master' of gitea:museum-digital/MDNodaHelpers
Joshua Ramon Enslin
2025-02-13 17:19:43 +01:00
de0357473a
Make constant for test language in NodaWikidataFetcherTest public, allowing reuse
Joshua Ramon Enslin
2025-02-13 17:19:06 +01:00
ef43270fb2
Map suffixes material and technique to their respective tag relation types
Joshua Ramon Enslin
2025-02-13 14:04:38 +01:00
338e09f001
Add kannada to list of languages fetched from wikidata
Joshua Ramon Enslin
2025-02-13 13:10:45 +01:00
4cf9eaf4fa
Remove superfluous params passed to function
Joshua Ramon Enslin
2025-02-13 13:10:30 +01:00
18438251a7
Add functions for getting IDs by any translated entry irrespective of the language
Joshua Ramon Enslin
2025-02-12 17:15:19 +01:00
1cf0f9858a
Add tests for loading translations in NodaWikidataFetcher
Joshua Ramon Enslin
2025-02-12 16:02:04 +01:00
1d50027809
Make function getWikidataEntity public
Joshua Ramon Enslin
2025-02-12 15:48:52 +01:00
d1cee17ef5
Add Telugu to list of languages to fetch in Wikidata fetcher
Joshua Ramon Enslin
2025-02-12 12:47:02 +01:00
baf7905e0b
Map gender Q207959
Joshua Ramon Enslin
2025-02-03 09:41:16 +01:00
9bf14d7d91
Add search function for getting entries in NodaIDGetter across vocabs
Joshua Ramon Enslin
2025-01-31 23:25:40 +01:00
a621534136
Update NodaBlacklistedTerms
Joshua Ramon Enslin
2025-01-24 13:45:28 +01:00
51fe9a5e45
Cover more edge cases for splitting time names
Joshua Ramon Enslin
2025-01-15 11:49:20 +01:00
9c2eaa2929
Allow splitting 1945-48
Joshua Ramon Enslin
2025-01-15 10:35:35 +01:00
546c17031a
Make NodaImportLogger more resilient, prevent error in case of duplicate import names
Joshua Ramon Enslin
2024-12-12 12:43:11 +01:00
bf22f5541d
Retrieve "displayed subject" relationship from suffix "<Motiv>", "[Motiv]"
Joshua Ramon Enslin
2024-12-03 16:07:41 +01:00
e036d7881a
Add missing strict typing in function params
Joshua Ramon Enslin
2024-12-01 22:11:17 +01:00
d8db941485
Disallow tags of name "Nichtmünzliches" (de)
Joshua Ramon Enslin
2024-11-24 16:08:14 +01:00
b7bb7364d4
Ensure duplicate time names can be parsed in NodaTimeSplitter (e.g. 1.1.2024-1.1.2024)
Joshua Ramon Enslin
2024-11-20 10:02:10 +01:00
4dcd93b947
Better validate input JSON fetched from Wikipedia
Joshua Ramon Enslin
2024-11-12 15:36:32 +01:00
c72ad51dda
Merge branch 'master' of gitea:museum-digital/MDNodaHelpers
Joshua Ramon Enslin
2024-11-11 09:11:35 +01:00
d6dea3e280
Remove use of SESSION in NodaWikidataFetcher
Joshua Ramon Enslin
2024-11-11 09:11:15 +01:00
6f7ad13c4e
Add class NodaTagRelationIdentifier for parsing tag relation types from input tag names
Joshua Ramon Enslin
2024-11-09 19:44:09 +01:00
48355a6a36
Identify uncertainty before brackets ("Berlin ? (Germany)" > "Berlin (Germany)" + Uncertain)
Joshua Ramon Enslin
2024-11-09 18:42:18 +01:00
7cfe752c94
Handle commas when guessing time certainty
Joshua Ramon Enslin
2024-11-09 15:40:27 +01:00
29ca05f552
Properly handle commas at the end of names when guessing certainty
Joshua Ramon Enslin
2024-11-09 15:33:49 +01:00
eb371d4270
Ensure times can be split despite spaces at random points in given name
Joshua Ramon Enslin
2024-10-23 18:02:23 +02:00
16f36c0852
Improve test coverage
Joshua Ramon Enslin
2024-10-10 14:32:55 +02:00
669a8a1459
Add tests for lookup functions by vocabulary references
Joshua Ramon Enslin
2024-10-10 14:16:52 +02:00
a9c506497c
Respect diacritics when looking up tag, actor, .. IDs
Joshua Ramon Enslin
2024-10-10 09:51:28 +02:00
06f13c1a71
Add functions for loading only norm data links from Wikidata for places + actors
Joshua Ramon Enslin
2024-10-03 16:36:30 +02:00
cd49f194f2
Refactor wikidata fetcher
Joshua Ramon Enslin
2024-10-03 15:56:31 +02:00
9b63a4d95d
Refactor parsing of norm data links from Wikidata into a dedicated function
Joshua Ramon Enslin
2024-10-03 15:03:38 +02:00
96ba020514
Add function for getting actors' names including life dates in batch
Joshua Ramon Enslin
2024-09-28 22:45:12 +02:00
c650e57eda
Remove references to zoom factor for places in Wikidata fetcher
Joshua Ramon Enslin
2024-09-25 15:42:59 +02:00
dea09b17cd
Add safeguard against question marks entering NodaSplitTime
Joshua Ramon Enslin
2024-08-02 03:41:51 +02:00
cc0997f412
Add direct validation function for noda mail checker
Joshua Ramon Enslin
2024-07-28 03:47:35 +02:00
f18e4c3edc
Make class constant public
Joshua Ramon Enslin
2024-07-19 00:59:21 +02:00
f220a77ad7
Remove linking to wikipedia as a noda repository in wikidata fetcher
Joshua Ramon Enslin
2024-07-19 00:49:04 +02:00
58d3569718
Fix edge case 200 b.c. in NodaSplitTime
Joshua Ramon Enslin
2024-07-08 01:27:03 +02:00
27528c9cf7
Ran phpcbf over code
Joshua Ramon Enslin
2024-07-08 00:48:50 +02:00
205e77da0e
Remove group members before deleting group
Joshua Ramon Enslin
2024-05-27 03:13:03 +02:00
f36938b8dd
Add functions for updating / deleting groups
Joshua Ramon Enslin
2024-05-27 02:44:28 +02:00
cfa9cee60d
Add additional option for logging in nodac
Joshua Ramon Enslin
2024-05-23 22:58:42 +02:00
83a557b989
Add additional assertion to NodaTimeSplitterTest for more thorough type safety
Joshua Ramon Enslin
2024-05-05 22:54:33 +02:00
7f342ed3c4
Merge branch 'master' of gitea:museum-digital/MDNodaHelpers
Joshua Ramon Enslin
2024-05-04 01:19:23 +02:00
7d303e219f
Fix broken time splitting for year ranges BC
Joshua Ramon Enslin
2024-05-04 01:17:33 +02:00
ce480f8b9f
Add validator for tag descriptions
Joshua Ramon Enslin
2024-05-01 17:45:07 +02:00
eb14615917
Return start and end date when attempting to split time spans
Joshua Ramon Enslin
2024-04-17 00:02:31 +02:00
bd775bec45
Refactor time splitter, support computing of dates for time entries
Joshua Ramon Enslin
2024-04-16 23:21:34 +02:00
2cdfa2e948
Merge branch 'master' of gitea:museum-digital/MDNodaHelpers
Joshua Ramon Enslin
2024-03-24 02:17:22 +01:00
81a7d64e27
Handle Ukrainian year names (2022 p > 2022)
Joshua Ramon Enslin
2024-03-24 00:59:30 +01:00
6af51323e7
Categorize gender Q2449503
Joshua Ramon Enslin
2024-03-09 22:49:38 +01:00
3c43a3f2d3
Return an integer when checking time ID by stored rewrite
Joshua Ramon Enslin
2024-01-30 00:25:50 +01:00
09518a0a6e
Add function for getting time ID by stored rewrite
Joshua Ramon Enslin
2024-01-30 00:07:31 +01:00
93f8f13e62
Blacklist tag aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
Joshua Ramon Enslin
2024-01-15 01:46:21 +01:00
8a8f55b38c
Extend NodaGroup to allow adding new ones and linking vocabulary entries to one
Joshua Ramon Enslin
2023-12-12 04:29:23 +01:00
f3831965a3
Extend NodaTimeSplitter
Joshua Ramon Enslin
2023-12-09 23:54:45 +01:00
4a49c7a4e7
Fix erroneous splitting of Hungarian time names with additions
Joshua Ramon Enslin
2023-12-09 11:56:15 +01:00
3e9f675fdc
Add Slovak to list of languages to load
Joshua Ramon Enslin
2023-12-08 16:42:24 +01:00
2ab0e75111
Fix erroneous splitting of "YYYY MMD" in Hungarian
Joshua Ramon Enslin
2023-12-08 12:26:59 +01:00
27e259072c
Add new class NodaGroup
Joshua Ramon Enslin
2023-12-07 23:30:24 +01:00
50cb33720b
Allow updating and getting names of vocabulary groups
Joshua Ramon Enslin
2023-12-07 19:07:11 +01:00
a677c605c5
Use MDMysqli's default blend modes for manticore index setup
Joshua Ramon Enslin
2023-12-05 22:10:18 +01:00
681002d844
Extend lists for Ukrainian place hierarchy indicators
Joshua Ramon Enslin
2023-12-04 19:22:50 +01:00
40cf5a5112
Set expansion limit, blend_mode in manticore tables
Joshua Ramon Enslin
2023-12-04 18:42:40 +01:00
b14f2e14eb
Add further indicator for village-level places in Ukrainian
Joshua Ramon Enslin
2023-12-03 22:07:32 +01:00
55931ba3ef
Cover more levels of Ukrainian (current and historical) place hierarchies
Joshua Ramon Enslin
2023-12-02 16:32:46 +01:00
2badc67405
Add rewriting for Ukrainian place names based on specified hierarchies
Joshua Ramon Enslin
2023-12-02 15:21:02 +01:00
b4c941f441
Use MD_STD::json_encode over general \json_encode()
Joshua Ramon Enslin
2023-11-27 01:33:53 +01:00
d9a0985feb
Extend list of uncertainty prefixes for places with uppercase variants
Joshua Ramon Enslin
2023-11-27 01:15:35 +01:00
b36a504277
Add blacklist for unwanted rewrites in consolidating place names
Joshua Ramon Enslin
2023-11-26 23:55:22 +01:00
e610723107
Add functions for automatic rewriting of country names to brackets at the end of place names based on lists
Joshua Ramon Enslin
2023-11-26 00:54:14 +01:00
f6409322e5
Add classes for writing consolidating spellings of actor and place names
Joshua Ramon Enslin
2023-11-25 22:42:07 +01:00
61e83022ae
Extend uncertainty indicator lists with Ukrainian terms
Joshua Ramon Enslin
2023-11-25 13:33:25 +01:00
3d58ce3edf
Add Ukrainian terms
Joshua Ramon Enslin
2023-11-24 12:52:20 +01:00
a33c354ad6
Conform to stricter typing rules
Joshua Ramon Enslin
2023-11-21 22:01:28 +01:00
d6c514c208
Add functions to check for actor IDs by name while including their life dates
Joshua Ramon Enslin
2023-11-20 13:43:05 +01:00
4496a35f5c
Rewrite incomplete time span spellings to extend parsable and splittable time names
Joshua Ramon Enslin
2023-11-20 03:18:02 +01:00
78d5137b96
Add Ukrainian uncertainty indicators
Joshua Ramon Enslin
2023-11-19 03:53:05 +01:00
a102758606
Extend tests to ensure "vermutl." is included in place uncertainty indicators
Joshua Ramon Enslin
2023-11-18 01:36:01 +01:00
700fefd28c
Add "wahrscheinlich" to list of uncertainty indicators for places
Joshua Ramon Enslin
2023-11-17 18:32:23 +01:00
4582f6a697
Fix another edge case in time splitter
Joshua Ramon Enslin
2023-11-14 03:32:17 +01:00
7ef05a55c5
Migrate PHPunit config to PHPUnit 10's requirements
Joshua Ramon Enslin
2023-11-14 03:28:24 +01:00
54a30e683e
Add class for loading info from distinctly_typed_strings table
Joshua Ramon Enslin
2023-11-13 00:11:56 +01:00
c9b0e7085f
Add coverage information to tests, fix coverage of NodaValidationHelper
Joshua Ramon Enslin
2023-11-07 23:31:42 +01:00
1a7dbcd6f6
Fix edge cases in time splitter where inputs start with many digits but are not dates
Joshua Ramon Enslin
2023-11-07 00:27:20 +01:00
93c0ff3fa0
Set beStrictAboutOutputDuringTests=true in phpunit.xml
Joshua Ramon Enslin
2023-11-06 23:50:12 +01:00
631debcfd8
Add autoloader for tests, phpunit config
Joshua Ramon Enslin
2023-11-06 23:46:30 +01:00
53c645b132
Add "vermutl." to list of uncertainty indicators
Joshua Ramon Enslin
2023-10-28 21:17:54 +02:00
95de1615ef
Identify, parse and remove some more uncertainty indicators
Joshua Ramon Enslin
2023-10-27 19:06:08 +02:00
bbbc84015b
Fix handling of misassigned lcsh / loc links in NodaWikidataFetcher
Joshua Ramon Enslin
2023-10-18 02:46:11 +02:00
d55361e29b
Add function to check if a time name is blacklisted
Joshua Ramon Enslin
2023-10-18 01:54:40 +02:00
37715bc3e8
Support BCE / CE times
Joshua Ramon Enslin
2023-10-15 19:20:16 +02:00
9942c58b12
Improve parsing of LOC / LCSH from Wikidata
Joshua Ramon Enslin
2023-09-29 16:20:53 +02:00