This website requires JavaScript.
8a31cf216e
Add shortened 100x A to list of blacklisted tags
master
Joshua Ramon Enslin
2025-05-22 16:25:27 +02:00
ff474341ed
Add iconclass terms BB, CC, DD, to blacklist
Joshua Ramon Enslin
2025-05-08 16:18:05 +02:00
1051e10732
Prevent ambigious splitting of [0-9]{4}-[0-9]{2}
Joshua Ramon Enslin
2025-05-06 22:32:00 +02:00
057cac0f1b
Ensure 1903/1904 cannot be split
Joshua Ramon Enslin
2025-05-05 17:05:47 +02:00
0053fbe030
Support splitting times like "1. Hälfte des 19. Jahrhunderts"
Joshua Ramon Enslin
2025-04-28 17:00:32 +02:00
7a2856ffad
Split times in more cases (300-20 BC, 300-4000 CE)
Joshua Ramon Enslin
2025-04-08 15:16:25 +02:00
00638152cf
Prevent splitting of non-existing exact dates (e.g. 31.04.XXXX)
Joshua Ramon Enslin
2025-04-08 03:48:04 +02:00
dba60dbce6
Fix order of split days and months within a single year BCE
Joshua Ramon Enslin
2025-04-07 18:32:14 +02:00
f84fe1bca5
Fix type error / reference to values now not consistently existing anymore
Joshua Ramon Enslin
2025-04-06 22:56:36 +02:00
423959ac94
Stop early if autotranslation cannot proceed after validation
Joshua Ramon Enslin
2025-04-05 00:11:03 +02:00
e8edb4a459
Time splitter: Handle first/second half
Joshua Ramon Enslin
2025-04-05 00:09:39 +02:00
8491b62a83
Validate against time errors in autogenerating translations for times
Joshua Ramon Enslin
2025-04-04 20:03:59 +02:00
bb2b1c2c32
Update NodaGroup
Joshua Ramon Enslin
2025-03-13 00:30:33 +01:00
5054d3c62f
Use more rigurous trimming in NodaConsolidatedNamesForPersinst
Joshua Ramon Enslin
2025-03-10 04:18:00 +01:00
beba838c0d
Correctly handle multibype hyphens in XXXX-XXXX
Joshua Ramon Enslin
2025-03-10 04:13:59 +01:00
54dd958073
See before
Joshua Ramon Enslin
2025-03-10 04:05:00 +01:00
5b99304b5c
Accept an additional type of hyphen / dash in time splitting
Joshua Ramon Enslin
2025-03-10 03:58:45 +01:00
5cce98f15b
Extend tests
Joshua Ramon Enslin
2025-03-10 03:20:46 +01:00
5036c77f32
Extend test for getting actor ID by life dates + name
Joshua Ramon Enslin
2025-03-10 02:18:28 +01:00
e95415be8f
Add test for getting actor ID by name with life dates
Joshua Ramon Enslin
2025-03-10 01:48:09 +01:00
5192781494
Use Wikipedia API for getting descriptions from Wikipedia rather than parsing HTML in Wikidata fetcher
Joshua Ramon Enslin
2025-03-09 02:08:26 +01:00
d9d9f7fcdc
Continue refactoring tests for time splitter to run provider-based
Joshua Ramon Enslin
2025-02-24 14:02:42 +01:00
dbfa0df17f
Begin restructuring NodaTimeSplitterTest to use data providers
Joshua Ramon Enslin
2025-02-21 10:31:21 +01:00
3409ec7afe
Begin adding autotranslation language CRH / Crimean Tatar
Joshua Ramon Enslin
2025-02-18 17:51:36 +01:00
27ac3f255a
Minor typing improvements
Joshua Ramon Enslin
2025-02-15 13:36:50 +01:00
9d7d53a858
Disallow fetching from Wikidata disambiguation pages
Joshua Ramon Enslin
2025-02-13 22:37:17 +01:00
28f6db67ff
Disable XML error warnings when parsing unclean inputs from Wikidata
Joshua Ramon Enslin
2025-02-13 21:48:07 +01:00
2f3bc5f2fa
Prefer wikipedia page titles over wikidata labels
Joshua Ramon Enslin
2025-02-13 21:38:13 +01:00
39362f537a
Merge branch 'master' of gitea:museum-digital/MDNodaHelpers
Joshua Ramon Enslin
2025-02-13 17:19:43 +01:00
de0357473a
Make constant for test language in NodaWikidataFetcherTest public, allowing reuse
Joshua Ramon Enslin
2025-02-13 17:19:06 +01:00
ef43270fb2
Map suffixes material and technique to their respective tag relation types
Joshua Ramon Enslin
2025-02-13 14:04:38 +01:00
338e09f001
Add kannada to list of languages fetched from wikidata
Joshua Ramon Enslin
2025-02-13 13:10:45 +01:00
4cf9eaf4fa
Remove superfluous params passed to function
Joshua Ramon Enslin
2025-02-13 13:10:30 +01:00
18438251a7
Add functions for getting IDs by any translated entry irrespective of the language
Joshua Ramon Enslin
2025-02-12 17:15:19 +01:00
1cf0f9858a
Add tests for loading translations in NodaWikidataFetcher
Joshua Ramon Enslin
2025-02-12 16:02:04 +01:00
1d50027809
Make function getWikidataEntity public
Joshua Ramon Enslin
2025-02-12 15:48:52 +01:00
d1cee17ef5
Add Telugu to list of languages to fetch in Wikidata fetcher
Joshua Ramon Enslin
2025-02-12 12:47:02 +01:00
baf7905e0b
Map gender Q207959
Joshua Ramon Enslin
2025-02-03 09:41:16 +01:00
9bf14d7d91
Add search function for getting entries in NodaIDGetter across vocabs
Joshua Ramon Enslin
2025-01-31 23:25:40 +01:00
a621534136
Update NodaBlacklistedTerms
Joshua Ramon Enslin
2025-01-24 13:45:28 +01:00
51fe9a5e45
Cover more edge cases for splitting time names
Joshua Ramon Enslin
2025-01-15 11:49:20 +01:00
9c2eaa2929
Allow splitting 1945-48
Joshua Ramon Enslin
2025-01-15 10:35:35 +01:00
546c17031a
Make NodaImportLogger more resilient, prevent error in case of duplicate import names
Joshua Ramon Enslin
2024-12-12 12:43:11 +01:00
bf22f5541d
Retrieve "displayed subject" relationship from suffix "<Motiv>", "[Motiv]"
Joshua Ramon Enslin
2024-12-03 16:07:41 +01:00
e036d7881a
Add missing strict typing in function params
Joshua Ramon Enslin
2024-12-01 22:11:17 +01:00
d8db941485
Disallow tags of name "Nichtmünzliches" (de)
Joshua Ramon Enslin
2024-11-24 16:08:14 +01:00
b7bb7364d4
Ensure duplicate time names can be parsed in NodaTimeSplitter (e.g. 1.1.2024-1.1.2024)
Joshua Ramon Enslin
2024-11-20 10:02:10 +01:00
4dcd93b947
Better validate input JSON fetched from Wikipedia
Joshua Ramon Enslin
2024-11-12 15:36:32 +01:00
c72ad51dda
Merge branch 'master' of gitea:museum-digital/MDNodaHelpers
Joshua Ramon Enslin
2024-11-11 09:11:35 +01:00
d6dea3e280
Remove use of SESSION in NodaWikidataFetcher
Joshua Ramon Enslin
2024-11-11 09:11:15 +01:00
6f7ad13c4e
Add class NodaTagRelationIdentifier for parsing tag relation types from input tag names
Joshua Ramon Enslin
2024-11-09 19:44:09 +01:00
48355a6a36
Identify uncertainty before brackets ("Berlin ? (Germany)" > "Berlin (Germany)" + Uncertain)
Joshua Ramon Enslin
2024-11-09 18:42:18 +01:00
7cfe752c94
Handle commas when guessing time certainty
Joshua Ramon Enslin
2024-11-09 15:40:27 +01:00
29ca05f552
Properly handle commas at the end of names when guessing certainty
Joshua Ramon Enslin
2024-11-09 15:33:49 +01:00
eb371d4270
Ensure times can be split despite spaces at random points in given name
Joshua Ramon Enslin
2024-10-23 18:02:23 +02:00
16f36c0852
Improve test coverage
Joshua Ramon Enslin
2024-10-10 14:32:55 +02:00
669a8a1459
Add tests for lookup functions by vocabulary references
Joshua Ramon Enslin
2024-10-10 14:16:52 +02:00
a9c506497c
Respect diacritics when looking up tag, actor, .. IDs
Joshua Ramon Enslin
2024-10-10 09:51:28 +02:00
06f13c1a71
Add functions for loading only norm data links from Wikidata for places + actors
Joshua Ramon Enslin
2024-10-03 16:36:30 +02:00
cd49f194f2
Refactor wikidata fetcher
Joshua Ramon Enslin
2024-10-03 15:56:31 +02:00
9b63a4d95d
Refactor parsing of norm data links from Wikidata into a dedicated function
Joshua Ramon Enslin
2024-10-03 15:03:38 +02:00
96ba020514
Add function for getting actors' names including life dates in batch
Joshua Ramon Enslin
2024-09-28 22:45:12 +02:00
c650e57eda
Remove references to zoom factor for places in Wikidata fetcher
Joshua Ramon Enslin
2024-09-25 15:42:59 +02:00
dea09b17cd
Add safeguard against question marks entering NodaSplitTime
Joshua Ramon Enslin
2024-08-02 03:41:51 +02:00
cc0997f412
Add direct validation function for noda mail checker
Joshua Ramon Enslin
2024-07-28 03:47:35 +02:00
f18e4c3edc
Make class constant public
Joshua Ramon Enslin
2024-07-19 00:59:21 +02:00
f220a77ad7
Remove linking to wikipedia as a noda repository in wikidata fetcher
Joshua Ramon Enslin
2024-07-19 00:49:04 +02:00
58d3569718
Fix edge case 200 b.c. in NodaSplitTime
Joshua Ramon Enslin
2024-07-08 01:27:03 +02:00
27528c9cf7
Ran phpcbf over code
Joshua Ramon Enslin
2024-07-08 00:48:50 +02:00
205e77da0e
Remove group members before deleting group
Joshua Ramon Enslin
2024-05-27 03:13:03 +02:00
f36938b8dd
Add functions for updating / deleting groups
Joshua Ramon Enslin
2024-05-27 02:44:28 +02:00
cfa9cee60d
Add additional option for logging in nodac
Joshua Ramon Enslin
2024-05-23 22:58:42 +02:00
83a557b989
Add additional assertion to NodaTimeSplitterTest for more thorough type safety
Joshua Ramon Enslin
2024-05-05 22:54:33 +02:00
7f342ed3c4
Merge branch 'master' of gitea:museum-digital/MDNodaHelpers
Joshua Ramon Enslin
2024-05-04 01:19:23 +02:00
7d303e219f
Fix broken time splitting for year ranges BC
Joshua Ramon Enslin
2024-05-04 01:17:33 +02:00
ce480f8b9f
Add validator for tag descriptions
Joshua Ramon Enslin
2024-05-01 17:45:07 +02:00
eb14615917
Return start and end date when attempting to split time spans
Joshua Ramon Enslin
2024-04-17 00:02:31 +02:00
bd775bec45
Refactor time splitter, support computing of dates for time entries
Joshua Ramon Enslin
2024-04-16 23:21:34 +02:00
2cdfa2e948
Merge branch 'master' of gitea:museum-digital/MDNodaHelpers
Joshua Ramon Enslin
2024-03-24 02:17:22 +01:00
81a7d64e27
Handle Ukrainian year names (2022 p > 2022)
Joshua Ramon Enslin
2024-03-24 00:59:30 +01:00
6af51323e7
Categorize gender Q2449503
Joshua Ramon Enslin
2024-03-09 22:49:38 +01:00
3c43a3f2d3
Return an integer when checking time ID by stored rewrite
Joshua Ramon Enslin
2024-01-30 00:25:50 +01:00
09518a0a6e
Add function for getting time ID by stored rewrite
Joshua Ramon Enslin
2024-01-30 00:07:31 +01:00
93f8f13e62
Blacklist tag aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
Joshua Ramon Enslin
2024-01-15 01:46:21 +01:00
8a8f55b38c
Extend NodaGroup to allow adding new ones and linking vocabulary entries to one
Joshua Ramon Enslin
2023-12-12 04:29:23 +01:00
f3831965a3
Extend NodaTimeSplitter
Joshua Ramon Enslin
2023-12-09 23:54:45 +01:00
4a49c7a4e7
Fix erroneous splitting of Hungarian time names with additions
Joshua Ramon Enslin
2023-12-09 11:56:15 +01:00
3e9f675fdc
Add Slovak to list of languages to load
Joshua Ramon Enslin
2023-12-08 16:42:24 +01:00
2ab0e75111
Fix erroneous splitting of "YYYY MMD" in Hungarian
Joshua Ramon Enslin
2023-12-08 12:26:59 +01:00
27e259072c
Add new class NodaGroup
Joshua Ramon Enslin
2023-12-07 23:30:24 +01:00
50cb33720b
Allow updating and getting names of vocabulary groups
Joshua Ramon Enslin
2023-12-07 19:07:11 +01:00
a677c605c5
Use MDMysqli's default blend modes for manticore index setup
Joshua Ramon Enslin
2023-12-05 22:10:18 +01:00
681002d844
Extend lists for Ukrainian place hierarchy indicators
Joshua Ramon Enslin
2023-12-04 19:22:50 +01:00
40cf5a5112
Set expansion limit, blend_mode in manticore tables
Joshua Ramon Enslin
2023-12-04 18:42:40 +01:00
b14f2e14eb
Add further indicator for village-level places in Ukrainian
Joshua Ramon Enslin
2023-12-03 22:07:32 +01:00
55931ba3ef
Cover more levels of Ukrainian (current and historical) place hierarchies
Joshua Ramon Enslin
2023-12-02 16:32:46 +01:00
2badc67405
Add rewriting for Ukrainian place names based on specified hierarchies
Joshua Ramon Enslin
2023-12-02 15:21:02 +01:00
b4c941f441
Use MD_STD::json_encode over general \json_encode()
Joshua Ramon Enslin
2023-11-27 01:33:53 +01:00
d9a0985feb
Extend list of uncertainty prefixes for places with uppercase variants
Joshua Ramon Enslin
2023-11-27 01:15:35 +01:00
b36a504277
Add blacklist for unwanted rewrites in consolidating place names
Joshua Ramon Enslin
2023-11-26 23:55:22 +01:00