Commit Graph

  • 59580720fe Extend list of uncertainty indicators master jrenslin 2026-04-22 16:06:13 +02:00
  • 781180209f Merge branch 'master' of gitea:museum-digital/MDNodaHelpers jrenslin 2026-04-12 01:39:03 +02:00
  • bca32c33c7 Timesplitter: Translate "(terminus post quem)" to since jrenslin 2026-04-12 01:38:37 +02:00
  • 5ad7c5a29d Add: "vmtl." to list of uncertainty indicators jrenslin 2026-04-12 01:38:21 +02:00
  • fa79602300 Merge branch 'master' of gitea:museum-digital/MDNodaHelpers jrenslin 2026-04-05 18:58:23 +02:00
  • adddbdd0f6 Add "Allgemein" and "in general" to list of blacklisted tags jrenslin 2026-04-05 18:58:00 +02:00
  • 59453a2196 Use in_array with strict typing jrenslin 2026-03-11 18:29:08 +01:00
  • 3f2b9091dc Allow splitting "Jahr 1920" jrenslin 2026-02-21 18:29:16 +01:00
  • 7852377a8d Extend blacklist for given names jrenslin 2026-02-11 16:05:29 +01:00
  • 18613beaaa Add blacklist for actors' given names jrenslin 2026-02-11 14:40:11 +01:00
  • a0de048915 Identify time uncertainty for "etwa" jrenslin 2025-11-01 14:02:40 +01:00
  • 3fb6c591d7 Blacklist tag "AA" jrenslin 2025-10-30 19:10:32 +01:00
  • 773ae0659c Support importing Wikidata links to the Catalogue of Life jrenslin 2025-09-23 14:17:33 +02:00
  • b9d5377f42 Update NodaBlacklistedTerms jrenslin 2025-09-13 20:00:31 +02:00
  • c7e577c07e Add "II" to tag blacklist jrenslin 2025-09-13 18:31:06 +02:00
  • c4a174eb7b Add wrappers to check if an actor, place is usable for new terms jrenslin 2025-09-09 17:25:02 +02:00
  • da6d9de888 Set explicit visibility of constant in NodaValidationHelper jrenslin 2025-09-09 14:59:57 +02:00
  • eacfa4544f Blacklist term _ jrenslin 2025-08-25 12:15:56 +02:00
  • 6a99a112f4 Blacklist tag name "K.A" jrenslin 2025-08-21 00:49:35 +02:00
  • 5fedbd898e Add new category for noda log edits: notes jrenslin 2025-08-13 01:41:12 +02:00
  • 58d795c806 Merge branch 'master' of gitea:museum-digital/MDNodaHelpers jrenslin 2025-08-10 18:43:49 +02:00
  • db0aa0671b Accept empty extract data from Wikidata jrenslin 2025-08-10 18:43:33 +02:00
  • 579404d498 Extend list of blacklisted terms jrenslin 2025-08-06 14:54:40 +02:00
  • 6eabb59533 Extend list of blacklisted terms jrenslin 2025-07-29 18:36:49 +02:00
  • aa1d72cb64 Merge branch 'master' of gitea:museum-digital/MDNodaHelpers jrenslin 2025-07-08 17:39:14 +02:00
  • 1798e29186 Validate JSON returned from Wikidata's sparql endpoint, throw an exception if it is invalid jrenslin 2025-07-08 17:38:34 +02:00
  • 5a44bf6540 Tests: replace assertTrue(true) with expectNotToPerformAssertions() jrenslin 2025-06-27 15:12:56 +02:00
  • 119f216907 Merge branch 'master' of gitea:museum-digital/MDNodaHelpers jrenslin 2025-06-08 17:20:24 +02:00
  • 25668b7b16 Ping and reconnect DB in fulltext sync for actors fulltext index jrenslin 2025-06-08 17:19:47 +02:00
  • 8a31cf216e Add shortened 100x A to list of blacklisted tags jrenslin 2025-05-22 16:25:27 +02:00
  • ff474341ed Add iconclass terms BB, CC, DD, to blacklist jrenslin 2025-05-08 16:18:05 +02:00
  • 1051e10732 Prevent ambigious splitting of [0-9]{4}-[0-9]{2} jrenslin 2025-05-06 22:32:00 +02:00
  • 057cac0f1b Ensure 1903/1904 cannot be split jrenslin 2025-05-05 17:05:47 +02:00
  • 0053fbe030 Support splitting times like "1. Hälfte des 19. Jahrhunderts" jrenslin 2025-04-28 17:00:32 +02:00
  • 7a2856ffad Split times in more cases (300-20 BC, 300-4000 CE) jrenslin 2025-04-08 15:16:25 +02:00
  • 00638152cf Prevent splitting of non-existing exact dates (e.g. 31.04.XXXX) jrenslin 2025-04-08 03:48:04 +02:00
  • dba60dbce6 Fix order of split days and months within a single year BCE jrenslin 2025-04-07 18:32:14 +02:00
  • f84fe1bca5 Fix type error / reference to values now not consistently existing anymore jrenslin 2025-04-06 22:56:36 +02:00
  • 423959ac94 Stop early if autotranslation cannot proceed after validation jrenslin 2025-04-05 00:11:03 +02:00
  • e8edb4a459 Time splitter: Handle first/second half jrenslin 2025-04-05 00:09:39 +02:00
  • 8491b62a83 Validate against time errors in autogenerating translations for times jrenslin 2025-04-04 20:03:59 +02:00
  • bb2b1c2c32 Update NodaGroup jrenslin 2025-03-13 00:30:33 +01:00
  • 5054d3c62f Use more rigurous trimming in NodaConsolidatedNamesForPersinst jrenslin 2025-03-10 04:18:00 +01:00
  • beba838c0d Correctly handle multibype hyphens in XXXX-XXXX jrenslin 2025-03-10 04:13:59 +01:00
  • 54dd958073 See before jrenslin 2025-03-10 04:05:00 +01:00
  • 5b99304b5c Accept an additional type of hyphen / dash in time splitting jrenslin 2025-03-10 03:58:45 +01:00
  • 5cce98f15b Extend tests jrenslin 2025-03-10 03:20:46 +01:00
  • 5036c77f32 Extend test for getting actor ID by life dates + name jrenslin 2025-03-10 02:18:28 +01:00
  • e95415be8f Add test for getting actor ID by name with life dates jrenslin 2025-03-10 01:48:09 +01:00
  • 5192781494 Use Wikipedia API for getting descriptions from Wikipedia rather than parsing HTML in Wikidata fetcher jrenslin 2025-03-09 02:08:26 +01:00
  • d9d9f7fcdc Continue refactoring tests for time splitter to run provider-based jrenslin 2025-02-24 14:02:42 +01:00
  • dbfa0df17f Begin restructuring NodaTimeSplitterTest to use data providers jrenslin 2025-02-21 10:31:21 +01:00
  • 3409ec7afe Begin adding autotranslation language CRH / Crimean Tatar jrenslin 2025-02-18 17:51:36 +01:00
  • 27ac3f255a Minor typing improvements jrenslin 2025-02-15 13:36:50 +01:00
  • 9d7d53a858 Disallow fetching from Wikidata disambiguation pages jrenslin 2025-02-13 22:37:17 +01:00
  • 28f6db67ff Disable XML error warnings when parsing unclean inputs from Wikidata jrenslin 2025-02-13 21:48:07 +01:00
  • 2f3bc5f2fa Prefer wikipedia page titles over wikidata labels jrenslin 2025-02-13 21:38:13 +01:00
  • 39362f537a Merge branch 'master' of gitea:museum-digital/MDNodaHelpers jrenslin 2025-02-13 17:19:43 +01:00
  • de0357473a Make constant for test language in NodaWikidataFetcherTest public, allowing reuse jrenslin 2025-02-13 17:19:06 +01:00
  • ef43270fb2 Map suffixes material and technique to their respective tag relation types jrenslin 2025-02-13 14:04:38 +01:00
  • 338e09f001 Add kannada to list of languages fetched from wikidata jrenslin 2025-02-13 13:10:45 +01:00
  • 4cf9eaf4fa Remove superfluous params passed to function jrenslin 2025-02-13 13:10:30 +01:00
  • 18438251a7 Add functions for getting IDs by any translated entry irrespective of the language jrenslin 2025-02-12 17:15:19 +01:00
  • 1cf0f9858a Add tests for loading translations in NodaWikidataFetcher jrenslin 2025-02-12 16:02:04 +01:00
  • 1d50027809 Make function getWikidataEntity public jrenslin 2025-02-12 15:48:52 +01:00
  • d1cee17ef5 Add Telugu to list of languages to fetch in Wikidata fetcher jrenslin 2025-02-12 12:47:02 +01:00
  • baf7905e0b Map gender Q207959 jrenslin 2025-02-03 09:41:16 +01:00
  • 9bf14d7d91 Add search function for getting entries in NodaIDGetter across vocabs jrenslin 2025-01-31 23:25:40 +01:00
  • a621534136 Update NodaBlacklistedTerms jrenslin 2025-01-24 13:45:28 +01:00
  • 51fe9a5e45 Cover more edge cases for splitting time names jrenslin 2025-01-15 11:49:20 +01:00
  • 9c2eaa2929 Allow splitting 1945-48 jrenslin 2025-01-15 10:35:35 +01:00
  • 546c17031a Make NodaImportLogger more resilient, prevent error in case of duplicate import names jrenslin 2024-12-12 12:43:11 +01:00
  • bf22f5541d Retrieve "displayed subject" relationship from suffix "<Motiv>", "[Motiv]" jrenslin 2024-12-03 16:07:41 +01:00
  • e036d7881a Add missing strict typing in function params jrenslin 2024-12-01 22:11:17 +01:00
  • d8db941485 Disallow tags of name "Nichtmünzliches" (de) jrenslin 2024-11-24 16:08:14 +01:00
  • b7bb7364d4 Ensure duplicate time names can be parsed in NodaTimeSplitter (e.g. 1.1.2024-1.1.2024) jrenslin 2024-11-20 10:02:10 +01:00
  • 4dcd93b947 Better validate input JSON fetched from Wikipedia jrenslin 2024-11-12 15:36:32 +01:00
  • c72ad51dda Merge branch 'master' of gitea:museum-digital/MDNodaHelpers jrenslin 2024-11-11 09:11:35 +01:00
  • d6dea3e280 Remove use of SESSION in NodaWikidataFetcher jrenslin 2024-11-11 09:11:15 +01:00
  • 6f7ad13c4e Add class NodaTagRelationIdentifier for parsing tag relation types from input tag names jrenslin 2024-11-09 19:44:09 +01:00
  • 48355a6a36 Identify uncertainty before brackets ("Berlin ? (Germany)" > "Berlin (Germany)" + Uncertain) jrenslin 2024-11-09 18:42:18 +01:00
  • 7cfe752c94 Handle commas when guessing time certainty jrenslin 2024-11-09 15:40:27 +01:00
  • 29ca05f552 Properly handle commas at the end of names when guessing certainty jrenslin 2024-11-09 15:33:49 +01:00
  • eb371d4270 Ensure times can be split despite spaces at random points in given name jrenslin 2024-10-23 18:02:23 +02:00
  • 16f36c0852 Improve test coverage jrenslin 2024-10-10 14:32:55 +02:00
  • 669a8a1459 Add tests for lookup functions by vocabulary references jrenslin 2024-10-10 14:16:52 +02:00
  • a9c506497c Respect diacritics when looking up tag, actor, .. IDs jrenslin 2024-10-10 09:51:28 +02:00
  • 06f13c1a71 Add functions for loading only norm data links from Wikidata for places + actors jrenslin 2024-10-03 16:36:30 +02:00
  • cd49f194f2 Refactor wikidata fetcher jrenslin 2024-10-03 15:56:31 +02:00
  • 9b63a4d95d Refactor parsing of norm data links from Wikidata into a dedicated function jrenslin 2024-10-03 15:03:38 +02:00
  • 96ba020514 Add function for getting actors' names including life dates in batch jrenslin 2024-09-28 22:45:12 +02:00
  • c650e57eda Remove references to zoom factor for places in Wikidata fetcher jrenslin 2024-09-25 15:42:59 +02:00
  • dea09b17cd Add safeguard against question marks entering NodaSplitTime jrenslin 2024-08-02 03:41:51 +02:00
  • cc0997f412 Add direct validation function for noda mail checker jrenslin 2024-07-28 03:47:35 +02:00
  • f18e4c3edc Make class constant public jrenslin 2024-07-19 00:59:21 +02:00
  • f220a77ad7 Remove linking to wikipedia as a noda repository in wikidata fetcher jrenslin 2024-07-19 00:49:04 +02:00
  • 58d3569718 Fix edge case 200 b.c. in NodaSplitTime jrenslin 2024-07-08 01:27:03 +02:00
  • 27528c9cf7 Ran phpcbf over code jrenslin 2024-07-08 00:48:50 +02:00
  • 205e77da0e Remove group members before deleting group jrenslin 2024-05-27 03:13:03 +02:00
  • f36938b8dd Add functions for updating / deleting groups jrenslin 2024-05-27 02:44:28 +02:00