Commit Graph

  • 1fd87c7e6d Simplify NodaWikidataFetcher, unify list of langs, simplify linking to noda sources jrenslin 2021-03-17 22:06:08 +01:00
  • f0b5a08cdf Move NodaWikidataFetcher to this repository jrenslin 2021-03-17 16:10:49 +01:00
  • d7e3a88320 Fix file path of MD_STD jrenslin 2021-03-09 20:15:22 +01:00
  • 1fe795d219 Use mysqli->autocommit(false) to speed up autotranslating jrenslin 2021-03-08 21:23:38 +01:00
  • 7ab7b8341f Add license jrenslin 2021-02-06 13:52:19 +01:00
  • b8ae4f8b3e Add README jrenslin 2021-02-06 13:40:54 +01:00
  • 668477f199 Add missing check in NodaTimeAutotranslater jrenslin 2021-01-31 19:39:09 +01:00
  • 7ccdfd4659 Fix function comment in NodaTimeSplitter jrenslin 2021-01-31 01:50:25 +01:00
  • aca4f86da5 Add "Neu" and "Neu hergestellt" to list of disallowed time entries jrenslin 2021-01-29 20:03:06 +01:00
  • a761a9dfd7 Stop time splitter for start / end, if common time splitter can be used jrenslin 2021-01-07 11:43:20 +01:00
  • c02165df7b Add exception catching in splitting times / dates jrenslin 2021-01-06 23:11:05 +01:00
  • 54764e741a Add option to split and translate times with start and end dates jrenslin 2021-01-06 23:05:26 +01:00
  • fcc63c4ea0 Merge branch 'master' of gitea:museum-digital/MDNodaHelpers jrenslin 2021-01-06 16:07:46 +01:00
  • a6030e4a5f Fix bug in month names similar in English and German jrenslin 2021-01-06 16:07:21 +01:00
  • 7ef09db72c Add static function in NodaIDGetter to get tag ID by import log jrenslin 2021-01-04 23:06:36 +01:00
  • 9f67d253da Add functions to get actor and place IDs by import logs jrenslin 2021-01-04 22:50:26 +01:00
  • 8f612dede1 Read 1917-ig. as similar to 1917-ig in time splitter jrenslin 2020-12-28 14:40:04 +01:00
  • 6e910cd676 Add English month names for splitting time terms jrenslin 2020-12-22 12:22:14 +01:00
  • 8ac22165fc Add "ohne angabe" to list of disallowed terms jrenslin 2020-12-21 15:32:16 +01:00
  • d8e44550fc Add "Ohne Datum" to list of disallowed time terms jrenslin 2020-12-21 15:15:00 +01:00
  • af454ec013 Setup ID getter by rewrite for tags to return arrays jrenslin 2020-12-20 23:24:08 +01:00
  • fce933c12a Extend list of disallowed noda terms jrenslin 2020-12-20 15:40:30 +01:00
  • b27f0ec918 Add "Keine Angaben" to list of disallowed inputs for places jrenslin 2020-12-19 02:37:38 +01:00
  • a070970554 Remove empty newlines in class defs jrenslin 2020-12-19 02:36:38 +01:00
  • ca13f36c0d Add function to get tag IDs by their translated names jrenslin 2020-12-07 13:43:07 +01:00
  • 50ff1a2339 Add script to get highest related tag jrenslin 2020-10-30 16:30:23 +01:00
  • 0ea9c31845 Explicitly use global namespace in function calls jrenslin 2020-10-23 17:03:51 +02:00
  • 14e82826ae Fix bug in getting place IDs by noda links jrenslin 2020-10-05 12:05:48 +02:00
  • 99aa1d74ad Improve / make more explicit: type safety jrenslin 2020-10-04 23:59:40 +02:00
  • 97566ea2d9 Split more time variations jrenslin 2020-10-04 23:57:59 +02:00
  • 8a4a8f7ed8 Split more variations of dots in dates, century ranges jrenslin 2020-10-04 23:20:58 +02:00
  • d0fe1e89ed Improve trimming inputs when cleaning certainty indicators jrenslin 2020-10-04 22:52:15 +02:00
  • 1f4d692fb5 Enable automatic translations of times "before" a given date jrenslin 2020-10-04 19:34:17 +02:00
  • 1685d78f65 Allow splitting times "before <X>" jrenslin 2020-10-04 19:27:23 +02:00
  • a0037c9883 Allow splitting times after <year><month> jrenslin 2020-10-04 19:17:18 +02:00
  • be46c39efd Fix wrong assumption on handling counting times when autotranslating "after <month>" jrenslin 2020-10-04 18:36:03 +02:00
  • c9a1a74bce Enable autotranslating of times 'after' a certain date jrenslin 2020-10-04 18:21:33 +02:00
  • 5e90e5d3f2 Add strings for expressing times 'after' and 'before' stefan 2020-10-04 17:40:51 +02:00
  • 2a57537436 Allow splitting times "Nach 1905" ("Nach " followed by 4 digit time number) jrenslin 2020-10-04 17:39:34 +02:00
  • 36d27e0f73 Remove / disallow certain input names in NodaUncertaintyHelper jrenslin 2020-10-04 02:40:21 +02:00
  • 4e934e380c Use [0-9]{4} spelling time jrenslin 2020-10-03 19:13:27 +02:00
  • ff35ca7bd9 Enable time splitter to deal with some roman numbers jrenslin 2020-10-03 16:10:43 +02:00
  • 80cd88222d Enable time splitter to recognize sz as abbr. for század jrenslin 2020-10-03 15:59:49 +02:00
  • 3664bcf3f6 Add getting places by noda links to NodaIDGetter jrenslin 2020-10-01 12:53:27 +02:00
  • 67cc76cff9 Allow splitting of German short decade names: 20er or 1920er jrenslin 2020-09-27 17:12:34 +02:00
  • 91f435a2e4 Enable parsing of months: 2020-01 jrenslin 2020-09-27 17:10:17 +02:00
  • baeb9ba72e Add test assertion for splitting international dates jrenslin 2020-09-27 12:40:15 +02:00
  • de7968fbbd Only allow splitting by international format if month < 13 jrenslin 2020-09-27 12:38:38 +02:00
  • 48f3bd2c3f Allow splitting international dates (2020-12-20) jrenslin 2020-09-27 12:36:34 +02:00
  • 830b37f547 Improve autotranslating of times before 1.1.1000 jrenslin 2020-09-26 16:10:26 +02:00
  • c9d8d4bdbd Allow automatic translations of days before 1000 CE jrenslin 2020-09-26 16:02:18 +02:00
  • b405855fc2 Disallow translating as decade before 1000 CE jrenslin 2020-09-26 15:30:30 +02:00
  • 8eda7d4c7f Improve type-safety jrenslin 2020-09-26 15:21:32 +02:00
  • 2b8b5d5743 Add check for improved type safety jrenslin 2020-09-26 15:19:04 +02:00
  • 3058f25a1c Add tests for German dates, enable splitting of 5 digit timespans jrenslin 2020-09-26 15:10:06 +02:00
  • 14b0d8037d Add tests for splitting Hungarian dates jrenslin 2020-09-26 14:15:15 +02:00
  • d56d47aee1 Add test for NodaTimeAutotranslater, allow parsing days and months BC jrenslin 2020-09-26 13:20:22 +02:00
  • cb2eff61a3 Use local representation of DB connection in NodaTimeAutotranslater jrenslin 2020-09-26 12:21:00 +02:00
  • 7a1dcbb14f Fix jrenslin 2020-09-26 10:23:37 +02:00
  • 1cfbfe7743 Improve type-safety jrenslin 2020-09-25 22:29:56 +02:00
  • 68d07c03d8 Allow splitting timespans B.C.E. stefan 2020-09-25 09:00:54 +02:00
  • 7bc5bdf335 Allow parsing of "1910-1925." jrenslin 2020-09-24 17:59:15 +02:00
  • 9f39437c6e Improve splitting and translating of times BC jrenslin 2020-09-24 17:35:40 +02:00
  • 785b1c5156 Allow parsing of single-digits century spans also in the form of 1-3. század jrenslin 2020-09-24 15:49:42 +02:00
  • 1668495573 Add missing abbreviations for Hungarian months, parse -tól jrenslin 2020-09-24 15:45:50 +02:00
  • ddaa31646c Enable splitting of es évek years jrenslin 2020-09-24 13:24:48 +02:00
  • 36d8257ca0 Fix problem in last-syllable-depending time suffixes in Hungarian (as évek vs. es évek) jrenslin 2020-09-24 11:54:01 +02:00
  • d7e2c7f4ed Add automatic translation of decade names jrenslin 2020-09-24 11:47:54 +02:00
  • a4a94a8f8a Allow autotranslation of time spans before 1000 CE jrenslin 2020-09-23 17:03:12 +02:00
  • 0f6a6ebc84 Add automatic splitting and translation of centuries (CE) jrenslin 2020-09-23 10:28:04 +02:00
  • 308e11b4f8 Add automatic translation of times since and until another time jrenslin 2020-09-22 22:46:48 +02:00
  • 4f1e65934a Enable NodaTimeSplitter to split dates with uncertain end or start (seit, bis) jrenslin 2020-09-22 17:58:26 +02:00
  • 707f781f1e Fix attempt to parse 5-digit times as German dates jrenslin 2020-09-22 11:23:21 +02:00
  • b8dbfb32df Add körül as a known time uncertainty indicator (suffix) jrenslin 2020-09-22 11:03:39 +02:00
  • dd2fbafd25 Improve type-safety / explicitness jrenslin 2020-09-21 10:49:34 +02:00
  • e4558ae227 Add trim to place names for checking uncertainty also at start of checker jrenslin 2020-09-21 02:11:58 +02:00
  • e53eec84e6 Add functions for cleaning uncertainty indicators to NodaUncertaintyHelper jrenslin 2020-09-21 01:57:21 +02:00
  • 923505f146 Add NodaUncertaintyHelper for guessing uncertainty of noda entries jrenslin 2020-09-21 01:24:07 +02:00
  • 7bbd50a586 Add class NodaIDGetter for collecting functions to identify noda entities by available attributes jrenslin 2020-09-20 22:40:40 +02:00
  • 8f7df866d7 Fix bug in splitter causing false positives jrenslin 2020-09-20 18:50:26 +02:00
  • ce6e388866 Fix jrenslin 2020-09-20 18:32:48 +02:00
  • d1c9e6e15f Fix missing output value in some cases of time splitting jrenslin 2020-09-20 18:04:41 +02:00
  • f268ab412c Add capability to parse dates like "2300-800 v. Chr." (German) and "Kr. e. 1200" (Hungarian) jrenslin 2020-09-20 17:40:45 +02:00
  • 8da158aa77 Enable splitting of timespans of three-digit year names BC (in German) jrenslin 2020-09-20 17:33:56 +02:00
  • c0047a5956 Enable splitting of 4-digit times BC (in German) jrenslin 2020-09-20 17:32:20 +02:00
  • 974fe39cde Allow translations of times before 1000 CE jrenslin 2020-09-20 17:13:42 +02:00
  • fd0bd48995 Strip away variations of n. Chr. from time strings for splitting jrenslin 2020-09-20 15:52:02 +02:00
  • affe8e3741 Teach time splitter to handle multi-year time spans in CE jrenslin 2020-09-20 15:42:59 +02:00
  • c298794a32 Improve type-safety jrenslin 2020-09-18 21:38:49 +02:00
  • 4aa2a5df2f Add automatic translation of month names for main time names in splitter jrenslin 2020-09-18 21:24:21 +02:00
  • 130140e910 Remove "közott" for timespans jrenslin 2020-09-18 19:17:10 +02:00
  • f05938c867 Initial jrenslin 2020-09-18 18:48:40 +02:00