Commit Graph

298 Commits

Author SHA1 Message Date
1dd05a3822
Add blacklist for unwanted tag names
Close #4
2021-04-25 00:16:53 +02:00
9157e8a0f1
Add fix for empty noda references in fetching tags from Wikidata 2021-04-24 01:13:27 +02:00
e1a9a99797
Use ++$i over $i++ outside of loops in Wikidata fetcher
This is a slightly more performant way of incrementing an integer.
2021-04-12 12:54:07 +02:00
792754c20c
Fetch orcid IDs in wikidata fetcher 2021-04-07 11:33:49 +02:00
e957db4210
Add condition to split times like "xxxx bis yyyy" 2021-03-26 12:32:27 +01:00
c964053c91
Add function for reading Wikidata ID from a Wikipedia page 2021-03-18 01:23:45 +01:00
1fd87c7e6d
Simplify NodaWikidataFetcher, unify list of langs, simplify linking to noda sources
Close #2
2021-03-17 22:06:08 +01:00
f0b5a08cdf
Move NodaWikidataFetcher to this repository 2021-03-17 16:11:06 +01:00
d7e3a88320
Fix file path of MD_STD 2021-03-09 20:15:22 +01:00
1fe795d219
Use mysqli->autocommit(false) to speed up autotranslating 2021-03-08 21:23:38 +01:00
7ab7b8341f
Add license 2021-02-06 13:52:19 +01:00
b8ae4f8b3e
Add README 2021-02-06 13:40:54 +01:00
668477f199
Add missing check in NodaTimeAutotranslater 2021-01-31 19:39:09 +01:00
7ccdfd4659
Fix function comment in NodaTimeSplitter 2021-01-31 01:50:25 +01:00
aca4f86da5
Add "Neu" und "Neu hergestellt" to list of disallowed time entries 2021-01-29 20:03:06 +01:00
a761a9dfd7
Stop time splitter for start / end, if common time splitter can be used 2021-01-07 11:43:20 +01:00
c02165df7b
Add exception catching in splitting times / dates 2021-01-06 23:11:05 +01:00
54764e741a
Add option to split and translate times with start and end dates
Close #1
2021-01-06 23:05:26 +01:00
fcc63c4ea0
Merge branch 'master' of gitea:museum-digital/MDNodaHelpers 2021-01-06 16:07:46 +01:00
a6030e4a5f
Fix bug in month names similar in English and German 2021-01-06 16:07:21 +01:00
7ef09db72c
Add static function in NodaIDGetter to get tag ID by import log 2021-01-04 23:06:36 +01:00
9f67d253da
Add functions to get actor and place IDs by import logs 2021-01-04 22:50:26 +01:00
8f612dede1
Read 1917-ig. as similar to 1917-ig in time splitter 2020-12-28 14:40:04 +01:00
6e910cd676
Add English month names for splitting time terms 2020-12-22 12:22:14 +01:00
8ac22165fc
Add "ohne angabe" to list of disallowed terms 2020-12-21 15:32:16 +01:00
d8e44550fc
Add "Ohne Datum" to list of disallowed time terms 2020-12-21 15:15:00 +01:00
af454ec013
Setup ID getter by rewrite for tags to return arrays
Tag rewrites can now be set for multiple target tags.
2020-12-20 23:24:08 +01:00
fce933c12a
Extend list of disallowed noda terms 2020-12-20 15:40:30 +01:00
b27f0ec918
Add "Keine Angaben" to list of disallowed inputs for places 2020-12-19 02:37:38 +01:00
a070970554
Remove empty newlines in class defs 2020-12-19 02:36:38 +01:00
ca13f36c0d
Add function to get tag IDs by their translated names 2020-12-07 13:43:07 +01:00
50ff1a2339
Add script to get highest related tag 2020-10-30 16:30:23 +01:00
0ea9c31845
Explicitly use global namespace in function calls 2020-10-23 17:03:51 +02:00
14e82826ae Fix bug in getting place IDs by noda links 2020-10-05 12:05:48 +02:00
99aa1d74ad Improve / make more explicit: type safety 2020-10-04 23:59:40 +02:00
97566ea2d9 Split more time variations 2020-10-04 23:57:59 +02:00
8a4a8f7ed8 Split more variations of dots in dates, century ranges 2020-10-04 23:20:58 +02:00
d0fe1e89ed Improve trimming inputs when cleaning certainty indicators 2020-10-04 22:52:15 +02:00
1f4d692fb5 Enable automatic translations of times "before" a given date 2020-10-04 19:34:17 +02:00
1685d78f65 Allow splitting times "before <X>" 2020-10-04 19:27:23 +02:00
a0037c9883 Allow splitting times after <year><month> 2020-10-04 19:17:18 +02:00
be46c39efd Fix wrong assumption on handling counting times when autotranslating
"after <month>"
2020-10-04 18:36:03 +02:00
c9a1a74bce Enable autotranslating of times 'after' a certain date 2020-10-04 18:21:33 +02:00
5e90e5d3f2 Add strings for expressing times 'after' and 'before' 2020-10-04 17:40:51 +02:00
2a57537436 Allow splitting times "Nach 1905" ("Nach " followed by 4 digit time
number)
2020-10-04 17:39:34 +02:00
36d27e0f73 Remove / disallow certain input names in NodaUncertaintyHelper 2020-10-04 02:40:21 +02:00
4e934e380c Use [0-9]{4} spelling time 2020-10-03 19:13:27 +02:00
ff35ca7bd9 Enable time splitter to deal with some roman numbers 2020-10-03 16:10:43 +02:00
80cd88222d Enable time splitter to recognize sz as abbr. for század 2020-10-03 15:59:49 +02:00
3664bcf3f6 Add getting places by noda links to NodaIDGetter 2020-10-01 12:53:27 +02:00