The timenorm library provides models for finding natural language expressions of dates and times and converting them to a normalized form.
TimeNorm as modified by Eli Goldner for the ChemoTimelines project, with a simple Java API.
The primary entry point for the library is the TemporalNeuralParser class,
which implements a character-based recurrent neural network for finding and
normalizing time expressions, as described in:
Egoitz Laparra, Dongfang Xu, and Steven Bethard. 2018. From Characters to Time Intervals: New Paradigms for Evaluation and Neural Parsing of Time Normalizations. In: Transactions of the Association for Computational Linguistics 2018, Vol. 6, pp. 343–356
Dongfang Xu, Egoitz Laparra and Steven Bethard. 2019. Pre-trained Contextualized Character Embeddings Lead to Major Improvements in Time Normalization: a Detailed Analysis. In: Proceedings of The Eighth Joint Conference on Lexical and Computational Semantics.
To use the parser, create an instance of TemporalNeuralParser, and provide
as an anchor the time at which your text was written.
scala> import org.clulab.timenorm.scate._
import org.clulab.timenorm.scate._
scala> val parser = new TemporalNeuralParser
parser: org.clulab.timenorm.scate.TemporalNeuralParser = org.clulab.timenorm.scate.TemporalNeuralParser@44c2e8a8
scala> val anchor = SimpleInterval.of(2019, 5, 30)
anchor: org.clulab.timenorm.scate.SimpleInterval = SimpleInterval(2019-05-30T00:00,2019-05-31T00:00)When you pass text to the parser, it will return the predicted time expressions. Each time expression contains the span of characters that evoked the time expression. Interval-type time expressions also contain their start and endpoints on the timeline.
scala> val text = "I have not seen her since last year. We hope to meet in the next two weeks."
text: String = I have not seen her since last year. We hope to meet in the next two weeks.
scala> for (timex <- parser.parse(text, anchor)) timex match {
| case interval: Interval =>
| val Some((charStart, charEnd)) = interval.charSpan
| println(s"${interval.start} ${interval.end} ${text.substring(charStart, charEnd)}")
| }
2018-01-01T00:00 2019-05-31T00:00 since last year
2019-05-31T00:00 2019-06-14T00:00 next two weeksThe parser runs faster if you pass batches of texts via the parseBatch
(instead of the parse) method. So if you know, say, the sentence segmentation
for your text, you may prefer to use that method.
The TimeExpression objects returned by the neural parser are based on the set
of semantically compositional operators described in:
Steven Bethard and Jonathan Parker. 2016. A Semantically Compositional Annotation Scheme for Time Normalization. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). pp. 3779-3786.
If you would like to manually construct complex time expressions, the operators
are available in the org.clulab.timenorm.scate package, and can be combined
with fields and units from the java.time library:
scala> import org.clulab.timenorm.scate._, java.time.temporal.ChronoField._, java.time.temporal.ChronoUnit._
import org.clulab.timenorm.scate._
import java.time.temporal.ChronoField._
import java.time.temporal.ChronoUnit._
scala> // the 3-year period following the year 1985
scala> NextP(Year(1985), SimplePeriod(YEARS, 3))
res0: org.clulab.timenorm.scate.NextP = NextP(Year(1985,0,None),SimplePeriod(Years,IntNumber(3,None),None,None),None)
scala> (res0.start, res0.end)
res1: (java.time.LocalDateTime, java.time.LocalDateTime) = (1986-01-01T00:00,1989-01-01T00:00)
scala> // the Friday the 13th following the 15th day of the 3rd month of 1985
scala> NextRI(
| ThisRI(
| ThisRI(
| Year(1985),
| RepeatingField(MONTH_OF_YEAR, 3)),
| RepeatingField(DAY_OF_MONTH, 15)),
| IntersectionRI(Set(
| RepeatingField(DAY_OF_WEEK, 5),
| RepeatingField(DAY_OF_MONTH, 13))))
res2: org.clulab.timenorm.scate.NextRI = NextRI(ThisRI(ThisRI(Year(1985,0,None),RepeatingField(MonthOfYear,3,None,None),None),RepeatingField(DayOfMonth,15,None,None),None),IntersectionRI(Set(RepeatingField(DayOfWeek,5,None,None), RepeatingField(DayOfMonth,13,None,None)),None),<function1>,None)
scala> (res2.start, res2.end)
res3: (java.time.LocalDateTime, java.time.LocalDateTime) = (1985-09-13T00:00,1985-09-14T00:00)This portion of the library is no longer recommended, but it is still included in the distribution.
The scfg portion of the library can take a time expression and normalize it
to TimeML format using a synchronous context free grammar, as described in:
Steven Bethard. 2013. A Synchronous Context Free Grammar for Time Normalization. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 821-826.
Note that the scfg model cannot find time expressions in text; it can only
normalize them after they are found. Sample usage:
scala> import org.clulab.timenorm.scfg._, scala.util.Success
import org.clulab.timenorm.scfg._
import scala.util.Success
scala> val parser = TemporalExpressionParser.en // English, Italian, and also Spanish are available.
parser: org.clulab.timenorm.scfg.TemporalExpressionParser = org.clulab.timenorm.scfg.TemporalExpressionParser@d653e41
scala> val Success(temporal) = parser.parse("two weeks ago", TimeSpan.of(2013, 1, 4))
temporal: org.clulab.timenorm.scfg.Temporal = TimeSpan(2012-12-17T00:00Z,2012-12-24T00:00Z,Period(Map(Weeks -> 1),Exact),Exact)
scala> temporal.timeMLValue
res0: String = 2012-W51The TemporalExpressionParser is available in three languages: English (en), Italian (it), Spanish (es). The last is thanks to contributions of @NGEscribano from the timenorm-es project which includes additional helpful information in its Spanish TimeNorm document.
Add HealthNLP-TimeNorm to your Maven project as a dependency in your pom.xml:
<dependency>
<groupId>org.healthnlp</groupId>
<artifactId>hnlp-timenorm</artifactId>
<version>1.0.0</version>
</dependency>Add a download of HealthNLP-TimeNorm to your maven repositories .m2/, to take place before your project compiles:
<plugin>
<groupId>io.github.download-maven-plugin</groupId>
<artifactId>download-maven-plugin</artifactId>
<version>2.0.0</version>
<executions>
<execution>
<id>get-hnlp-timenorm</id>
<phase>clean</phase>
<goals>
<goal>wget</goal>
</goals>
</execution>
</executions>
<configuration>
<url>https://github.com/HealthNLPorg/hnlp-timenorm/releases/download/1.0.0/hnlp-timenorm.zip</url>
<unpack>true</unpack>
<outputDirectory>${settings.localRepository}/org/healthnlp/hnlp-timenorm</outputDirectory>
</configuration>
</plugin>Java and/or your IDE will not be able to find hnlp-timenorm until you perform mvn clean.
Simple Java interface for normalization of temporal expressions using TimeNorm. A timeout is used for calls to TimeNorm to prevent hanging processes. Viewed in the JetBrains IntelliJ IDE, you may see indications of missing classes for the TimeNorm scala class imports. If this is the case, make sure that you have the JetBrain Scala plugin installed and enabled. Right-click on the project/module in your "Project" window, and select Maven > "Generate Sources and Update Folders".
Simple Java interface for normalization of temporal expressions using TimeNorm. A timeout is used for calls to TimeNorm to prevent hanging processes. This will use a default timeout of 1 second and return simple normalization text.
Simple Java interface for normalization of temporal expressions using TimeNorm. A timeout is used for calls to TimeNorm to prevent hanging processes. This will return simple normalization text.
- Parameters:
timeoutMillis— Millisecond timeout for calls to TimeNorm to prevent hanging processes.
- Exceptions:
IllegalArgumentException— if the given timeout is less than 100 or greater than 10,000.
public TimexNormalizer( final boolean simpleFormat ) throws IllegalArgumentException``public TimexNormalizer( final int timeoutMillis, final boolean simpleFormat ) throws IllegalArgumentException
Simple Java interface for normalization of temporal expressions using TimeNorm. A timeout is used for calls to TimeNorm to prevent hanging processes. This will use a default timeout of 1 second and return simple normalization text.
- Parameters:
simpleFormat— true if the returned normalization should be simple "2012-05-13" rather than structured "TimeSpan(2026-06-15T00:00Z,2026-06-16T00:00Z,Period(Map(Days -> 1),Exact),Exact)"
public TimexNormalizer( final int timeoutMillis, final boolean simpleFormat ) throws IllegalArgumentException``public TimexNormalizer( final int timeoutMillis, final boolean simpleFormat ) throws IllegalArgumentException
Simple Java interface for normalization of temporal expressions using TimeNorm. A timeout is used for calls to TimeNorm to prevent hanging processes.
- Parameters:
timeoutMillis— Millisecond timeout for calls to TimeNorm to prevent hanging processes.simpleFormat— true if the returned normalization should be simple "2012-05-13" rather than structured "TimeSpan(2026-06-15T00:00Z,2026-06-16T00:00Z,Period(Map(Days -> 1),Exact),Exact)"
- Exceptions:
IllegalArgumentException— if the given timeout is less than 100 or greater than 10,000.
Normalize time using today (right this minute) as the anchor date.
- Parameters:
timex— Text containing temporal expression.
- Returns:
- Normalized expression of the given temporal expression.
- Exceptions:
IllegalArgumentException— if the temporal expression is empty or cannot be normalized.
public String normalize( final String timex, final int year, final int month, final int day ) throws IllegalArgumentException
Normalize time using a provided anchor date.
- Parameters:
timex— Text containing temporal expression.year— year of the anchor date.month— month-of-year of the anchor date, from 1 to 12.day— day-of-month of the anchor date, from 1 to 31
- Returns:
- Normalized expression of the given temporal expression.
- Exceptions:
IllegalArgumentException— if the temporal expression is empty or cannot be normalized or the anchor is bad.
public String normalize( final String timex, final int year, final int month, final int day, final int hour, final int minute ) throws IllegalArgumentException
Normalize time using a provided anchor date.
- Parameters:
timex— Text containing temporal expression.year— year of the anchor date.month— month-of-year of the anchor date, from 1 to 12.day— day-of-month of the anchor date, from 1 to 31hour— hour-of-day of the anchor time, from 0 to 23minute— the minute-of-hour of the anchor time, from 0 to 59
- Returns:
- Normalized expression of the given temporal expression.
- Exceptions:
IllegalArgumentException— if the temporal expression is empty or cannot be normalized or the anchor is bad.
public String normalize( final String timex, final LocalDate anchorDate ) throws IllegalArgumentException
Normalize time using a provided anchor date.
- Parameters:
timex— Text containing temporal expression.anchorDate— The anchor time (required for resolving relative times like "today").
- Returns:
- Normalized expression of the given temporal expression.
- Exceptions:
IllegalArgumentException— if the temporal expression is empty or cannot be normalized.
public String normalize( final String timex, final LocalDateTime anchorTime ) throws IllegalArgumentException
Normalize time using a provided anchor date.
- Parameters:
timex— Text containing temporal expression.anchorTime— The anchor time (required for resolving relative times like "in an hour").
- Returns:
- Normalized expression of the given temporal expression.
- Exceptions:
IllegalArgumentException— if the temporal expression is empty or cannot be normalized.
public String normalize( final String timex, final TimeSpan anchorTime ) throws IllegalArgumentException
Normalize time using a provided anchor date.
- Parameters:
timex— Text containing temporal expression.anchorTime— The anchor time (required for resolving relative times like "today").
- Returns:
- Normalized expression of the given temporal expression.
- Exceptions:
IllegalArgumentException— if the temporal expression is empty or cannot be normalized.
- Returns:
- the timeout in milliseconds used by this normalizer.
- Returns:
- true if normalization returns simple text, e.g. 2012-05-13" rather than structured "TimeSpan(2026-06-15T00:00Z,2026-06-16T00:00Z,Period(Map(Days -> 1),Exact),Exact)"
shut down the executor.
Simple Callable that runs TimeNorm normalization so that it can be interrupted.
private TimeNormCallable( final String timex, final TimeSpan anchorTime, final boolean simpleFormat )
- Parameters:
timex— Text containing temporal expression.anchorTime— The anchor time (required for resolving relative times like "today").simpleFormat— true if the returned normalization should be simple "2012-05-13" rather than structured "TimeSpan(2026-06-15T00:00Z,2026-06-16T00:00Z,Period(Map(Days -> 1),Exact),Exact)"
- Returns:
- normalized version of the temporal expression.
- Create a directory named
1.0.0. - Run
mvn package. - Copy the jar file created by
mvn packageto that directory. - Copy the
pom.xmlinto that directory. - Rename the pom in that directory to
hnlp-timenorm-1.0.0.pom. - Zip the directory as a file named
hnlp-timenorm.zip. - Upload the zip file as an Asset to the release area at
https://github.com/HealthNLPorg/hnlp-timenorm/releases/tag/1.0.0