IBM SystemT is a declarative information extraction system. It was first built in 2005, as a research project at IBM's IBM Almaden Research Center. Its name is partially inspired by System R, a seminal project from the same research center.
Developer(s) | IBM |
---|---|
Written in | AQL , Java |
Operating system | Linux, macOS, Windows |
Type | Information Extraction, Text mining |
Website | SystemT |
SystemT[1] comprises the following three main components: (1) AQL, a declarative rule language with a similar syntax to SQL; (2) Optimizer, which accepts AQL statements as input and generates high-performance algebraic execution plans; and (3) Executing engine, which executes the plan generated by the Optimizer and performs information extraction over input documents.
SystemT is available as part of IBM BigInsights,[2] and has also been taught in multiple universities around the globe. A version of SystemT was available (starting in September 2016) as a companion to a sequence of online courses in Text Analytics.[3][4]
References
edit- ^ Chiticariu, Laura; Krishnamurthy, Rajasekar; Li, Yunyao; Raghavan, Sriram; Reiss, Frederick R.; Vaithyanathan, Shivakumar (2010-01-01). "SystemT: An Algebraic Approach to Declarative Information Extraction". Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. ACL '10. Uppsala, Sweden: Association for Computational Linguistics: 128–137.
- ^ IBM BigInsights
- ^ Text Analytics: Getting Results with SystemT
- ^ Chiticariu, Laura; Danilevsky, Marina; Li, Yunyao; Reiss, Frederick; Zhu, Huaiyu. "SystemT: Declarative Text Understanding for Enterprise". Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 3 (Industry Papers). Association for Computational Linguistics: 76–83.
External links
edit