IBM SystemT is a declarative information extraction system. It was first built in 2005, as a research project at IBM's IBM Almaden Research Center. Its name is partially inspired by System R, a seminal project from the same research center.

IBM SystemT
Developer(s)IBM
Written inAQL [de], Java
Operating systemLinux, macOS, Windows
TypeInformation Extraction, Text mining
WebsiteSystemT

SystemT[1] comprises the following three main components: (1) AQL, a declarative rule language with a similar syntax to SQL; (2) Optimizer, which accepts AQL statements as input and generates high-performance algebraic execution plans; and (3) Executing engine, which executes the plan generated by the Optimizer and performs information extraction over input documents.

SystemT is available as part of IBM BigInsights,[2] and has also been taught in multiple universities around the globe. A version of SystemT was available (starting in September 2016) as a companion to a sequence of online courses in Text Analytics.[3][4]

References

edit
  1. ^ Chiticariu, Laura; Krishnamurthy, Rajasekar; Li, Yunyao; Raghavan, Sriram; Reiss, Frederick R.; Vaithyanathan, Shivakumar (2010-01-01). "SystemT: An Algebraic Approach to Declarative Information Extraction". Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. ACL '10. Uppsala, Sweden: Association for Computational Linguistics: 128–137.
  2. ^ IBM BigInsights
  3. ^ Text Analytics: Getting Results with SystemT
  4. ^ Chiticariu, Laura; Danilevsky, Marina; Li, Yunyao; Reiss, Frederick; Zhu, Huaiyu. "SystemT: Declarative Text Understanding for Enterprise". Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 3 (Industry Papers). Association for Computational Linguistics: 76–83.
edit