GroupDocs.Classification is a simple document and text classification API for C#, ASP.NET, VB.NET, J# or any other .NET-based application. Developers can work with two types of taxonomy: IAB-2, which assigns standardized categories to text, and the documents taxonomy developed by Aspose. The library can analyze whole texts, individual sentences, or even single words, and it classifies a variety of industry-standard document formats, including PDF, Microsoft Word, OpenDocument, RTF and plain text.
GroupDocs.Classification for .NET uses its own document processing engine and does not require any external tools to be installed on the system. It targets the .NET platform and runs on all popular operating systems (Windows, Linux, macOS) where a .NET runtime (including .NET Core) can be installed.
A summarized overview of the features offered by GroupDocs.Classification for .NET:
- Document classification by path and stream
- Classify raw text
- IAB-2 and documents taxonomies supported
- Supports multiple document formats
GroupDocs.Classification for .NET supports a number of popular document formats.
- Word: DOC, DOCX, DOCM, DOT, DOTX, DOTM, RTF
- Fixed Layout: PDF
- OpenDocument: ODT, OTT
- Text: TXT
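Because only the formats listed above are accepted, it can help to filter files by extension before handing them to the classifier. The following is a minimal sketch of such a pre-check. The `SupportedFormats` class and its `IsSupported` method are hypothetical helpers written for this illustration; they are not part of the GroupDocs.Classification API, and the extension set is simply copied from the list above.

```csharp
using System;
using System.Collections.Generic;
using System.IO;

// Hypothetical helper, not part of GroupDocs.Classification.
public static class SupportedFormats
{
    // Extensions copied from the format list above; lookups ignore case.
    private static readonly HashSet<string> Extensions =
        new HashSet<string>(StringComparer.OrdinalIgnoreCase)
        {
            ".doc", ".docx", ".docm", ".dot", ".dotx", ".dotm", ".rtf", // Word
            ".pdf",                                                     // Fixed layout
            ".odt", ".ott",                                             // OpenDocument
            ".txt"                                                      // Text
        };

    // Returns true when the file name ends with one of the listed extensions.
    public static bool IsSupported(string path)
    {
        if (string.IsNullOrEmpty(path))
        {
            return false;
        }

        return Extensions.Contains(Path.GetExtension(path));
    }
}
```

For example, `SupportedFormats.IsSupported("report.PDF")` returns `true`, while `SupportedFormats.IsSupported("image.png")` returns `false`. The check looks only at the file name, so it does not confirm that the file content is actually in that format.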
GroupDocs.Classification for .NET supports the following operating systems, frameworks and package managers:
Operating Systems
- Windows 10 (x64)
- Windows Desktop (x64)
- Windows Server (x64)
- Mac OS X x64 (10.12+)
Frameworks
- .NET Core 2.0 or later
- .NET Framework 2.0 or later
Advanced Text & Documents Classification API Features
- Classify documents by path using the IAB-2 or documents taxonomy
- Classify raw text using the documents or IAB-2 taxonomy
- Choose the number of classification results to return
- Work with PDF, Word, OpenDocument and Rich Text documents
- Working examples and demos are provided to help you learn the supported features quickly
- Unlimited free technical support is provided through the product forums
Precise Document Classification
GroupDocs.Classification for .NET can classify documents in any of the supported formats listed above. The following C# code example shows how to classify a PDF file with the IAB-2 taxonomy and return the three best results.
Document Classification by Path using IAB-2 Taxonomy - C#
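var classifier = new Classifier();
// Classify document.pdf in the current directory and return the 3 best IAB-2 classes.
var response = classifier.Classify("document.pdf", ".", 3, Taxonomy.Iab2);
// Interpolate both values; passing them as two arguments would print only the class name.
Console.WriteLine($"{response.BestClassName}: {response.BestClassProbability}");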
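Raw text classification, listed among the features above, can be sketched in the same way as the path-based call. The example below is an illustration under stated assumptions rather than a definitive reference: it assumes a `Classify` overload that takes the text, the number of results and the taxonomy, it assumes the `Taxonomy` enum lives in the `GroupDocs.Classification.DTO` namespace, and the sample sentence is made up. Check the API reference for the exact signatures before relying on it.

```csharp
using System;
using GroupDocs.Classification;
using GroupDocs.Classification.DTO; // assumed namespace of the Taxonomy enum

public static class RawTextExample
{
    public static void Main()
    {
        var classifier = new Classifier();

        // Assumed overload: Classify(text, number of results, taxonomy).
        var response = classifier.Classify(
            "Medicine is an important part of our lives.", 3, Taxonomy.Iab2);

        // Same response members as in the path-based example.
        Console.WriteLine($"{response.BestClassName}: {response.BestClassProbability}");
    }
}
```

Running this requires a project that references the GroupDocs.Classification package.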