Classification
Identifying the document type is one of the first steps to begin document processing. Users can specify the document type by populating the document_type field with one of the document types listed below when uploading or updating a document. Sikoia also automatically classifies the uploaded file/document and populates the sikoia_document_type field too.
Document type comparison checkMany customers compare the
document_typeand thesikoia_document_typeto request an automatic resubmission when the two disagree.
Sikoia leverages advanced ML/AI deep-learning models that combine layout and language features to accurately detect and identify documents.
Document file splitting
During classification, Sikoia also detects whether a single file contains more than one document. When it does, the file is split so that each document can be handled on its own: every individual document identified within the file is assigned its own data_source_id, distinct from the file-level document_id/source_id. This split is what allows each document to be classified, validated, and extracted independently, and it's the same data_source_id referenced throughout the document lifecycle.
Multiple Documents in a Single FileCurrently our platform supports input files containing multiple documents of the same document type within a single file.
Each document is detected and processed individually, then mapped into a separate internal structure called a 'data source' to ensure accurate and organised data handling. Data sources are utilized for features such as Income & Employer Verification, Affordability Insights, and Completeness Checks. Learn more about data sources here
Learn more about data sourceshere.
Document Types
The currently supported document types are:
| Document type | Description | Standard extraction | Deep extraction |
|---|---|---|---|
| BankStatement | Detailed report of transactions and balances in a bank account. | ✅ | ✅ |
| BenefitAwardLetter | Letter from government agency confirming eligibility for social welfare benefits. | ✅ | |
| ChildBenefitLetter | Letter issued from the Child Benefit Office. | ✅ | |
| CompanyRegistration | Official record confirming a company's legal existence. | ✅ | |
| Contract | Any form of contractual agreement. | ✅ | |
| CreditReport | Document summarising the credit history of an individual or business. | ✅ | |
| EmploymentContract | Agreement setting out employment conditions, rights and responsibilities. | ✅ | |
| EmploymentLetter | Letter from an employer stating employment details. | ✅ | |
| FactFind | Document summarising essential information about an individual or business. | ✅ | |
| FinancialAccounts | Document showing profit and loss, balance sheet, and cash flow statements for an entity. | ✅ | |
| Identity | Government issued card or record to verify an individual's identity. | ✅ | |
| P45 | HMRC form filled by the employer when an employee stops working. | ✅ | |
| P60 | End-of-year certificate from HMRC. | ✅ | |
| Payslip | Document issued by an employer showing salary details. | ✅ | ✅ |
| PensionStatement | Pension plan statement detailing the amount in a pension pot | ✅ | |
| SA302 | Detailed income and tax calculation summary, either generated via HMRC or submitted by the individual. | ✅ | ✅ |
| SavingsStatement | Bank statement on savings, ISA, and other accounts. | ✅ | ✅ |
| StakeholdersDisclosure | Official record about entities holding an interest in a business or project. | ✅ | |
| StudentLoanLetter | Letter from a lender specifying details of a student loan. | ✅ | |
| Tax | Any other document issued by tax authorities. | ✅ | |
| TaxOverview | Less detailed tax calculation summary from HMRC. | ✅ | ✅ |
| UtilityBill | Statement detailing charges for essential services (e.g. electricity, water, gas). | ✅ | |
| Other | Any other document that doesn’t fit the types above. | ✅ |
