Data Sources

Data sources are internal Sikoia data structures that represent grouped units of data. These units can be utilized for various features, including Income & Employer Verification, Affordability Insights, and Completeness Checks.

Examples of Data Sources:

  • An individual payslip is an example of a data source.
  • An individual bank statement is an example of a data source.
  • An open banking account connection, which you can learn more about here, is also considered a data source.

Data Source status

The possible statuses for data sources are:

  • Provided: The document has been successfully uploaded into the system but is not supported for further processing.
  • Pending: The data source is being processed.  
    • If the data source is a document, it is in the process of being classified or being extracted.
    • If the data source is an open banking connection, the user has not yet completed the authorisation process.
  • Complete: The data source has been successfully processed.
  • Failed: The data source could not be processed for some reason. If a failed status is returned, a message will also be returned which includes more information on what the data source could not be processed.

Why might a single File yield multiple Data Sources?

A single uploaded file/document (identified by a unique document_id) may contain multiple distinct physical documents. Our platform detects these using document classification, and each detected document is represented as a separate data source in our system.

For example, if one uploaded file with a document_id contains two payslip documents, after processing and analysis, the system will create two separate data sources.

Why might a single Open Banking connection yield multiple Data Sources?

A single open banking connection (identified by a unique bank_connection_id) can be used to connect one or more bank accounts.

For example, if a person has a checking account, a savings account, and a joint account with the same financial institution and all of these accounts are connected through a single open banking connection, then they will share the same bank_connection_id. After processing, each account will be a distinct data source.

Learn more about bank and account connections here.

📘

You can retrieve all data sources for an entity with the endpoints here.