2021-11-02 13:13:27 +00:00
|
|
|
# Resources definition
|
2021-10-08 11:57:29 +00:00
|
|
|
|
2021-10-31 11:53:58 +00:00
|
|
|
The resources section defines the files that are grouped
|
2021-10-10 02:07:46 +00:00
|
|
|
together by association. This association is not defined but can
|
|
|
|
|
include different formats of the same data or a common batch extract
|
|
|
|
|
such as end of day.
|
2021-10-08 11:57:29 +00:00
|
|
|
|
2021-10-10 02:07:46 +00:00
|
|
|
Some files may expand to multiple files if they are
|
|
|
|
|
compressed with a utility such as WinZIP or 7ZIP. In the situation
|
|
|
|
|
where a ZIP file expands to multiple documents, then the expectation is
|
2021-10-14 10:50:55 +00:00
|
|
|
that the ZIP file contains a **martiLQ** document describing its contents.
|
2021-10-10 02:07:46 +00:00
|
|
|
|
2021-10-31 11:53:58 +00:00
|
|
|
The elements in the resource section are:
|
2021-10-08 11:57:29 +00:00
|
|
|
|
|
|
|
|
* Title
|
|
|
|
|
* Document name - Commonly being absolute or relative file name.
|
|
|
|
|
This value could also be an URL address or network path
|
|
|
|
|
* Issued date - When the document was made available. The date can include time
|
|
|
|
|
* Modified - When the document was created or modified. This is the data and time
|
2021-10-10 02:07:46 +00:00
|
|
|
* Size of file - The file size in bytes
|
|
|
|
|
* Hash of file - The hash of the file, which can be blank especially for large files
|
2021-10-08 11:57:29 +00:00
|
|
|
* Hash algorithm
|
2021-10-31 11:53:58 +00:00
|
|
|
* Attributes - List of attributes associated with the document
|
2021-10-08 11:57:29 +00:00
|
|
|
|
2021-10-31 11:53:58 +00:00
|
|
|
The following are optional in the resource section.
|
2021-10-08 11:57:29 +00:00
|
|
|
|
|
|
|
|
* Identifier
|
|
|
|
|
* Description
|
|
|
|
|
* Download URL
|
2021-10-10 02:07:46 +00:00
|
|
|
* Version - File version. The same file could be updated or this might denote the next version
|
2021-10-08 11:57:29 +00:00
|
|
|
of a regular report. For example a daily extract will have the version number incremented
|
2021-10-10 02:07:46 +00:00
|
|
|
every day and provide a new URL. The previous file can be retained.
|
2021-10-31 11:53:58 +00:00
|
|
|
* Content type - if not specified then the consumer will in all likelihood use the file extension / mime type
|
2021-10-10 02:07:46 +00:00
|
|
|
* Expiry Date - The date and time that this file expires and can be removed from the download URL
|
|
|
|
|
location. This is not the file retention period as might be required for archiving.
|
|
|
|
|
* Described By - A link to the metadata describing this file data and format
|
2021-10-08 11:57:29 +00:00
|
|
|
* Compression - Type of compression used if any
|
|
|
|
|
* Encryption - Type of encryption used if any
|
|
|
|
|
|
|
|
|
|
|
2021-11-02 13:13:27 +00:00
|
|
|
## Compression
|
2021-10-08 11:57:29 +00:00
|
|
|
|
2021-10-10 02:07:46 +00:00
|
|
|
Files can be compressed using a utility. A single compressed file can contain
|
2021-10-14 10:50:55 +00:00
|
|
|
multiple files. The **martiLQ** definition document applies to the compressed file
|
2021-10-10 02:07:46 +00:00
|
|
|
and not to the contents, which could be multiple files.
|
2021-10-08 11:57:29 +00:00
|
|
|
|
2021-10-14 10:50:55 +00:00
|
|
|
In the case of a compressed files, there should be a **martiLQ** definition document in the
|
2021-10-10 02:07:46 +00:00
|
|
|
compressed file.
|
2021-10-08 11:57:29 +00:00
|
|
|
|
2021-10-10 02:07:46 +00:00
|
|
|
Compression of files always occur before encryption.
|
2021-10-08 11:57:29 +00:00
|
|
|
|
2021-11-02 13:13:27 +00:00
|
|
|
### martiLQ definition for Compressed File
|
2021-10-08 11:57:29 +00:00
|
|
|
|
2021-10-10 02:07:46 +00:00
|
|
|
For a compressed file that is not encrypted, the distribution definition will be:
|
2021-10-08 11:57:29 +00:00
|
|
|
|
2021-10-10 02:07:46 +00:00
|
|
|
* Title - The compressed file title which could be a group name
|
2021-10-08 11:57:29 +00:00
|
|
|
* Document name - Commonly being absolute or relative file name.
|
|
|
|
|
This value could also be an URL address or network path
|
2021-10-10 02:07:46 +00:00
|
|
|
* Issued date - When the compressed file was made available.
|
|
|
|
|
* Modified - When the compressed file was created or modified. This is the date and time
|
|
|
|
|
and is not the modified date of the file in the compressed file.
|
|
|
|
|
* Size of file - The compressed file size in bytes
|
|
|
|
|
* Hash of file - The hash of the compressed file, which can be
|
|
|
|
|
blank especially for large files
|
2021-10-08 11:57:29 +00:00
|
|
|
* Hash algorithm
|
|
|
|
|
|
|
|
|
|
The reason for this approach is it allows a generic tool to be deployed to
|
|
|
|
|
check the validity of the contents without unpacking the received /fetched
|
2021-10-10 02:07:46 +00:00
|
|
|
file. That is you can perform load quality pipeline processing.
|
2021-10-08 11:57:29 +00:00
|
|
|
|
2021-11-02 13:13:27 +00:00
|
|
|
## Encryption
|
2021-10-08 11:57:29 +00:00
|
|
|
|
|
|
|
|
The encryption of content is always applied after compression not before, if
|
|
|
|
|
you are not using the compression tool native encryption. WinZIP and 7ZIP
|
|
|
|
|
provide encryption within the tool execution.
|
|
|
|
|
|
|
|
|
|
If the compression is TAR or GZIP then you may consider applying a GPG
|
|
|
|
|
or other encryption algorithm to the compressed file.
|
|
|
|
|
|
2021-10-10 02:07:46 +00:00
|
|
|
* Title - The encrypted file title
|
2021-10-08 11:57:29 +00:00
|
|
|
* Document name - Commonly being absolute or relative file name.
|
|
|
|
|
This value could also be an URL address or network path
|
2021-10-10 02:07:46 +00:00
|
|
|
* Issued date - When the **encrypted** file was made available.
|
|
|
|
|
* Modified - When the **encrypted** file was created or modified.
|
|
|
|
|
This is the data and time and is not the modified date of the encrypted file.
|
|
|
|
|
* Size of file - The **decrypted** file size in bytes
|
|
|
|
|
* Hash of file - The hash of the **decrypted** file, which can be
|
|
|
|
|
blank especially for large files
|
2021-10-08 11:57:29 +00:00
|
|
|
* Hash algorithm
|
|
|
|
|
|
2021-10-10 02:07:46 +00:00
|
|
|
The rational for using the decrypted file attributes is that an ecrypted
|
|
|
|
|
file is unlikely to be able to be modified without knowing encryption keys.
|
|
|
|
|
Checking the decrypted fille attributes is a better check.
|
2021-10-08 11:57:29 +00:00
|
|
|
|
|
|
|
|
The reason for this approach is it allows a generic tool to be deployed to
|
2021-10-10 02:07:46 +00:00
|
|
|
decrypt and check the validity of the received / fetched file without
|
2021-10-08 11:57:29 +00:00
|
|
|
needing to understand the contents. That is you can perform load quality
|
|
|
|
|
pipeline processing.
|