MPEG-G - Wikipedia
From Wikipedia, the free encyclopedia
ISO/IEC standard for genomic information representation

MPEG-G (ISO/IEC 23092) is an ISO/IEC standard for genomic information representation, developed jointly by ISO/IEC JTC 1/SC 29/WG 9 (MPEG) and ISO TC 276 "Biotechnology" Work Group 5. The goal of the standard is to provide interoperable solutions for data storage, access, and protection across different implementations, for data generated by high-throughput sequencing machines and by their subsequent processing and analysis.[1][2] The standard is composed of several parts, each addressing a specific aspect, such as compression, metadata association, Application Programming Interfaces (APIs), and reference software for data decoding. Alongside the reference decoder software, commercial and open source[3] implementations became available starting in 2019, progressively covering more of the published parts of the standard.

Background

The advent of high-throughput sequencing (HTS) technologies has revolutionized the field of quantitative biology. Large collections of genomic information are now part of everyday practice and have become a cornerstone of a number of disciplines, ranging from biological research to personalized medicine in the clinic. Currently, genomic information is mostly exchanged through a variety of data formats, such as FASTA/FASTQ for unaligned sequencing reads and SAM/BAM/CRAM for aligned reads. The ISO/IEC 23092 (MPEG-G) standard aims to provide a unified format for the efficient representation and compression of such diverse data, both for file storage and for data transport. To that end, the standard is divided into several parts.

Structure of the standard

The MPEG-G standard uses technology and data representation architectures previously validated in the field of digital media. They make it possible to compress and transport genome sequencing data even in complex scenarios, for instance when access is needed to large amounts of possibly distributed data, or when part of the data needs to be encrypted for privacy reasons. Conceptually, such requirements lead to the definition of a number of interrelated mechanisms, which are summarized in the following list:

  • Data format and compression[4]
  • Data streaming[4]
  • Compressed file concatenation[4]
  • Incremental update of sequencing data and metadata[4]
  • Selective access to compressed data, e.g. fast queries by genomic range[5]
  • Metadata association[6]
  • Enforcement of privacy rules[6]
  • Selective encryption of data and metadata[6]
  • Annotation and linkage of genomic segments[7]

In turn, some of these topics have been grouped together to make the standard easier to understand and implement. As a result, the ISO/IEC 23092 standard is structured as a series of separate documents, as follows:

MPEG-G Parts
Part | Number | First public release date (first edition) | Latest public release date (edition) | Latest amendment | Title | Description
Part 1 | ISO/IEC 23092-1 | 2019 | 2019 | | Transport and Storage of Genomic Information | Specification of file format, streaming and indexing[4]
Part 2 | ISO/IEC 23092-2 | 2019 | 2019 | | Coding of Genomic Information | Compression of unmapped (raw) and aligned genome sequencing data[5]
Part 3 | ISO/IEC 23092-3 | 2020 | 2020 | | Metadata and Application Programming Interfaces (APIs) | Specification of standard interfaces, syntax for metadata and description of content protection mechanisms[6]
Part 4 | ISO/IEC 23092-4 | (2020) | | | Reference Software | It describes the open source implementation of a normative decoder and informative encoder. It also provides compressed bitstreams that can be used for reference purposes. Note that other open source implementations developed by independent groups do exist[8][9]
Part 5 | ISO/IEC 23092-5 | (2020) | | | Conformance testing | It details the testing procedure and associated compressed reference bitstreams to be used to assess the conformance of a decoder implementation with the MPEG-G standard[10]
Part 6 | ISO/IEC 23092-6 | (2021) | | | Coding of genomic annotations | Compressed representation of genomic annotations, that is, a number of heterogeneous data types associated with intervals of the reference genome that the sequencing data has been aligned to.[7]

ISO/IEC 23092-1 MPEG-G Part 1

ISO/IEC 23092-1 specifies how genomic data is organized within MPEG-G structures for transport (i.e., streaming) and storage. This part defines the formats of the genomic record, the reference record, the MPEG-G file and the transport stream. It introduces the Access Unit as the container of the compressed genomic data and provides a reference conversion process among the different formats.
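
As a rough illustration of why a container such as the Access Unit helps with selective access, the following sketch indexes independently stored blocks by the genomic range they cover, so that a query touches only the overlapping blocks. It is not the normative MPEG-G layout: the AccessUnit fields, the RangeIndex class and the query method are hypothetical names chosen for this example.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass(frozen=True)
class AccessUnit:
    """Hypothetical stand-in for one block of compressed records."""
    sequence: str   # reference sequence name, e.g. "chr1"
    start: int      # first covered position (inclusive)
    end: int        # last covered position (inclusive)
    payload: bytes  # opaque compressed data, never inspected here


class RangeIndex:
    """Maps a genomic range to the access units that overlap it."""

    def __init__(self, units):
        self._by_seq = {}
        for unit in units:
            self._by_seq.setdefault(unit.sequence, []).append(unit)
        # Sort each sequence's units by start so queries can bisect.
        for seq_units in self._by_seq.values():
            seq_units.sort(key=lambda u: u.start)

    def query(self, sequence, start, end):
        units = self._by_seq.get(sequence, [])
        starts = [u.start for u in units]
        # Units that start after `end` cannot overlap the query range.
        hi = bisect_right(starts, end)
        return [u for u in units[:hi] if u.end >= start]
```

Only the payloads returned by query would need to be decoded, which is the idea behind fast queries by genomic range; a real index would also have to handle unmapped reads and multiple data classes.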

ISO/IEC 23092-2 MPEG-G Part 2

ISO/IEC 23092-2 specifies the syntax and methods for MPEG-G lossless compression of sequencing data and lossy compression of the associated quality scores. As is typical for MPEG standards, MPEG-G specifies only the decoding process, while the encoding process is left open to algorithmic and implementation-specific innovation. All conformant MPEG-G decoders produce identical outputs from the multiplexed bitstreams included in MPEG-G files and from the data streams in streaming scenarios.

The input to the encoder consists of genomic records or metadata, with optional reference data, while its output is an MPEG-G file or a transport stream.
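
To illustrate what lossy compression of quality scores can mean in practice, the sketch below applies uniform binning to Phred-style quality values, replacing each value with a representative of its bin so that fewer distinct symbols remain to be entropy coded. This is a generic technique shown under stated assumptions, not the quantization scheme specified by ISO/IEC 23092-2; the function name, the bin width and the maximum score of 41 are assumptions of the example.

```python
def quantize_quality(scores, bin_width=8, max_score=41):
    """Map each quality score to the midpoint of its bin (lossy).

    A bin_width of 1 leaves every score unchanged.
    """
    if bin_width < 1:
        raise ValueError("bin_width must be at least 1")
    quantized = []
    for q in scores:
        q = min(max(q, 0), max_score)             # clamp to the valid range
        low = (q // bin_width) * bin_width        # lower edge of the bin
        high = min(low + bin_width - 1, max_score)  # upper edge, clamped
        quantized.append((low + high) // 2)       # bin representative
    return quantized
```

With the default bin width, the scores 0 to 7 all map to 3 and the scores 8 to 15 all map to 11, so the original values cannot be recovered exactly. Sequence data, by contrast, would be coded losslessly.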

ISO/IEC 23092-3 MPEG-G Part 3

ISO/IEC 23092-3 specifies a metadata format and provides genomic data representation APIs to support interoperability among existing tools and systems. It specifies how an MPEG-G compliant bitstream can be integrated with metadata, as well as mechanisms for access control, integrity verification, authentication and authorization. This part also contains an informative section on the mapping between SAM and MPEG-G data structures, including backward compatibility with existing SAM content. It defines the following groups of API functions:

Groups of API Functions
Functions Group | Brief Description
Genomic Information | Functions used to query the structure of, and retrieve, the genomic information coded in a bitstream compliant with the ISO/IEC 23092 series.
Metadata | Functions used to query the structure of, and retrieve, the metadata associated with the coded genomic data.
Protection | Functions used to retrieve the protection metadata associated with the coded genomic data.
Reference | Functions used to retrieve the reference associated with a dataset.
Statistics | Functions used to retrieve statistics associated with a dataset.

ISO/IEC 23092-4 MPEG-G Part 4

ISO/IEC 23092-4[9] specifies the reference software for genomic information representation, referred to as the genomic model (GM). It consists of two components: the reference encoder software and the reference decoder software. The reference decoder software is provided to assess conformance to the requirements of ISO/IEC 23092-1,[4] ISO/IEC 23092-2[5] and ISO/IEC 23092-6,[7] while the reference encoder software serves as a guide for implementing those standards. The reference encoder, called Genie,[3] is open source software developed by a group of individuals from multiple universities and companies around the world. It features the following components:

Reference Software Components
Part | Number | Component Description
Part 1[4] | ISO/IEC 23092-1 | Encapsulation; Indexing
Part 2[5] | ISO/IEC 23092-2 | Classification; Reference engine; Quality value quantization; Descriptor subsequence generation; Transformations; Entropy encoding
Part 6 | ISO/IEC 23092-6 | (To be determined)

ISO/IEC 23092-5 MPEG-G Part 5

ISO/IEC 23092-5 specifies conformance testing for the coding of genomic information. Part 5 provides a means to test and validate the correct implementation of the MPEG-G technology in different devices and applications, in order to ensure interoperability among all systems. It specifies a normative procedure to assess conformity to the standard on an exhaustive set of compressed data.
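
The general shape of such a test can be sketched as follows: each reference bitstream is decoded and the result is compared against the expected output, here by SHA-256 digest. This is only an illustration of the idea and not the normative procedure of ISO/IEC 23092-5; check_conformance and the case tuples are hypothetical, and zlib.decompress stands in for an MPEG-G decoder purely so that the example runs.

```python
import hashlib
import zlib


def digest(data: bytes) -> str:
    """Hex SHA-256 digest used to compare decoder outputs."""
    return hashlib.sha256(data).hexdigest()


def check_conformance(decode, cases):
    """Return the names of the cases the decoder fails.

    `cases` is an iterable of (name, bitstream, expected_digest) tuples.
    A case fails if decoding raises or the output digest differs.
    """
    failures = []
    for name, bitstream, expected_digest in cases:
        try:
            decoded = decode(bitstream)
        except Exception:
            failures.append(name)
            continue
        if digest(decoded) != expected_digest:
            failures.append(name)
    return failures


# Usage with a stand-in decoder (zlib is NOT an MPEG-G codec).
reference_cases = [
    ("reads-01", zlib.compress(b"ACGTACGT"), digest(b"ACGTACGT")),
]
failed = check_conformance(zlib.decompress, reference_cases)
```

Because the standard fixes the decoding process, comparing decoder output against a reference is sufficient to detect a non-conformant decoder; an empty failure list means every case matched.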

MIME Type and Filename extensions

No MIME type (an IANA media type registered under RFC 6838) is currently defined for MPEG-G files.

No conventional file extensions are defined.

See also

  • MPEG
  • ISO/IEC JTC 1/SC 29

References

  1. Alberti, Claudio; Paridaens, Tom; Voges, Jan; Naro, Daniel; Ahmad, Junaid; Ravasi, Massimo; Renzi, Daniele; Zoia, Giorgio; Ribeca, Paolo; Ochoa, Idoia; Mattavelli, Marco; Delgado, Jaime; Hernaez, Mikel (October 2018). "An introduction to MPEG-G, the new ISO standard for genomic information representation". bioRxiv 10.1101/426353.
  2. Hernaez, Mikel; Pavlichin, Dmitri; Weissman, Tsachy; Ochoa, Idoia (2019-07-20). "Genomic Data Compression". Annual Review of Biomedical Data Science. 2 (1): 19–37. doi:10.1146/annurev-biodatasci-072018-021229. ISSN 2574-3414. S2CID 88495878.
  3. "Genie, Open Source MPEG-G Codec". GitHub. 22 June 2021.
  4. "ISO/IEC 23092-1 Transport and Storage of Genomic Information".
  5. "ISO/IEC 23092-2 Coding of Genomic Information".
  6. "ISO/IEC 23092-3 Metadata and APIs".
  7. "ISO/IEC 23092-6 Coding of Genomic Annotations".
  8. Bliss, Brian; Allen, Joshua; Baheti, Saurabh; Bockol, Matthew; Delgado, Jaime; Fostier, Jan; Gelpi, Josep; Hart, Steven; Hernaez, Mikel; Hudson, Matthew; Kalmbach, Michael; Klee, Eric; Mainzer, Liudmila; Müntefering, Fabian; Naro, Daniel; Ochoa, Idoia; Ostermann, Joern; Paridaens, Tom; Ross, Christian; Voges, Jan; Wieben, Eric; Yang, Mingyu; Weissman, Tsachy; Wiepert, Mathieu (November 2019). "Genie: an MPEG-G conformant software to compress genomic data" (PDF).
  9. "ISO/IEC 23092-4 Reference Software".
  10. "ISO/IEC 23092-5 Conformance".

External links

  • mpeg-g.org
  • MPEG web site
  • ISO/IEC 23092-1
  • ISO/IEC 23092-2
  • ISO/IEC 23092-3
  • ISO/IEC 23092-4
  • ISO/IEC 23092-5
  • ISO/IEC 23092-6