DNA Patterns

DNA patterns are graphs of DNA or RNA sequences. Various functional structures such as promoters and genes, or larger structures like bacterial or viral genomes, can be analyzed using DNA patterns.

Method

The technique was described in 2012 by Paul Gagniuc and Constantin Ionescu-Tirgoviste. They adapted algorithms from cryptography and optical character recognition to make their graphs. To graph a DNA pattern, two values, kappa index of coincidence and the total percentage of cytosine plus guanine (C + G)% are calculated from a sliding window which is "circulated" over the DNA sequence. The kappa index of coincidence measures the degree of organization or randomness of a sequence.

ten generic classes of gene promoters

The analysis of such two-dimensional patterns can be performed by considering their shape and density (using optical character recognition algorithms) and the trend-line of the points. Inside a pattern, long homopolymeric tracts will be plotted in the upper part of the pattern (relative to the nucleotide frequency of the entire sequence) and tandem short tracts will be plotted in the middle of the pattern. As the homopolymeric tracts become shorter and shorter (up to di- or tri- nucleotide formations), the kappa value decreases and the point on the pattern will be placed also in the middle, but lower on the Y-axis. All the values generated by the same repetitive sequences will be positioned in exactly the same point on the pattern (total points inside the pattern = promoter length - sliding window length).

Example

Human INS (insulin) gene promoter ranging from -499b to 100b, relative to the TSS (transcription start site). >gi|224514737|ref|NT_009237.18|:c2122939-2121009 H.sapiens INS gene region, 500 bases upstream of TSS:

GGTGTGGGGACAGGGGTGTGGGGACAGGGGTCTGGGGACAGGGGTGTGGG
GACAGGGGTCCTGGGGACAGGGGTGTGGGGATAGGGGTGTGGGGACAGGG
GTGTGGGGACAGGGGTGTGGGGACAGGGGTCTGGGGACAGCAGCGCAAAG
AGCCCCGCCCTGCAGCCTCCAGCTCTCCTGGTCTAATGTGGAAAGTGGCC
CAGGTGAGGGCTTTGCTCTCCTGGAGACATTTGCCCCCAGCTGTGAGCAG
GGACAGGTCTGGCCACCGGGCCCCTGGTTAAGACTCTAATGACCCGCTGG
TCCTGAGGAAGAGGTGCTGACGACCAAGGAGATCTTCCCACAGACCCAGC
ACCAGGGAAATGGTCCGGAAATTGCAGCCTCAGCCCCCAGCCATCTGCCG
ACCCCCCCACCCCAGGCCCTAATGGGCCAGGCGGCAGGGGTTGAGAGGTA
GGGGAGATGGGCTCTGAGACTATAAAGCCAGCGGGGGCCCAGCAGCCCTC

External links

🪦 Wikipedia History

4.5 yearsage

1editors

1edits

Archive Provenance

Created: May 7, 2014

Deleted: November 18, 2018

Article size: 4.2 KB

Technical Metadata

Wikipedia page ID: 38397698

Metadata captured: May 9, 2026 10:04 PM

Metadata updated: May 9, 2026 10:04 PM

Subject Tags

BioinformaticsComputational biologyDNA sequencingMathematical and theoretical biologyMolecular biology techniques

Why Deleted

Speedy

G11

Unambiguous advertising: Content that is purely promotional with no encyclopedic value

Unambiguous advertising or promotion

by RHaworth

G11: Unambiguous advertising or promotion

Sources

http://patternsdna.blogspot.com/

http://revue.elth.pub.ro/upload/318569art11.pdf

www.biomedcentral.com/...

Additional preserved links are available in the archive details below.

Archive Inventory

View stored source record counts

Revision rows stored: 0

Outgoing links stored: 3

External links stored: 4

Templates stored: 5

Talk exports stored: 0

AfD exports stored: 0

Raw API payloads stored: 0

Image records stored: 2

View full source metadata

Outgoing Wikipedia links (3)

Index of coincidenceOptical character recognitionSliding window

External links (4)

http://patternsdna.blogspot.com/

http://revue.elth.pub.ro/upload/318569art11.pdf

www.biomedcentral.com/...

Templates (5)

Cite journalCite webDEFAULTSORT:Dna patternsGenetics sidebarReflist

DNA Patterns

Method

Example

External links

See Also

Herbert Daniel Landahl

CERF (software)

Predictprotein

Chou's invariance theorem

FastPCR