GTF (Gene Transfer Format) Documentation

Here are documentations for GTF: GTF2.2 and gtf2.2.

One example:

GTFsnippet

GTFsnippet

Another example:

GTFsnippet

GTFsnippet

Data extraction from the GTF file

number of genes in the file

cat Xxxxxx.gtf|cut -f9|cut -d “;” -f2|sort -u |wc -l

cut –f9 Xxxxxx.gtf | cut –d ‘ ‘ –f2 | sort –u | wc –l

number of transcripts in the file

cat Xxxxxx.gtf|cut -f9|cut -d “;” -f1,2|uniq|cut -d “;” -f1|uniq -c |awk ‘$1==“1”’|wc -l

cut –f9 Xxxxxx.gtf | cut –d ‘ ‘ –f4 | sort –u | wc –l

number of single transcript genes

cat Xxxxxx.gtf|cut -f9 |cut -d “;” -f1,2 | uniq | cut -d “;” -f1 | uniq -c | awk ‘$1 == 1’ | wc -l

number of single-exon transcripts

cut –f9 Xxxxxx.gtf | cut –d ‘ ‘ –f2,4 | sort –u | cut –d ‘‘ –f1 | sort | uniq –c | grep –c “ 1 “

number of multi-exon transcripts

cut –f9 Xxxxxx.gtf | cut –d ‘ ‘ –f4 | sort | uniq –c | grep –c “ 2 “

LS0tDQp0aXRsZTogIkdURjogQW5ub3RhdGlvbiBhbmQgYW5hbHlpcyBvZiBuZXh0IGdlbmVyYXRpb24gc2VxdWVuY2luZyAoTkdTKSBkYXRhIg0KYXV0aG9yOiAiQmhhZ2lyYXRoaSBEYXNoIg0KZGF0ZTogIkRlY2VtYmVyIDIsIDIwMTgiDQpvdXRwdXQ6IGh0bWxfbm90ZWJvb2sNCi0tLQ0KDQojIEdURiAoR2VuZSBUcmFuc2ZlciBGb3JtYXQpIERvY3VtZW50YXRpb24NCg0KSGVyZSBhcmUgZG9jdW1lbnRhdGlvbnMgZm9yIEdURjogW0dURjIuMl0oaHR0cDovL21ibGFiLnd1c3RsLmVkdS9HVEYyMi5odG1sKSBhbmQgW2d0ZjIuMl0oaHR0cHM6Ly91c2Vhc3QuZW5zZW1ibC5vcmcvaW5mby93ZWJzaXRlL3VwbG9hZC9nZmYuaHRtbCkuDQoNCk9uZSBleGFtcGxlOg0KDQohW0dURnNuaXBwZXRdKEM6L1VzZXJzL2JoYWdpL09uZURyaXZlL1Byb2dyYW1taW5nLUxpbnV4L0NvbW1hbmRMaW5lVG9vbHMvR1RGMi4yLkpQRykNCg0KQW5vdGhlciBleGFtcGxlOg0KDQohW0dURnNuaXBwZXRdKEM6L1VzZXJzL2JoYWdpL09uZURyaXZlL1Byb2dyYW1taW5nLUxpbnV4L0NvbW1hbmRMaW5lVG9vbHMvR1RGMi4yZXgyLkpQRykNCg0KDQojIERhdGEgZXh0cmFjdGlvbiBmcm9tIHRoZSBHVEYgZmlsZQ0KDQojIyMgbnVtYmVyIG9mIGdlbmVzIGluIHRoZSBmaWxlIA0KDQpjYXQgWHh4eHh4Lmd0ZnxjdXQgLWY5fGN1dCAtZCAiOyIgLWYyfHNvcnQgLXUgfHdjIC1sDQoNCmN1dCDigJNmOSBYeHh4eHguZ3RmIHwgY3V0IOKAk2Qg4oCYIOKAmCDigJNmMiB8IHNvcnQg4oCTdSB8IHdjIOKAk2wNCg0KIyMjIG51bWJlciBvZiB0cmFuc2NyaXB0cyBpbiB0aGUgZmlsZQ0KDQpjYXQgWHh4eHh4Lmd0ZnxjdXQgLWY5fGN1dCAtZCAiOyIgLWYxLDJ8dW5pcXxjdXQgLWQgIjsiIC1mMXx1bmlxIC1jIHxhd2sgJyQxPT0iMSInfHdjIC1sDQoNCmN1dCDigJNmOSBYeHh4eHguZ3RmIHwgY3V0IOKAk2Qg4oCYIOKAmCDigJNmNCB8IHNvcnQg4oCTdSB8IHdjIOKAk2wNCg0KDQojIyMgbnVtYmVyIG9mIHNpbmdsZSB0cmFuc2NyaXB0IGdlbmVzDQoNCmNhdCBYeHh4eHguZ3RmfGN1dCAtZjkgfGN1dCAtZCAiOyIgLWYxLDIgfCB1bmlxIHwgY3V0IC1kICI7IiAtZjEgfCB1bmlxIC1jIHwgYXdrICckMSA9PSAxJyB8IHdjIC1sDQoNCg0KIyMjIG51bWJlciBvZiBzaW5nbGUtZXhvbiB0cmFuc2NyaXB0cw0KDQpjdXQg4oCTZjkgWHh4eHh4Lmd0ZiB8IGN1dCDigJNkIOKAmCDigJgg4oCTZjIsNCB8IHNvcnQg4oCTdSB8IGN1dCDigJNkIOKAmOKAmCDigJNmMSB8IHNvcnQgfCB1bmlxIOKAk2MgfCBncmVwIOKAk2Mg4oCcIDEg4oCcDQoNCg0KIyMjIG51bWJlciBvZiBtdWx0aS1leG9uIHRyYW5zY3JpcHRzDQoNCmN1dCDigJNmOSBYeHh4eHguZ3RmIHwgY3V0IOKAk2Qg4oCYIOKAmCDigJNmNCB8IHNvcnQgfCB1bmlxIOKAk2MgfCBncmVwIOKAk2Mg4oCcIDIg4oCcDQo=