GTF文件
一、GTF文件格式
Fields must be tab-separated. Also, all but the final field in each feature line must contain a value; "empty" columns should be denoted with a '.'
1.seqname - name of the chromosome or scaffold; chromosome names can be given with or without the 'chr' prefix. Important note: the seqname must be one used within Ensembl, i.e. a standard chromosome name or an Ensembl identifier such as a scaffold ID, without any additional content such as species or assembly. See the example GFF output below.
2.source- name of the program that generated this feature, or the data source (database or project name)
3.feature- feature type name, e.g. Gene, Variation, Similarity
4.start- Start position of the feature, with sequence numbering starting at 1.
5.end- End position of the feature, with sequence numbering starting at 1.
6.score- A floating point value.
7.strand- defined as + (forward) or - (reverse).
8.frame- One of '0', '1' or '2'. '0' indicates that the first base of the feature is the first base of a codon, '1' that the second base is the first base of a codon, and so on..
9.attribute- A semicolon-separated list of tag-value pairs, providing additional information about each feature.
1.染色体名
2.注释信息的来源,比如”Genescan”、”Genbank” 等,可以为空,为空用”.”点号代替
3.注释信息的类型,比如Gene、cDNA、mRNA等,或者是SO对应的编号
4、5.开始和结束位置
7.序列的方向, +表示正义链, -反义链 , ? 表示未知
8.阅读框:有数字0、1和2。0代表序列的第一个碱基为密码子的第一个碱基,1代表是密码子第二个,2代表第三个。
最新文章
- Flask_more1
- linker command failed with exit code
- LeetCode——Balanced Binary Tree(判断是否平衡二叉树)
- ubuntu 14.04 java开发环境搭建 jdk 以及 inteliJ IDEA安装
- 【编程题目】输入两个整数 n 和 m,从数列 1,2,3.......n 中 随意取几个数, 使其和等于 m ... ★
- 【HDOJ】4983 Goffi and GCD
- startssl证书firefox支持配置
- jq基础
- 敏捷开发每日报告--day4
- [Noi2016]区间
- android使用.9图作为背景,内容不能居中的问题解决方案
- 带着新人看java虚拟机01
- 新装云服务器没有iptables 文件,并且无法通过service iptables save操作
- tiny6410 启动参数
- 剑指Offer 59. 按之字形顺序打印二叉树 (二叉树)
- 谷歌地图api 开发 (转载)
- excel 应用,右下角的小十字拖拽的时候形成递减的数列
- linux下pgAdmin4安装
- XPath Checker 和 firebug 插件使用
- servlet 简单filter避免中文乱码等