GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:12:46

Sequence gi568815581f:70032088_70233341 : 201254 bp : 38.09% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -    743    738    6                               1.05
 1.05 Term -   3813   3728   86  1  2   65   43   105 0.584   0.54
 1.04 Intr -   5701   5595  107  0  2   76  115    -6 0.200  -0.16
 1.03 Intr -   9569   9384  186  1  0   62   14   261 0.612  13.98
 1.02 Intr -  28926  28818  109  1  1   72   81    72 0.159   3.32
 1.01 Init -  62829  62727  103  0  1   63   63    65 0.055   1.95
 1.00 Prom -  83839  83800   40                              -2.15

 2.00 Prom +  86004  86043   40                              -3.25
 2.01 Init +  96780  96902  123  2  0   76   87   148 0.690  13.72
 2.02 Term +  99908 101257 1350  1  0   95   37   932 0.989  79.27
 2.03 PlyA + 101583 101588    6                               1.05

 3.00 Prom + 105445 105484   40                              -5.35
 3.01 Init + 119145 119518  374  2  2   44   71   282 0.079  18.58
 3.02 Intr + 125244 125471  228  0  0   53   48   120 0.001   0.56
 3.03 Intr + 131211 131275   65  0  2   79   72     3 0.002  -4.76
 3.04 Intr + 136676 137614  939  0  0   60  110   286 0.068  18.21
 3.05 Term + 142927 144236 1310  2  2   10   42  1518 0.014 130.15
 3.06 PlyA + 144944 144949    6                               1.05

 4.04 PlyA - 145172 145167    6                               1.05
 4.03 Term - 150203 150135   69  2  0   85   43    58 0.134  -2.04
 4.02 Intr - 166086 165954  133  2  1  113   46   102 0.622   8.23
 4.01 Init - 176524 176514   11  1  2   82  116     9 0.804   3.03
 4.00 Prom - 180514 180475   40                              -4.15

 5.03 PlyA - 180672 180667    6                              -0.45
 5.02 Term - 181689 181525  165  0  0   77   40   128 0.111   3.83
 5.01 Intr - 190169 190056  114  2  0   55   99    37 0.019   1.22

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 142953 144236 1284  2  0   82   42  1493 0.970 139.79

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:70032088_70233341|GENSCAN_predicted_peptide_1|196_aa
MTGDDFLWTNSTALQDRIWRARVPAVSQDWLTLAIILLARDMLISPASDNGATNLHLVQA
QNFSLQPILMNIGAIPENSSSQKDYLRLSSDIIIEEKQLHADVAVLFTGIEMIMMKKRCF
KDMNGREETHKPSPTHYKPKYHGKLLQHWEQMVKFSIKEKGKQTCLKQGEMMKGPLHPLS
HGDGGSVAEQEELKQP

>gi568815581f:70032088_70233341|GENSCAN_predicted_CDS_1|591_bp
atgactggtgatgattttctgtggacaaattctacggccttgcaagataggatttggaga
gcaagagttcctgccgtttctcaggactggttaactctagccataatattgctggcccgt
gatatgctaatcagccctgcctctgataatggtgctacaaatctgcaccttgttcaggca
cagaatttttctctccaaccaatactgatgaatattggagccattccagaaaacagctct
agtcaaaaggactatttacgtctatcgtcggacattattattgaggaaaaacaactgcac
gcagatgttgcagtacttttcacgggcatagaaatgatcatgatgaaaaagcgctgtttc
aaagatatgaacggacgtgaagaaacccataaacccagtcctacccactataaacctaaa
taccatggtaagctattacagcattgggaacagatggtgaagttttccataaaagagaaa
gggaaacagacctgtctgaagcaaggtgagatgatgaagggtccacttcatcccttgagc
catggagatggtggtagtgtggcagaacaggaggagctcaaacagccttaa

>gi568815581f:70032088_70233341|GENSCAN_predicted_peptide_2|490_aa
MASAPNEYAAGVYKVSCKQEVSQSDVTATQYADGLSFRLQRVLTENPNQEIATSLEFLLL
QNSPGSLRAQQRMSYYGSSYHIINADAKYPGYPPEHIIAEKRRARRRLLHKDGSCNVYFK
HIFGEWGSYVVDIFTTLVDTKWRHMFVIFSLSYILSWLIFGSVFWLIAFHHGDLLNDPDI
TPCVDNVHSFTGAFLFSLETQTTIGYGYRCVTEECSVAVLMVILQSILSCIINTFIIGAA
LAKMATARKRAQTIRFSYFALIGMRDGKLCLMWRIGDFRPNHVVEGTVRAQLLRYTEDSE
GRMTMAFKDLKLVNDQIILVTPVTIVHEIDHESPLYALDRKAVAKDNFEILVTFIYTGDS
TGTSHQSRSSYVPREILWGHRFNDVLEVKRKYYKVNCLQFEGSVEVYAPFCSAKQLDWKD
QQLHIEKAPPVRESCTSDTKARRRSFSAVAIVSSCENPEETTTSATHEYRETPYQKALLT
LNRISVESQM

>gi568815581f:70032088_70233341|GENSCAN_predicted_CDS_2|1473_bp
atggcgtcagcaccaaatgagtatgcggcaggggtttataaagtctcctgtaaacaggaa
gtgtctcagtctgatgtaactgctacgcagtacgcggacggcctctctttccgtcttcag
cgggttctaactgaaaacccaaaccaagaaatagcaacaagtctagaattcttactacta
caaaactcacctggatccctaagggcacagcaaagaatgagctattacggcagcagctat
catattatcaatgcggacgcaaaatacccaggctacccgccagagcacattatagctgag
aagagaagagcaagaagacgattacttcacaaagatggcagctgtaatgtctacttcaag
cacatttttggagaatggggaagctatgtggttgacatcttcaccactcttgtggacacc
aagtggcgccatatgtttgtgatattttctttatcttatattctctcgtggttgatattt
ggctctgtcttttggctcatagcctttcatcatggcgatctattaaatgatccagacatc
acaccttgtgttgacaacgtccattctttcacaggggcctttttgttctccctagagacc
caaaccaccataggatatggttatcgctgtgttactgaagaatgttctgtggccgtgctc
atggtgatcctccagtccatcttaagttgcatcataaatacctttatcattggagctgcc
ttggccaaaatggcaactgctcgaaagagagcccaaaccattcgtttcagctactttgca
cttataggtatgagagatgggaagctttgcctcatgtggcgcattggtgattttcggcca
aaccacgtggtagaaggaacagttagagcccaacttctccgctatacagaagacagtgaa
gggaggatgacgatggcatttaaagacctcaaattagtcaacgaccaaatcatcctggtc
accccggtaactattgtccatgaaattgaccatgagagccctctgtatgcccttgaccgc
aaagcagtagccaaagataactttgagattttggtgacatttatctatactggtgattcc
actggaacatctcaccaatctagaagctcctatgttccccgagaaattctctggggccat
aggtttaatgatgtcttggaagttaagaggaagtattacaaagtgaactgcttacagttt
gaaggaagtgtggaagtatatgcccccttttgcagtgccaagcaattggactggaaagac
cagcagctccacatagaaaaagcaccaccagttcgagaatcctgcacgtcggacaccaag
gcgagacgaaggtcatttagtgcagttgccattgtcagcagctgtgaaaaccctgaggag
accaccacttccgccacacatgaatatagggaaacaccttatcagaaagctctcctgact
ttaaacagaatctctgtagaatcccaaatgtag

>gi568815581f:70032088_70233341|GENSCAN_predicted_peptide_3|971_aa
MLNVIGQGSPTPSTGRWPVACYELGHKAECEWQVSEYYHLSSTSCQIGAAFDSRRSVNPI
VNCAGKGSRLRVPYENLGIGVMCLNHPETITHNCPSRVRGKMVFHETSPWCQNVGDHCYM
TKNTRRIHSHPDTSMAKGKRATSPLCGSRCPCSSVDEKILAEEKERQRKGDCWVVKRYEQ
EGEVATHFSARFLLAPENPILPQSVIFPFLCPCDLIVQFPPMDQQSVPGGGGETEGGQGE
RERAVHPSEKPRRLVSNSRTVGRTNSQAALQWPHLRASQSWGLCAPAPVPRPIDSVASPG
AAASAGGFEGPRGSGREGVERETRRDRRTILPLRQRSLPGAGTSQAPRGARRASGPRAGE
PGSVWGSRRSRKLCPRPGCNQAAATAGRGASHLLLSQLGRNFLLPGPGPTPRPGSEGGGG
RGGGRFPPLATPRSILPSPPRFPGPAARPGALRLAGSGRGRGQESRCRTDRLKRAQDIAE
RTGALASAQPSRRRRAGSWEFWFALAHSLFTNHWILHASVPPTSTPCPHAPAPATEALES
PAEAMGSVRTNRYSIVSSEEDGMKLATMAVANGFGNGKSKVHTRQQCRSRFVKKDGHCNV
QFINVGEKGQRYLADIFTTCVDIRWRWMLVIFCLAFVLSWLFFGCVFWLIALLHGDLDAS
KEGKACVSEVNSFTAAFLFSIETQTTIGYGFRCVTDECPIAVFMVVFQSIVGCIIDAFII
GAVMAKMAKPKKRNETLVFSHNAVIAMRDGKLCLMWRVGNLRKSHLVEAHVRAQLLKSRI
TSEGEYIPLDQIDINVGFDSGIDRIFLVSPITIVHEIDEDSPLYDLSKQDIDNADFEIVV
ILEGMVEATAMTTQCRSSYLANEILWGHRYEPVLFEEKHYYKVDYSRFHKTYEVPNTPLC
SARDLAEKKYILSNANSFCYENEVALTSKEEDDSENGVPESTSTDTPPDIDLHNQASVPL
EPRPLRRESEI

>gi568815581f:70032088_70233341|GENSCAN_predicted_CDS_3|2916_bp
atgttgaatgttataggacagggatccccaacccccagtacaggtcgatggcctgtggcc
tgttacgaactgggccacaaagcagaatgtgagtggcaggtgagcgaatattaccacctg
agctccacttcctgtcagatcggggcagcatttgattctcgtaggagtgtaaaccccatt
gtgaactgtgcaggcaagggatctaggttacgtgttccttatgagaatttagggataggt
gtaatgtgcttgaatcatcctgaaaccatcacccacaactgcccctctcgtgtccgtgga
aaaatggtcttccatgaaaccagtccctggtgccaaaatgttggggaccattgttatatg
actaaaaacaccaggcgaatacacagccaccctgatacatccatggccaaaggcaaaaga
gcaacttctcctctctgtggcagcagatgcccctgttcttcagttgatgaaaaaatccta
gcagaggaaaaggaaagacagaggaaaggagattgttgggttgttaagagatatgagcag
gagggagaggttgcaacccacttctccgcccgctttctgctggccccagagaaccctata
cttccccagagtgtgatattccccttcctgtgtccatgtgatctcattgttcagttccca
cctatggatcagcaaagcgtgccgggcggtggtggagagactgagggcggacaaggcgag
agggaacgagccgtccacccttcggagaagcctaggcgccttgtaagtaattcgcgaaca
gtcgggagaacaaacagccaagcggcgctgcagtggccgcacttgcgcgcgtctcaatcc
tgggggctctgcgcgcccgccccagtccctcgccccattgactcagtggcttctccgggc
gctgcagcctccgcggggggcttcgaagggccgaggggctccggcagagagggagtggag
agggagacgcgccgggaccgacgaacaatcctgcccctgcggcaaaggtctctacccggc
gctggcacctcgcaggcccctcgaggagcacgcagggcaagcggcccaagagcgggggaa
ccgggaagtgtgtggggctccagacggagtaggaagctttgcccaaggccaggctgcaat
caggcagccgcaacagccgggcgcggagcttcccacctgctgctgtcccagctgggccgc
aacttcctcctccccggcccgggcccgactccccggccgggctccgagggtggaggggga
cgaggcggcggcaggttccctccgctggcaacgcctcgcagcatcctcccctccccgccg
cggttcccggggccggccgcgcggccaggcgcgctgcgattggccggcagcggccggggg
cggggccaggagagccggtgtcgcacggaccgcctcaaaagagcccaggatattgcagag
cgcactggagccctggccagcgcgcagccttcccggcgccggcgggctgggtcttgggaa
ttctggtttgctttggctcactcgctttttacaaaccactggatcttacatgcctctgta
ccccccacttccactccatgtccccatgctcctgcgccagcaacagaagcactggagtcc
ccagcagaagcgatgggcagtgtgcgaaccaaccgctacagcatcgtctcttcagaagaa
gacggtatgaagttggccaccatggcagttgcaaatggctttgggaacgggaagagtaaa
gtccacacccgacaacagtgcaggagccgctttgtgaagaaagatggccactgtaatgtt
cagttcatcaatgtgggtgagaaggggcaacggtacctcgcagacatcttcaccacgtgt
gtggacattcgctggcggtggatgctggttatcttctgcctggctttcgtcctgtcatgg
ctgttttttggctgtgtgttttggttgatagctctgctccatggggacctggatgcatcc
aaagagggcaaagcttgtgtgtccgaggtcaacagcttcacggctgccttcctcttctcc
attgagacccagacaaccataggctatggtttcagatgtgtcacggatgaatgcccaatt
gctgttttcatggtggtgttccagtcaatcgtgggctgcatcatcgatgctttcatcatt
ggcgcagtcatggccaagatggcaaagccaaagaagagaaacgagactcttgtcttcagt
cacaatgccgtgattgccatgagagacggcaagctgtgtttgatgtggcgagtgggcaat
cttcggaaaagccacttggtggaagctcatgttcgagcacagctcctcaaatccagaatt
acttctgaaggggagtatatccctctggatcaaatagacatcaatgttgggtttgacagt
ggaatcgatcgtatatttctggtgtccccaatcactatagtccatgaaatagatgaagac
agtcctttatatgatttgagtaaacaggacattgacaacgcagactttgaaatcgtggtc
atactggaaggcatggtggaagccactgccatgacgacacagtgccgtagctcttatcta
gcaaatgaaatcctgtggggccaccgctatgagcctgtgctctttgaagagaagcactac
tacaaagtggactattccaggttccacaaaacttacgaagtccccaacactcccctttgt
agtgccagagacttagcagaaaagaaatatatcctctcaaatgcaaattcattttgctat
gaaaatgaagttgccctcacaagcaaagaggaagacgacagtgaaaatggagttccagaa
agcactagtacggacacgccccctgacatagaccttcacaaccaggcaagtgtacctcta
gagcccaggcccttacggcgagagtcggagatatga

>gi568815581f:70032088_70233341|GENSCAN_predicted_peptide_4|70_aa
MNERTFAVGYCKPLSMEIMQFVPKMKVSKERLMVWKPFKNIEEIDSSMEDDQLPWFAQSF
SSFSTESLAR

>gi568815581f:70032088_70233341|GENSCAN_predicted_CDS_4|213_bp
atgaatgaaagaacatttgctgttggttactgtaaacctctgagtatggaaataatgcag
ttcgtgccaaaaatgaaagtgtcaaaggagcgacttatggtctggaagccattcaaaaat
atcgaagagatagattcttcaatggaggatgaccaactcccctggtttgcccagagcttt
tccagtttcagcactgaaagtcttgcacgctag

>gi568815581f:70032088_70233341|GENSCAN_predicted_peptide_5|92_aa
GIVYNLCGYVENESCASPLNSTNCGWSKLVQKEGLFPLPWLELLGCRAPSPETAQSSKAL
DSAQETIFASYAFGPVIGGAAGHALETFSSLS

>gi568815581f:70032088_70233341|GENSCAN_predicted_CDS_5|279_bp
ggaatcgtttacaatttatgtgggtatgttgaaaatgagagctgtgccagtcctctcaac
tcaacaaattgcggctggtctaagctggtgcaaaaggaaggactgtttcctctgccatgg
ctggagctgctgggatgcagggcaccaagtcctgagactgcacaaagcagcaaggccctg
gactcagcccaggaaaccatttttgcctcctacgcctttgggcctgtgataggaggggct
gctggacatgccctggagacattttcctcattgtcttag