GENSCAN 1.0	Date run: 5-Nov-116	Time: 21:01:00

Sequence gi568815586f:80607720_80808957 : 201238 bp : 35.61% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2720   2906  187  1  1   98  110   103 0.979  11.84
 1.02 Intr +   5873   6117  245  0  2   53   47   205 0.546   9.19
 1.03 Intr +   8481   8547   67  2  1   61   83    70 0.582   1.36
 1.04 Intr +  11665  11823  159  2  0   94   54    97 0.955   5.94
 1.05 Intr +  12435  12657  223  1  1   47   89   117 0.471   3.96
 1.06 Intr +  14342  14415   74  2  2   85   82    16 0.414  -1.17
 1.07 Term +  15288  15424  137  1  2   49   38   103 0.248  -1.40
 1.08 PlyA +  15800  15805    6                               1.05

 2.00 Prom +  16117  16156   40                              -1.15
 2.01 Init +  17263  17392  130  0  1   74   49    77 0.714   2.76
 2.02 Intr +  24473  24572  100  0  1  123  116   -24 0.908   2.15
 2.03 Intr +  27226  27354  129  1  0   64   89   189 0.922  15.49
 2.04 Intr +  41869  41950   82  1  1   25   88    43 0.197  -3.28
 2.05 Intr +  45025  45115   91  0  1   94   50    49 0.495   0.35
 2.06 Intr +  50266  50342   77  2  2   96   84   115 0.966  10.22
 2.07 Intr +  61288  61422  135  0  0   84  114    13 0.758   3.34
 2.08 Intr +  61620  61745  126  2  0   74   60    97 0.952   5.46
 2.09 Intr +  62625  62773  149  2  2   65   94    27 0.716  -0.89
 2.10 Intr +  65450  65585  136  2  1   79   99    28 0.723   2.65
 2.11 Intr +  70883  71055  173  1  2   87  107    11 0.094   0.72
 2.12 Intr +  95262  95370  109  0  1  104   61    27 0.055   0.97
 2.13 Intr +  99974 100519  546  1  0   18   75   620 0.005  45.85
 2.14 Intr + 100805 101237  433  1  1  105   17   202 0.080   7.09
 2.15 Intr + 109199 109845  647  0  2   47  103   727 0.300  60.87
 2.16 Term + 138558 138719  162  2  0   51   47   129 0.095   2.05
 2.17 PlyA + 140139 140144    6                               1.05

 3.03 PlyA - 140801 140796    6                               1.05
 3.02 Term - 159976 159791  186  1  0   47   45   132 0.751   1.31
 3.01 Init - 168432 168379   54  0  0   83   64    74 0.406   5.84
 3.00 Prom - 172324 172285   40                              -6.75

 4.05 PlyA - 174431 174426    6                               1.05
 4.04 Term - 178828 178633  196  0  1   60   44   172 0.612   5.90
 4.03 Intr - 180501 180305  197  0  2   72   37   137 0.620   4.29
 4.02 Intr - 180835 180621  215  2  2   66  107    70 0.659   4.21
 4.01 Init - 182501 182354  148  2  1   75   89   129 0.991  12.00
 4.00 Prom - 188123 188084   40                              -6.55

 5.02 PlyA - 188405 188400    6                               1.05
 5.01 Term - 197631 197461  171  0  0  105   41   168 0.996  10.64

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr + 110639 110714   76  1  1   69   95    47 0.856   1.67
S.002 Term - 190007 189930   78  2  0  140   39    72 0.883   4.28
S.003 Init - 190553 190506   48  2  0   69   76    20 0.842  -0.10

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:80607720_80808957|GENSCAN_predicted_peptide_1|363_aa
VDNDEFNISFIKSNEENKTIEIKDLEIFTRYSVVITAFTGNISAAYVEGKSSAEMIVTTL
ESAPKDPPNNMTFQKIPDEVTKFQLTFLPPSQPNGNIQVYQALVYREDDPTAVQIHNLSI
IQKTNTFVIAMLEGLKGGHTYNISVYAVNSAGAGPKVPMRITMDIKAPARPKTKPTPIYD
ATGKLLVTSTTITIRMPICYYSDDHGPIKNVQVLVTETGAQHDGNVTKWYDAYFNKARPY
FTNEGFPNPPCTEGKTKFSGNEEIYIIGADNACMIPGNEDKICNGPLKPKKQYLFKFRAT
NIMGQFTDSDYSDPVKTLGPQKPQFIRLKELHEQGISGAPLENVRLHVLPRSWRAVAQCI
LNC

>gi568815586f:80607720_80808957|GENSCAN_predicted_CDS_1|1092_bp
gtagataatgatgaatttaatatatccttcatcaagtcaaatgaagaaaataaaaccata
gaaattaaagatttagaaatattcacaaggtattctgtagtgatcactgcatttactggg
aacattagtgctgcatatgtagaagggaagtcaagtgctgaaatgattgttactacttta
gaatcagccccaaaggacccacctaacaacatgacatttcagaagataccagatgaagtt
acaaaatttcaattaacgttccttcctccttctcaacctaatggaaatatccaagtatat
caagctctggtttaccgagaagatgatcctactgctgtccagattcacaacctcagtatt
atacagaaaaccaacacattcgtcattgcaatgctagaaggactaaaaggtggacataca
tacaatatcagtgtttacgcagtcaatagtgctggtgcaggtccaaaggttccgatgaga
ataaccatggatatcaaagctccagcacgaccaaaaaccaaaccaacccctatttatgat
gccacaggaaaactgcttgtgacttcaacaacaattacaatcagaatgccaatatgttac
tacagtgatgatcatggaccaataaaaaatgtacaagtgcttgtgacagaaacaggagct
cagcatgatggaaatgtaacaaagtggtatgatgcatattttaataaagcaaggccatat
tttacaaatgaaggctttcctaaccctccatgtacagaaggaaagacaaagtttagtggc
aatgaagaaatctacatcataggtgctgataatgcatgcatgattcctggcaatgaagac
aaaatttgcaatggaccactgaaaccaaaaaagcaatacttatttaaatttagagctaca
aatattatgggacaatttactgactctgattattctgaccctgttaagactttaggccct
caaaagccccagtttattcgtcttaaagaattgcatgaacagggtatttctggggcacca
cttgaaaatgtaagacttcatgtgttgcccagatcctggcgagctgttgctcagtgtatc
ttgaactgctaa

>gi568815586f:80607720_80808957|GENSCAN_predicted_peptide_2|1074_aa
MQPMSCELDKLATDSKEFLVKEHCQGRNSRRTQKENKVDQFSNGEGLSERTVEIILSVTL
CILSIILLGTAIFAFARIRQKQKEGGTYSPQDAEIIDTKLKLDQLITVADLELKDERLTR
PISKKSFLQHVEELCTNNNLKFQEEFSELPKFLQDLSSTDADLPWNRAKNRFPNIKPYNN
NRVKLIADASVPGSDYINASYISGYLCPNEFIATQGPLPGTVGDFWRMVWETRAKTLVML
TQCFEKGRIRCHQYWPEDNKPVTVFGDIVITKLMEDVQIDWTIRDLKIERHGDCMTVRQC
NFTAWPEHGVPENSAPLIHFVKLVRASRAHDTTPMIVHCSAGVGRTGVFIALDHLTQHIN
DHDFVDIYGLVAELRSERMCMVQNLAQYIFLHQCILDLLSNKGSNQPICFVNYSALQKMD
SLDAMEGKQKQQYMPSLLVYHLRFSEVTQMFCRSKCKTIFTTAFGNLWAQTVCRLGKLQI
ESEAKEENMMMDLFETGSYFFYLDGENVTLQPLEVAEGSPLYPGSDGTLSPCQDQMPPEA
GSDSSGEEHVLAPPGLQPPHCPGQCLIWACKTCKRKSAPTDRRKAATLRERRRLKKINEA
FEALKRRTVANPNQRLPKVEILRSAISYIERLQDLLHRLDQQEKMQELGVDPFSYRPKQE
NLEGADFLRTCSSQWPSVSDHSRGLVITAKEGKVKGLWAAPEKIREVDRMLGPRGLEARA
GTRPAKRALFARRDQAFPRCQSGLARGAELLRVPSLGAMFVCCFFAQEEQVLIRQPRVAF
DAFLPSWTVFPRRNANSPAWRKWWRSQQASALVNYRSDRLGSSARDLPIGGGARLPFLPI
PLAAVQVHRLPLSRMDVMDGCQFSPSEYFYDGSCIPSPEGEFGDEFVPRVAAFGAHKAEL
QGSDEDEHVRAPTGHHQAGHCLMWACKACKRKSTTMDRRKAATMRERRRLKKVNQAFETL
KRCTTTNPNQRLPKVEILRNAIRYIESLQELLREQVENYYSLPGQSCSEPTSPTSNCSDG
MKERINFSVSLAMPRQNAADSQATAACIAEPKQPTSAKAIMALWAIFAEIESIA

>gi568815586f:80607720_80808957|GENSCAN_predicted_CDS_2|3225_bp
atgcagcccatgagctgcgagttggacaagctcgccacagattcaaaagagttcttagta
aaagaacattgccagggaagaaactctagaagaactcaaaaagaaaacaaagttgatcaa
ttctccaatggggaaggactttcagaaagaaccgtagagatcattctttccgtcactttg
tgtatcctttcaataattctccttggaacagctatttttgcatttgcaagaattcgacag
aagcagaaagaaggtggcacatactctcctcaggatgcagaaattattgacactaaattg
aagctggatcagctcatcacagtggcagacctggaactgaaggacgagagattaacgcgg
ccaataagcaagaaatccttcctgcaacatgttgaagagctttgcacaaacaacaaccta
aagtttcaagaagaattttcggaattaccaaaatttcttcaggatctttcttcaactgat
gctgatctgccttggaatagagcaaaaaaccgcttcccaaacataaaaccatataataat
aacagagtaaagctgatagctgacgctagtgttccaggttcggattatattaatgccagc
tatatttctggttatttatgtccaaatgaatttattgctactcaaggtccactaccagga
acagttggagatttttggagaatggtgtgggaaaccagagcaaaaacattagtaatgcta
acacagtgttttgaaaaaggacggatcagatgccatcagtattggccagaggacaacaag
ccagttactgtctttggagatatagtgattacaaagctaatggaggatgttcaaatagat
tggactatcagggatctgaaaattgaaaggcatggggattgcatgactgttcgacagtgt
aactttactgcctggccagagcatggggttcctgagaacagcgcccctctaattcacttt
gtgaagttggttcgagcaagcagggcacatgacaccacacctatgattgttcactgcagt
gctggagttggaagaactggagtttttattgctctggaccatttaacacaacatataaat
gaccatgattttgtggatatatatggactagtagctgaactgagaagtgaaagaatgtgc
atggtgcagaatctggcacagtatatctttttacaccagtgcattctggatctcttatca
aataagggaagtaatcagcccatctgttttgttaactattcagcacttcagaagatggac
tctttggacgccatggaaggtaaacagaaacaacagtatatgcccagcttactagtttac
cacctacggttctctgaagtgactcagatgttctgtaggtctaagtgcaaaactattttt
accacagcgtttgggaatctctgggcacaaactgtctgccgattaggtaaattacagatc
gagtcagaggccaaggaggagaacatgatgatggacctttttgaaactggctcctatttc
ttctacttggatggggaaaatgttactctgcagccattagaagtggcagaaggctctcct
ttgtatccagggagtgatggtaccttgtccccctgccaggaccaaatgcccccggaagcg
gggagcgacagcagcggagaggaacatgtcctggcgcccccgggcctgcagcctccacac
tgccccggccagtgtctgatctgggcttgcaagacctgcaagagaaaatctgcccccact
gaccggcgaaaagccgccaccctgcgcgaaaggaggaggctaaagaaaatcaacgaggcc
ttcgaggcactgaagcggcgaactgtggccaaccccaaccagaggctgcccaaggtggag
attctgcggagcgccatcagctatattgagcggctgcaggacctgctgcaccggctggat
cagcaggagaagatgcaggagctgggggtggaccccttcagctacagacccaaacaagaa
aatcttgagggtgcggatttcctgcgcacctgcagctcccagtggccaagtgtttccgat
cattccagggggctcgtgataacggctaaggaaggtaaagtaaaagggctctgggccgca
ccagagaaaatccgggaggtggataggatgcttgggccgagagggctcgaagcgagagca
gggacgcgccctgcgaaaagggcgctctttgcgcgccgggaccaggcctttcctcgctgc
caaagcggcctcgcgcggggcgcggagctgcttcgtgttccttctttgggtgctatgttt
gtgtgttgttttttcgctcaggaggagcaagtattgattcgtcagcctcgagtagccttc
gatgcctttcttccatcgtggacagtatttcctcggaggaacgcaaactcccctgcgtgg
aggaagtggtggagaagccaacaggcgtctgcccttgttaattaccggagcgacagacta
gggagctccgcccgggatttgcccatcggcggaggcgccaggctcccgtttctccccatc
cctctcgctgccgtccaggtgcaccgcctgcctctcagcaggatggacgtgatggatggc
tgccagttctcaccttctgagtacttctacgacggctcctgcataccgtcccccgagggt
gaatttggggacgagtttgtgccgcgagtggctgccttcggagcgcacaaagcagagctg
cagggctcagatgaggacgagcacgtgcgagcgcctaccggccaccaccaggctggtcac
tgcctcatgtgggcctgcaaagcctgcaagaggaagtccaccaccatggatcggcggaag
gcagccactatgcgcgagcggaggcgcctgaagaaggtcaaccaggctttcgaaaccctc
aagaggtgtaccacgaccaaccccaaccagaggctgcccaaggtggagatcctcaggaat
gccatccgctacatcgagagcctgcaggagttgctgagagagcaggtggagaactactat
agcctgccgggacagagctgctcggagcccaccagccccacctccaactgctctgatggc
atgaaagaacgaatcaatttttctgtctcacttgcaatgccaaggcagaatgcagctgac
agtcaagctacagcagcttgcattgcagaacccaagcagccgacatctgccaaggccatt
atggcattatgggccatttttgctgagatagaatcaattgcctga

>gi568815586f:80607720_80808957|GENSCAN_predicted_peptide_3|79_aa
MAEGKREAVTFFTWQQGGTTVAVIGLLPDAYSKSIHQDAELAQRKRFNCRVTEQGDGKKP
QIHLPEEFEDKIFKGFGVN

>gi568815586f:80607720_80808957|GENSCAN_predicted_CDS_3|240_bp
atggcagaaggcaaacgagaagcagtcaccttctttacatggcagcaaggcggaactacc
gtagctgtaataggcttattgcctgatgcatacagcaagtcaatacaccaagatgctgag
ctggcgcagagaaagagattcaattgtagggtcactgaacaaggagatgggaagaagcct
caaatccatctccctgaagaatttgaggataaaatttttaagggttttggagtgaactga

>gi568815586f:80607720_80808957|GENSCAN_predicted_peptide_4|251_aa
MQEQWQAFNPIGSGNGHLAGSGAQRAPCWIQRDGSQWRVCDSGKTAVVDGEINSHVAHTK
PVWWSLHTDAHEIWCRDSDWGTFLGRSIPCPPALCFMRKIHLRPQVLRPTSPRNISPISN
LRQRRYVLSVDPKLWCRSRTSEGSLPLVFNHWRDASLIIHPGFGGVRPLRDACLGPSPLA
ASPAFLGSLLQVPEIWPPGQGMPAAQIPPKPRPICVGPHWKSDCPTHLAATPRAPGTLAQ
GSLTPSQIFLV

>gi568815586f:80607720_80808957|GENSCAN_predicted_CDS_4|756_bp
atgcaggaacaatggcaagcctttaacccaatcgggagtggcaatgggcaccttgctgga
tcaggagcacagcgggcaccctgctggatccagagggatggaagtcagtggcgggtctgt
gacagcggcaaaacagcagtggtggatggtgaaataaacagccatgttgctcatacaaag
cctgtttggtggtctcttcacacggacgcgcatgaaatttggtgccgtgactcggattgg
gggaccttccttgggagatcaatcccctgtcctcctgctctttgcttcatgagaaagatc
cacctacgacctcaggtcctcagaccaaccagcccaagaaacatctcaccaatttcaaat
ctgagacaaaggagatacgttttatccgtggacccaaaactctggtgccggtcacggact
agtgaaggcagccttcccttggtgtttaatcattggagggacgcttctctgattattcac
ccaggtttcggaggtgtcagaccactcagggatgcctgccttggtccttcacccttagca
gcaagtcctgcttttctggggagcttgctacaagtgccagaaatctggccaccaggccaa
ggaatgcctgcagcccagattcctcctaagccgcgtcccatctgtgtgggaccccactgg
aaatcggactgtccaactcacctggcagccactcccagagcccctggaactctggcccaa
ggctctctgactccttcccagatcttcttggtttag

>gi568815586f:80607720_80808957|GENSCAN_predicted_peptide_5|56_aa
HYHMHDPDDLIQGIITKTIRLVTKKTKFQGMTPIKSNARNPEFPSRTPDRNNHKAD

>gi568815586f:80607720_80808957|GENSCAN_predicted_CDS_5|171_bp
cattaccacatgcatgatcctgatgatctaatccagggaattattacgaaaactatccgc
ctagtcactaagaaaaccaagttccaggggatgactccaataaaatcgaatgcaaggaac
ccagagtttccaagcagaacaccggacagaaataatcataaagctgactaa