GENSCAN 1.0 Date run: 7-Nov-116 Time: 18:50:01 Sequence gi568815587f:58522753_58724519 : 201767 bp : 38.83% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 987 1129 143 1 2 36 97 75 0.437 2.35 1.02 Term + 4672 4980 309 0 0 35 44 329 0.550 17.28 1.03 PlyA + 5586 5591 6 1.05 2.11 PlyA - 5851 5846 6 1.05 2.10 Term - 9347 9112 236 0 2 38 49 213 0.003 7.90 2.09 Intr - 10189 10058 132 2 0 36 39 165 0.003 6.10 2.08 Intr - 27115 27034 82 1 1 108 86 15 0.185 1.59 2.07 Intr - 27394 27221 174 1 0 93 95 115 0.976 11.91 2.06 Intr - 28480 28313 168 1 0 69 100 124 0.984 10.92 2.05 Intr - 32188 32089 100 0 1 106 113 152 0.973 18.69 2.04 Intr - 41449 41403 47 1 2 54 68 43 0.048 -4.81 2.03 Intr - 47961 47804 158 1 2 83 56 131 0.521 8.21 2.02 Intr - 51382 51258 125 0 2 66 63 109 0.518 5.51 2.01 Init - 52844 52675 170 2 2 60 -1 107 0.257 -1.95 2.00 Prom - 53534 53495 40 -9.25 3.00 Prom + 55391 55430 40 -10.45 3.01 Init + 56530 56870 341 0 2 79 76 319 0.740 26.58 3.02 Intr + 62104 62132 29 1 2 104 116 21 0.581 3.34 3.03 Intr + 87078 87287 210 1 0 30 47 120 0.041 0.06 3.04 Intr + 87546 87582 37 1 1 95 66 42 0.101 -0.90 3.05 Intr + 88198 88344 147 1 0 50 10 134 0.490 0.23 3.06 Intr + 88852 88986 135 1 0 60 81 147 0.997 9.96 3.07 Intr + 89526 89576 51 0 0 80 78 58 0.424 1.10 3.08 Intr + 90010 90088 79 1 1 56 82 120 0.999 6.83 3.09 Intr + 91477 91591 115 0 1 91 80 96 0.972 8.20 3.10 Intr + 93964 94063 100 2 1 30 79 115 0.969 3.05 3.11 Term + 94444 94954 511 1 1 121 40 541 0.999 45.46 3.12 PlyA + 96987 96992 6 1.05 4.00 Prom + 98770 98809 40 -7.95 4.01 Init + 100001 100114 114 1 0 56 92 129 0.901 10.36 4.02 Term + 101282 101770 489 1 0 59 43 419 0.508 28.17 4.03 PlyA + 102955 102960 6 1.05 5.09 PlyA - 103002 102997 6 1.05 5.08 Term - 104286 104137 150 0 0 88 47 74 0.021 0.23 5.07 Intr - 116200 116120 81 1 0 92 80 83 0.091 6.82 5.06 Intr - 123931 123614 318 1 0 49 47 167 0.018 3.93 5.05 Intr - 129528 129439 90 0 0 64 78 51 0.028 0.97 5.04 Intr - 131889 131788 102 0 0 65 100 69 0.386 5.25 5.03 Intr - 140804 140729 76 1 1 71 58 87 0.029 2.60 5.02 Intr - 141976 141829 148 2 1 88 24 65 0.546 -1.53 5.01 Init - 153092 152978 115 2 1 44 53 188 0.979 11.32 5.00 Prom - 155867 155828 40 -7.35 6.03 PlyA - 156430 156425 6 1.05 6.02 Term - 158037 157813 225 0 0 87 44 109 0.225 2.20 6.01 Init - 163484 163404 81 2 0 88 37 100 0.325 5.92 6.00 Prom - 168979 168940 40 -3.05 7.07 PlyA - 169269 169264 6 1.05 7.06 Term - 177058 176233 826 0 1 -5 47 384 0.209 16.55 7.05 Intr - 187416 187174 243 2 0 52 52 206 0.385 9.09 7.04 Intr - 188009 187838 172 0 1 65 84 154 0.968 10.78 7.03 Intr - 189229 189026 204 2 0 16 115 112 0.792 4.95 7.02 Intr - 190134 190008 127 0 1 50 106 77 0.807 5.03 7.01 Intr - 192671 192564 108 2 0 73 111 81 0.922 8.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 8339 8481 143 1 2 73 98 118 0.900 10.95 S.002 Intr + 8846 9319 474 2 0 -1 36 287 0.801 5.85 S.003 Init - 34499 34414 86 2 2 72 13 57 0.865 -3.06 S.004 Init + 82043 82101 59 1 2 52 92 70 0.887 4.63 S.005 Sngl + 184320 184649 330 2 0 60 44 179 0.940 6.47 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:58522753_58724519|GENSCAN_predicted_peptide_1|150_aa XREDGWLSTGCQEVYAIVRNPWSMNLNTVRRLLPTENQFRHYTLTISKESEEAMDQLALQ WEELIEARLTIGLVILLPENALRQLCQAESTHKVLRMELVPHGTDTTASDGLPTPMAERS PAVMVMELTEWTSIQFKEGASRKTGEAVLG >gi568815587f:58522753_58724519|GENSCAN_predicted_CDS_1|453_bp ngaagggaagatggatggctctcaacagggtgccaggaagtttatgcaattgtccgtaat ccatggtctatgaacctaaacacagttaggagacttctccccacggaaaatcaatttagg cactatacgctgactatatctaaggaatctgaagaggctatggatcagttggcattacag tgggaagagcttattgaagcaaggttgacaataggtcttgtcattctgctccctgaaaat gcccttcgacaactgtgtcaggcagaaagcacacacaaagtgctcaggatggaacttgta ccccatggcactgatacaacggccagtgatgggctgcccacacccatggcagagcgttcc ccggcggtgatggtaatggagctcacagaatggacgtccatccagttcaaagaaggagcc agtagaaaaactggtgaagcagtcctggggtag >gi568815587f:58522753_58724519|GENSCAN_predicted_peptide_2|463_aa MIVLGNYGFTVCSGKIDGEFGANRGINSWQILLDWIVTAVVVLLWAFCPLEVTGDQGLLM ATLQQFLQIVISPVTTLETWCGSFFLLAEKWHRGQVGSNALLEELERSTLQDSDEYSNPA PLPLDQHSRKETNLDETSEILSIQDNTSPLPAQLVYTTNIQELNVYSEAQEPKESPPPSK TSAAAQLDELMAHLTEMQAKVAVRADAGKKHLPDKQDHKASLDSMLGGLEQELQDLGIAT VPKGHCASCQKPIAGKVIHALGQSWHPEHFVCTHCKEEIGSSPFFERSGLAYCPNDYHQL FSPRCAYCAAPILDKVLTAMNQTWHPEHFFCSHCGEVFGAEGVKLQTFAVSVTAHKGSAD PKSEQQQDLLQRVKEQNFYSVEGDPTAVSVANPLTAWGWRCWPATLSAGPADPVLPETRA GPRAPHAAPVPARASPSTPPRRQRELAPASASPEWGSHSAAAG >gi568815587f:58522753_58724519|GENSCAN_predicted_CDS_2|1392_bp atgattgttttggggaactatgggttcactgtttgcagtggcaaaatagatggagagttt ggtgctaaccgcggaataaactcttggcaaatacttttggactggattgttactgcagtt gttgtcttgttatgggcgttttgccctttggaggtcactggagatcaagggctgttgatg gctacacttcaacagtttcttcaaattgtcatcagcccagtaactacccttgagacttgg tgtggatctttcttcctgctggcagagaaatggcatcgaggacaggttggctcaaatgcc ttattggaggaactggaacgctccacccttcaggacagtgatgaatattccaacccagct cctcttcccctggatcagcattccagaaaggagactaaccttgatgagacttcggagatc ctttctattcaggataacacaagtcccttgccggcgcagctcgtgtatactaccaatatc caggagctcaatgtctacagtgaagcccaagagccaaaggaatcaccaccaccttctaaa acgtcagcagctgctcagttggatgagctcatggctcacctgactgagatgcaggccaag gttgcagtgagagcagatgctggcaagaagcacttaccagacaagcaggatcacaaggcc tccctggactcaatgcttgggggtctggagcaggaattgcaggaccttggcattgccaca gtgcccaagggccattgtgcatcctgccagaaaccgattgctgggaaggtgatccatgct ctagggcaatcatggcatcctgagcattttgtctgtactcattgcaaagaagagattggc tccagtcccttctttgagcggagtggcttggcctactgccccaacgactaccaccaactt ttttctccacgctgtgcttactgcgctgctcccatcctggataaagtgctgacagcaatg aaccagacctggcacccagagcacttcttctgctctcactgcggagaggtgtttggtgca gaaggagtgaagctgcagaccttcgcggtgagtgttacagctcataaaggcagtgcggac ccaaagagtgagcagcagcaagatttattgcaaagagtgaaagaacaaaacttctacagt gtggaaggggacccaactgctgtctcagttgctaatcctctcactgcctggggctggcgg tgctggccggccactctgagtgcggggcctgctgatcccgtgctacccgaaactcgtgct ggcccacgagcaccgcatgcagccccagttcctgcccgtgcctctccgtccacacctccc cgcaggcagagggagctggctccagcctcagccagcccagagtggggctcccacagtgca gcggcaggctga >gi568815587f:58522753_58724519|GENSCAN_predicted_peptide_3|584_aa MPGETEEPRPPEQQDQEGGEAAKAAPEEPQQRPPEAVAAAPAGTTSSRVLRGGRDRGRAA AAAAAAAVSRRRKAEYPRRRRSSPSARPPDVPGQQPQAAKSPSPVQGKKSPRLLCIEKVT TDKDPKEEKEEEDDSALPQEVSIAASRPSRGWRSSRTSVSRHRDTENTRSSRSKTGSLQL ICKSEPNTDQLDYDVGEEHQSPGGISSEEEEEEEEEMLISEEEIPFKDDPRDETYKPHLE RHVSDFTADMLMKYKETPKPRRKSGKVKEEKEKKEIKVEVEVEVKEEENEIREDEEPPRK RGRRRKDDKSPRLPKRRKKPPIQYVRCEMEGCGTVLAHPRYLQHHIKYQHLLKKKYVCPH PSCGRLFRLQKQLLRHAKHHTDQRDYICEYCARAFKSSHNLAVHRMIHTGEKPLQCEICG FTCRQKASLNWHMKKHDADSFYQFSCNICGKKFEKKDSVVAHKAKSHPEVLIAEALAANA GALITSTDILGTNPESLTQPSDGQGLPLLPEPLGNSTSGECLLLEAEGMSKSYCSGTERV SLMADGKIFVGSGSSGGTEGLVMNSDILGATTEVLIEDSDSAGP >gi568815587f:58522753_58724519|GENSCAN_predicted_CDS_3|1755_bp atgccgggggagacggaagagccgagacccccggagcagcaggaccaggaagggggagag gcggccaaggcggctccggaggagccccaacaacggccccctgaggcggtcgcggcggcg cctgcagggaccactagcagccgcgtgctgaggggaggtcgggaccgaggccgggccgct gcggccgccgccgccgcagctgtgtcccgccggaggaaggccgagtatccccgccggcgg aggagcagccccagcgccaggcctcccgacgtccccgggcagcagccccaggccgcgaag tccccgtctccagttcagggcaagaagagtccgcgactcctatgcatagaaaaagtaaca actgataaagatcccaaggaagaaaaagaggaagaagacgattctgccctccctcaggaa gtttccattgctgcatctagacctagccggggctggcgtagtagtaggacatctgtttct cgccatcgtgatacagagaacacccgaagctctcggtccaagaccggttcattgcagctc atttgcaagtcagaaccaaatacagaccaacttgattatgatgttggagaagagcatcag tctccaggtggcattagtagtgaagaggaagaggaggaggaagaagagatgttaatcagt gaagaggagataccattcaaagatgatccaagagatgagacctacaaaccccacttagaa aggcatgtctcagattttactgcagacatgttaatgaaatataaggaaaccccaaagcca cggagaaaatcagggaaggtaaaagaagagaaggagaagaaggaaattaaagtggaagta gaggtggaggtgaaagaagaggagaatgaaattagagaggatgaggaacctccaaggaag agaggaagaagacgaaaagatgacaaaagtccacgtttacccaaaaggagaaaaaagcct ccaatccagtatgtccgttgtgagatggaaggatgtggaactgtccttgcccatcctcgc tatttgcagcaccacattaaataccagcatttgctgaagaagaaatatgtatgtccccat ccctcctgtggacgactcttcaggcttcagaagcaacttctgcgacatgccaaacatcat acagatcaaagggattatatctgtgaatattgtgctcgggccttcaagagttcccacaat ctggcagtgcaccggatgattcacactggcgagaagccattacaatgtgagatctgtgga tttacttgtcgacaaaaggcatctcttaattggcacatgaagaaacatgatgcagactcc ttctaccagttttcttgcaatatctgtggcaaaaaatttgagaagaaggacagcgtagtg gcacacaaggcaaaaagccaccctgaggtgctgattgcagaagctctggctgccaatgca ggcgccctcatcaccagcacagatatcttgggcactaacccagagtccctgacgcagcct tcagatggtcagggtcttcctcttcttcctgagcccttgggaaactcaacctctggagag tgcctactgttagaagctgaagggatgtcaaagtcatactgcagtgggacggaacgggtg agcctgatggctgatgggaagatctttgtgggaagcggcagcagtggaggcactgaaggg ctggttatgaactcagatatactcggtgctaccacagaggttctgattgaagattcagac tctgccggaccttag >gi568815587f:58522753_58724519|GENSCAN_predicted_peptide_4|200_aa MAFTEHSPLTPHRRDLCSRSIWLARKIRSDLTALTESYVKHQGLNKNINLDSADGMPVAS TDQWSELTEAERLQENLQAYRTFHVLLARLLEDQQVHFTPTEGDFHQAIHTLLLQVAAFA YQIEELMILLEYKIPRNEADGMPINVGDGGLFEKKLWGLKVLQELSQWTVRSIHDLRFIS SHQTGIPARGSHYIANNKKM >gi568815587f:58522753_58724519|GENSCAN_predicted_CDS_4|603_bp atggctttcacagagcattcaccgctgacccctcaccgtcgggacctctgtagccgctct atctggctagcaaggaagattcgttcagacctgactgctcttacggaatcctatgtgaag catcagggcctgaacaagaacatcaacctggactctgcggatgggatgccagtggcaagc actgatcagtggagtgagctgaccgaggcagagcgactccaagagaaccttcaagcttat cgtaccttccatgttttgttggccaggctcttagaagaccagcaggtgcattttacccca accgaaggtgacttccatcaagctatacatacccttcttctccaagtcgctgcctttgca taccagatagaggagttaatgatactcctggaatacaagatcccccgcaatgaggctgat gggatgcctattaatgttggagatggtggtctctttgagaagaagctgtggggcctaaag gtgctgcaggagctttcacagtggacagtaaggtccatccatgaccttcgtttcatttct tctcatcagactgggatcccagcacgtgggagccattatattgctaacaacaagaaaatg tag >gi568815587f:58522753_58724519|GENSCAN_predicted_peptide_5|359_aa MALEKALDNVQDKVCLEIPGQSLALDFADSAPPAEGSRGKMMTLFPISLGVYTLPMVLFL ISREDDDIIPNITGDVHPSCDILLNIQGEEEDDITHNIAGSVRPFCDLVPNTLKTTDDFD HDTNTYLGKNPEDLQGPSTRSSIFRFKETGLCLNPRESLEMPKGKHISEDSQRQMGEEAP SLVEGLQGITPLGQKNLKSILEPHSIPLTQSTKIRRNQKNNSGNMKKQSTLTPPKYNTST PAMDPNQEEISELPEKEIRTLIIKLIKEAPEKEEISKQQSIQEHVRTLSNHLENHPDPGG QVPGRDACAQWLVLVPFVDPLPLQIRLLYIHIPLHATHESLSQAPLSSETSADVCCCTC >gi568815587f:58522753_58724519|GENSCAN_predicted_CDS_5|1080_bp atggctttggagaaagcccttgacaacgttcaggacaaagtgtgcctggagatcccaggg cagtcccttgctctggactttgcggactccgctccgccagcagagggcagcagagggaag atgatgacattattcccaatatcactgggggtgtacaccctccccatggtattgttccta atttccagggaagatgatgacattattcccaatatcactggggatgtacacccttcctgt gatattcttcttaatatccagggagaagaagaagatgatattactcacaatatcgcaggg agtgtacgccctttctgtgatttagttcctaatacgctgaagacgacagatgactttgat cacgacaccaacacttacctgggtaagaacccagaggatttacagggcccatcaaccaga agcagcattttcagattcaaggaaacaggtctctgcctcaacccaagagaatcattggaa atgcctaaaggaaaacatatctcagaggacagccagagacaaatgggggaggaagcccca tccctagtggaagggcttcaagggatcacccccttgggacaaaagaatctgaagagtatc cttgagccccatagcattcctctgacacagtctaccaaaataagaaggaatcagaaaaac aattctggtaatatgaaaaagcaaagtactttaacacccccaaaatataacactagcaca ccagcaatggatccaaaccaagaagaaatctctgaattgccagaaaaagaaatcagaaca ttgattattaagctaatcaaggaggcaccagagaaagaagaaatttctaagcagcaaagc attcaagagcatgtgcggaccctcagcaaccacctggagaaccacccagacccaggaggc caagtacctggcagagatgcttgtgctcagtggttagtacttgtgccttttgtggaccct ctgccacttcaaatccgcctcttatacatacatattcctttacatgcaactcatgagagc ttaagccaagctccactgagttctgaaacttctgcagatgtctgctgctgtacctgctga >gi568815587f:58522753_58724519|GENSCAN_predicted_peptide_6|101_aa MPREVREDSNKISDPIDLEAAHTQEMKSLEAAKAKTAKQQRWQPAPLSGGSVSGSYRALA GPKTLVAQKSMVGRSCSVRRNGTRKLCKTVWLFFHRAAALC >gi568815587f:58522753_58724519|GENSCAN_predicted_CDS_6|306_bp atgcccagggaagtacgagaagactctaacaagatctccgatccaattgaccttgaggct gcacacacacaggaaatgaagagcctggaggcagcaaaggcaaagactgcaaaacagcaa agatggcagcctgcccctctctctgggggctctgtctcaggcagctatagagcccttgct ggcccaaaaaccctggtggcccaaaagtccatggttgggaggtcctgctcagtaaggaga aatgggaccaggaaactgtgtaaaacagtctggctgtttttccacagggcagctgcactg tgctag >gi568815587f:58522753_58724519|GENSCAN_predicted_peptide_7|559_aa VYGTVFHINHGNPFNLKAVVDKWPDFNTVVVCPQEQDMTDDLDHYTNTYQIYSKDPQNCQ EFLGSPELINWKQHLQIQTGDCGDSMMATCVASRLAQAYLNGIGLNIIGLNQEQKQRESP NGAGTFKPLLALNLLFPLMKDGQCGQGSQPSLNEAIQNLAAIKSFKVKQTQRILYMAAET AKELTPFLLKSKILSPNGGKPKAINQEMFKLSSMDVTHAHLVNKFWHFGGNERSQRFIER CIQTFPTCCLLGPEGTPVCWDLMDQTGEMRMAGTLPEYRLHGLVTPNRHLQNSPPQISRI YIFSTPHHTYSKIDHIIGSKARLSKCKRTEIITNYLSDHSAIKLELRIKKLTQNCSTTQK LNNLLLNDYRVHNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQER SKIDTLTSQLKELEKQEQTHSKASRRQGITKIRAELKEIETQKTLQKINESRSWFFERFN KIDRPLARLIKKQGEKNQRDAIKNDKGGITTNPTEIQTTIREYYKHLYANKLENLEEMDK FLDTYTLPRLNQEEVDSLN >gi568815587f:58522753_58724519|GENSCAN_predicted_CDS_7|1680_bp gtttatggaactgtctttcacataaaccatggaaatccattcaatctgaaggctgtggtg gacaagtggcctgattttaatacagtggttgtctgccctcaggagcaggatatgacagat gaccttgatcactataccaatacttaccaaatctactccaaagatccccaaaactgtcag gaattccttggatcaccagaactcatcaactggaaacagcatttacagattcaaactggt gactgtggtgacagcatgatggcaacatgtgtcgctagcaggctggcccaggcctatctt aatggtattggtcttaacataattggtcttaaccaggagcaaaaacaaagggaaagcccc aatggggcaggcacttttaagcctctccttgcattaaatttgctctttccattgatgaaa gatggtcaatgtggccaaggttcacagcctagcctgaatgaggctatacaaaatcttgca gccattaagtccttcaaagtcaaacaaacacaacgcattctctatatggcagctgaaaca gccaaggaactgactcctttcctgctgaaatcaaagattttatctcccaatggtggcaaa cccaaggccatcaaccaagagatgtttaaactctcatccatggatgttacccatgctcac ttggtgaataaattctggcattttggtggtaatgagaggagccagagattcattgagcgc tgcattcagacctttcccacctgctgtctcctggggcctgaggggacccctgtgtgctgg gatctaatggaccagactggagagatgagaatggcaggcaccttgccggaataccggctc catggccttgtgacacctaatagacatctacagaactctccacctcagatcagcagaata tacattttttcaacaccacaccacacctattccaaaattgaccacataattggaagtaaa gctcgcctcagcaaatgtaaaagaacagaaattataacaaactatctctcagaccacagt gcaatcaaactagaactcaggattaagaaactcactcaaaactgctcaactacacagaaa ctgaacaacctgctcctgaatgactaccgggtacataacgaaatgaaggcagaaataaag atgttctttgaaaccaatgagaacaaagacacaacataccagaatctctgggacacattc aaagcagtgtgtagagggaaatttatagcactaaatgcccacaagagaaagcaggaaaga tccaaaattgacaccctaacatcacaattaaaagaactagaaaagcaagagcaaacacat tcaaaagctagcagaaggcaaggaataactaaaatcagagcagaactgaaggagatagag acacaaaaaactcttcaaaaaattaatgaatccaggagctggttttttgaaaggttcaac aaaattgatagaccgctagcaagactaataaagaaacaaggagagaagaatcaaagagac gcaataaaaaatgataaagggggtatcaccaccaatcccacagaaatacaaactaccatc agagaatactacaaacacctctacgcaaataaactagaaaatctagaagaaatggataaa ttcctcgacacgtacaccctcccaagactaaaccaggaagaagttgattctctgaattga