GENSCAN 1.0	Date run: 6-Nov-116	Time: 11:57:12

Sequence gi568815594r:73998601_74199120 : 200520 bp : 36.90% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    595    762  168  1  0  121  111    26 0.561   6.34
 1.02 Intr +  14964  15041   78  0  0   63  121    63 0.291   4.85
 1.03 Intr +  16610  16731  122  2  2   53   53    63 0.504  -1.48
 1.04 Intr +  16922  17005   84  0  0   55   94    72 0.498   3.37
 1.05 Intr +  20447  20566  120  0  0   78   82    28 0.232   0.75
 1.06 Term +  23675  23832  158  0  2   59   35    99 0.259  -1.19
 1.07 PlyA +  24404  24409    6                               1.05

 2.05 PlyA -  25245  25240    6                               1.05
 2.04 Term -  34456  34277  180  1  0  154   49    27 0.564   2.03
 2.03 Intr -  39576  39493   84  0  0  126   89    99 0.910  12.90
 2.02 Intr -  39813  39731   83  1  2   96   73    92 0.782   6.94
 2.01 Init -  40011  39912  100  0  1   80   94   205 0.999  18.77
 2.00 Prom -  40124  40085   40                              -4.85

 3.05 PlyA -  40319  40314    6                              -5.80
 3.04 Term -  40880  40611  270  2  0   -8   54   276 0.822   9.10
 3.03 Intr -  44617  44551   67  0  1   80   91    32 0.009   0.69
 3.02 Intr -  55810  55675  136  2  1   56   75   107 0.004   4.91
 3.01 Init -  60678  60618   61  0  1   87   99    46 0.332   7.10
 3.00 Prom -  71107  71068   40                              -4.95

 4.06 PlyA -  71730  71725    6                               1.05
 4.05 Term -  75139  75069   71  2  2   75   39    63 0.232  -2.78
 4.04 Intr -  77859  77775   85  0  1  108  121    31 0.367   6.87
 4.03 Intr -  87955  87782  174  1  0   58   49   157 0.019   8.01
 4.02 Intr -  91197  91028  170  1  2   93   45    57 0.355   0.64
 4.01 Init -  96014  95816  199  2  1   96   22   155 0.325   8.81
 4.00 Prom -  96335  96296   40                              -7.15

 5.08 PlyA -  96739  96734    6                               1.05
 5.07 Term -  97227  97089  139  2  1   66   33    95 0.174  -1.75
 5.06 Intr - 100084 100001   84  0  0  134  106    87 0.989  13.12
 5.05 Intr - 100322 100199  124  0  1  107  103    96 0.985  11.72
 5.04 Intr - 100977 100421  557  2  2   49   94   339 0.029  22.16
 5.03 Intr - 113000 112804  197  2  2   84   97   116 0.917   9.39
 5.02 Intr - 113934 113846   89  1  2   47   64    84 0.041   0.67
 5.01 Init - 114993 114972   22  0  1   90   81    14 0.038   0.81
 5.00 Prom - 119408 119369   40                              -4.65

 6.00 Prom + 129010 129049   40                              -5.05
 6.01 Init + 131391 131554  164  2  2   74   72    44 0.235   0.71
 6.02 Intr + 135255 135554  300  0  0   59   69   285 0.562  18.62
 6.03 Term + 139960 140104  145  1  1   21   53   174 0.066   3.60
 6.04 PlyA + 143302 143307    6                               1.05

 7.02 PlyA - 145540 145535    6                               1.05
 7.01 Sngl - 149064 148828  237  0  0   60   43   165 0.477   4.04
 7.00 Prom - 157709 157670   40                              -1.55

 8.03 PlyA - 157936 157931    6                               1.05
 8.02 Term - 159961 159010  952  0  1   52   47   621 0.948  44.95
 8.01 Init - 168788 168637  152  2  2   36   43   154 0.395   3.82
 8.00 Prom - 171954 171915   40                              -7.55

 9.00 Prom + 172844 172883   40                              -7.65
 9.01 Init + 174885 174949   65  2  2   81   80    36 0.932   2.78
 9.02 Intr + 175906 176090  185  1  2   83   80    52 0.424   2.41
 9.03 Intr + 176681 176803  123  0  0  124   88   128 0.859  16.04
 9.04 Intr + 194042 194073   32  0  2   86   91    57 0.088   2.73

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  56130  56269  140  1  2   62   49   173 0.915   7.94
S.002 Init - 100988 100421  568  2  1   43   94   358 0.834  27.17
S.003 Init - 114993 114908   86  0  2   90   81    89 0.877   8.74

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:73998601_74199120|GENSCAN_predicted_peptide_1|243_aa
XQVILGFLNDFMNLWTSTSFGSQDNMITRDPCLLPLEVSGLKGPLNFVNSPTDHLLWLFQ
ACVVFILPVSAYNVTFSVTIVLRPIRGVNPLSCKSGTGLHPTRALADCLIVLWFTPQLTS
ITGMSDLKIHLLLVNASNIEFYGKPLILNDREKYSLTFLSESNTTMEVGYEIDLESYEQN
FFKREIEWNRKFEKGVMFSLYAWRYPYPVLVLSTLSIPVEAFANFLVRLGIFPRTSMDST
ADT

>gi568815594r:73998601_74199120|GENSCAN_predicted_CDS_1|732_bp
nngcaggtcattctaggtttcttgaatgactttatgaatctctggacttcaaccagcttt
ggaagccaagataacatgataacaagagacccttgtctcttgccacttgaagtcagtggc
ctgaaaggaccactgaattttgttaactcccctacagatcacctactttggctctttcag
gcttgtgtcgtttttatccttccagtgtcagcttacaatgtcaccttctcagtgaccatt
gttctaaggcctatccggggagtgaatcctctcagctgcaagtctgggacgggacttcat
cctactagagccctagctgattgcttgatagttctttggtttacaccacagttgacatcc
ataacaggaatgtctgacttaaagattcatttgttactggtcaatgcatctaacattgag
ttttacggaaaacccctgatacttaatgacagagagaagtattctctgacttttctgagt
gagtcaaataccactatggaagtaggttatgaaatagatttagagagttatgagcagaac
ttttttaaaagggaaatagaatggaatagaaaatttgaaaaaggagttatgttctcactc
tatgcgtggcggtatccatatcctgtcttggtactttctacgctttccattcctgttgaa
gctttcgctaacttccttgtgaggctgggaattttccccaggacttctatggattcaaca
gcagatacataa

>gi568815594r:73998601_74199120|GENSCAN_predicted_peptide_2|148_aa
MAHATLSAAPSNPRLLRVALLLLLLVAASRRAAGASVVTELRCQCLQTLQGIHLKNIQSV
NSHTQEWEESLSQPRIPHGSENHRKDTEQDELHLPESLAASRTSMDFPIFMPLITWLSPT
GAFSIPQCCPIYLENSPSFKTHISNDFL

>gi568815594r:73998601_74199120|GENSCAN_predicted_CDS_2|447_bp
atggcccacgccacgctctccgccgcccccagcaatccccggctcctgcgggtggcgctg
ctgctcctgctcctggtggccgccagccggcgcgcagcaggagcgtccgtggtcactgaa
ctgcgctgccagtgcttgcagacactgcagggaattcacctcaagaacatccaaagtgtg
aatagccacactcaagaatgggaagaaagcttgtctcaaccccgcatcccccatggttca
gaaaatcatcgaaaagatactgaacaagatgaactccatctacctgagtcacttgctgct
tctaggacttccatggactttccaatcttcatgcctttgattacttggttgtctcctact
ggagcgttttccattcctcagtgttgtcccatctacttggaaaactctccatcctttaaa
actcacataagtaacgatttcttgtag

>gi568815594r:73998601_74199120|GENSCAN_predicted_peptide_3|177_aa
MGIQNSPALLLMAVIVFGTFAVSVDSDLYTELRCVYVKSTFVLHPRNIHNLELVSAGPHC
SKDEVMMEQCLSLGSSKMQNLSHEPAMQREEGRYAGYKRRGHVIQPWLPRTLTLNSNFDT
DNLLPPNGKRKQGILSVIREYAKQGTSRTFFSGIRDDGCTFTESMMLDVHEITLNRK

>gi568815594r:73998601_74199120|GENSCAN_predicted_CDS_3|534_bp
atgggtattcagaactcaccagcactcctcctgatggctgtcattgtgtttggcacattt
gctgtaagtgtagacagtgacttgtacactgaactgcgctgcgtgtatgtgaagtcaacc
tttgtacttcatcccagaaacatccacaatttggagttggtctcagcaggaccccattgc
agcaaagacgaagtaatgatggagcagtgcctaagtttaggctcctccaaaatgcagaat
ttgagtcatgagcctgccatgcagagggaggaaggacgttatgcaggatacaaaagaaga
ggtcatgttatacagccctggcttccacggacactaacactgaattcaaattttgacact
gataatctgttgccaccaaatggaaaacgtaaacaaggtattctaagtgtgattagagaa
tatgcaaaacaaggaacaagtagaacattcttctctggaatccgagacgatggctgtact
ttcacagagagcatgatgttagatgtacatgaaataacgctaaaccgaaaatga

>gi568815594r:73998601_74199120|GENSCAN_predicted_peptide_4|232_aa
MVLVIGPQANSMDIVWVSILDQVDVKIVLFTPNCQEVASTTQSPIERVMAGSGTQAYLRW
WSQGEGIYLGSVITRAEGTNMPTSPTGCFNAAALAPASAKEFPLATLESPFGCPLQWIIA
TSLDLETLGFDKTQAETTVSALTTLSNVSPDTTYKETVTQAQQKITVQQLMAHLDSIRKN
MGLEKLRKGSRCNEIIFNIKQNIETVKLKVPQNKTVCGNEVFTEVTRLSGVH

>gi568815594r:73998601_74199120|GENSCAN_predicted_CDS_4|699_bp
atggtccttgtcattggtcctcaggccaactccatggacattgtctgggtctccatctta
gatcaagtagatgtgaaaattgtgcttttcacaccaaattgccaagaagttgcaagtacc
acgcagtctcctatagaacgtgttatggcaggaagtgggacacaagcctatttgaggtgg
tggtcacagggtgaaggaatatatcttggttctgtcattaccagggcagagggcactaac
atgcctacaagtccaactggctgctttaatgcagctgccttagctcctgcatctgcaaag
gagtttcccttagccacactagagtctccttttggatgccctctgcaatggattatagct
acttccttggacttggaaactcttggatttgacaaaacacaagcagaaacaacagtatca
gcattaactactttatcaaatgtcagcccggatactacctataaagagacggtcactcaa
gctcaacagaaaataacagtacaacagctaatggctcatttggactctatcaggaaaaac
atgggactggaaaaactcagaaagggttccagatgtaatgagattatatttaacattaag
caaaatatagaaactgtgaaattaaaagtacctcagaataagactgtatgtggaaatgaa
gtctttacagaggtaaccaggttaagtggagttcattag

>gi568815594r:73998601_74199120|GENSCAN_predicted_peptide_5|403_aa
MTSATASGSFCIGSRSLDEGNAVAPENSEMPATVEPQSPIEHINCQFIRNTTNNNNKKPL
QSGNVKFIRITQQEEQCSDSILVSQKEDVQRIGLKMFKTDLTREVTPPGKDVARSLRGSK
GSDPRRTALGSRSASSQAVISVSLRAAGSRSRSRDSGQKENIPQLAGVTQDSQTRTSLVS
APTPLHPRGGAIAFLPNSGSIWSSGNFPGPGLRAFQPQPCIKGVRRSRRATEPGPQAAPC
QLSSSHSRSNRLLSPMARATLSAAPSNPRLLRVALLLLLLVAASRRAAGAPLATELRCQC
LQTLQGIHLKNIQSVKVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIEKMLKNGI
LMGPHIAFALWGTLYSVLPQDDSSTPGFQDRNTIVMSKEERPQ

>gi568815594r:73998601_74199120|GENSCAN_predicted_CDS_5|1212_bp
atgacttcagcaactgcttctggcagcttttgcattgggagccggagtctagatgagggg
aatgcagtggcacctgaaaactcagagatgccagcaactgtggagccccagtctccaata
gaacatatcaactgccaattcatcagaaatactaccaacaacaacaacaaaaaacccctt
caatcaggaaacgttaaatttattagaattactcaacaggaggaacaatgctctgacagc
attttagtgtctcagaaggaggatgttcagagaattggactcaaaatgtttaaaactgat
ctgacaagagaagtaactccccccggtaaggatgtagcgcggtccctacgtgggtctaag
ggatctgacccacgacgcactgcactgggttcacgaagcgcctcctcgcaggcggttatc
tcggtatctctgagagcggcgggctctcgctcccgctccagggattcggggcagaaagag
aacatcccacagttggcgggagttacgcaagacagtcagacccggacgtcactcgtgagt
gccccgacccccctccaccccagaggcggggccatcgccttccttccgaactcgggatcg
atctggagctccgggaatttccctggcccgggactccgggctttccagccccaaccatgc
ataaaaggggttcgccgttctcggagagccacagagcccgggccacaggcagctccttgc
cagctctcctcctcgcacagccgctcgaaccgcctgctgagccccatggcccgcgccacg
ctctccgccgcccccagcaatccccggctcctgcgggtggcgctgctgctcctgctcctg
gtggccgccagccggcgcgcagcaggagcgcccctggccactgaactgcgctgccagtgc
ttgcagaccctgcagggaattcacctcaagaacatccaaagtgtgaaggtgaagtccccc
ggaccccactgcgcccaaaccgaagtcatagccacactcaagaatgggcagaaagcttgt
ctcaaccccgcatcgcccatggttaagaaaatcatcgaaaagatgctgaaaaatggaatt
ttgatgggacctcatatagcgtttgccctctggggaactttgtattcagtgctgccccag
gatgacagcagcacaccaggttttcaggacagaaacacaattgtcatgtcaaaggaagaa
agacctcaataa

>gi568815594r:73998601_74199120|GENSCAN_predicted_peptide_6|202_aa
MDKFLDTYTLPRLNQEEVESLNRPITSSEIEAVINSLPTKKSPGPDKFTAEFYQRNQKEI
HSSKIFTSSNPEILYEDETVPVGTEKWKHSEQMVGEPDSTSMRPLPCITPGISSMKNLPS
THGFCTEKREIEIVNQLSHLLRFPARRPAFALNHRNNGEPLARSRAAMGTSLDQERSRHP
AGSGGVEVNGGSGRAANSSGGW

>gi568815594r:73998601_74199120|GENSCAN_predicted_CDS_6|609_bp
atggacaaattcctggacacatacaccctcccaagactaaaccaggaagaagtcgaatcc
ctgaatagaccaataacaagttctgaaattgaggcagtaattaatagcctaccaaccaag
aaaagcccaggaccagacaaattcacagccgaattctaccagagaaaccaaaaagaaata
cacagcagcaagatcttcaccagcagcaacccagagatcctgtatgaggatgagacagtt
cctgtaggtacagagaagtggaaacattctgagcagatggtaggagaaccagattctaca
tccatgaggcctcttccctgcattacaccaggcatcagcagtatgaaaaatctgccctca
actcatggtttctgcactgaaaaacgtgaaattgagattgtcaaccagctttcccatctt
cttcggtttcctgccaggagacctgcctttgccttaaaccacaggaacaatggcgagcct
ttagcccgatccagagcggcaatgggaacctcgctggatcaggagcgcagcagacaccct
gctggatccggaggagtggaagtcaatggagggtctgggagggcagcaaacagcagtggt
ggatggtga

>gi568815594r:73998601_74199120|GENSCAN_predicted_peptide_7|78_aa
MWRNWKDEIEVDNQIFLHLGFPGRTHVPDLTCRKHYECLNGEISLRTAKDKRGEVETLLC
NSPKGDAKLEWLFSTTKL

>gi568815594r:73998601_74199120|GENSCAN_predicted_CDS_7|237_bp
atgtggagaaattggaaagatgagattgaggtagacaaccagatttttctccatcttgga
ttccctggcaggacacatgtccctgacttaacctgcaggaagcattatgaatgcctgaat
ggagaaatatccctgaggacagccaaagacaaaaggggagaagtagaaactctgctctgt
aactcgcccaaaggagatgccaaattagagtggctgttcagcaccaccaagctgtag

>gi568815594r:73998601_74199120|GENSCAN_predicted_peptide_8|367_aa
MVHKPVAFHCCLSLEEVGAYSSLSKLASAGKALHQSVDLEILDGPSGMAHGASNVFLNQV
HLLQRWNILVREPRAEHVFVPQAEATATRPRDRQGHLTDKAVLDPARRRAAAHEPWAPRA
RAPPAPPPTSHRQRLRTRRPQPSYLTPLLRKPRNALPGSPGALTEGAVLLPNAGARPRRP
RSSEKPRTGTVMAADPRLPDWGSTSRDSSPRGWRPGSALTRAGRAASSASQPGSGRHAEW
GKERFPGPGTELTECRVRRGRGQDPSPGPSEDVEACSVAWRLGPAPSQHIAGIPIATAQR
RPPNYHFRQHAPLSTKPEILPNSLRDYVELDLKQERSNRFIVSHFLCEIPWKLEKRTKVC
GEGAKPG

>gi568815594r:73998601_74199120|GENSCAN_predicted_CDS_8|1104_bp
atggttcacaagcctgtggcctttcattgctgtctatcgcttgaagaagtaggcgcctat
tccagtctttctaaactggcttcagcagggaaagcccttcaccagtcagtggatctagag
attcttgatgggccatccggcatggcccatggtgcctcaaatgtcttcctaaatcaagtg
cacctgctgcaaagatggaacattctggtgcgagagcccagagcagagcacgttttcgtt
ccccaagctgaggcaactgcgactcgcccgcgcgaccggcaaggccacctcacagacaag
gctgtcctcgacccagcccgccgccgcgcggccgcccacgagccttgggccccacgcgcg
cgagcgccgcccgcgcccccgccgacctcccaccgccagcggctccgcacccggcgcccg
cagccctcgtacctcacaccgctgctccgaaagccccggaacgcactcccgggctctccc
ggtgcccttacggagggtgctgtgcttctgcccaacgccggcgctcggccaaggcggccg
cggagcagcgagaagccgcggaccggcaccgtcatggccgcggatccccggcttccggac
tggggctccacctcccgcgactccagtcctcgcggctggaggcctggctccgccctcacc
cgggctggcagggcagcaagtagcgcttcccagccaggttccggcagacacgcagagtgg
ggtaaagagcggttcccaggaccggggacggaactgacagagtgccgggtccgcaggggg
cggggccaggacccaagtcccgggccctctgaggatgtggaagcgtgcagtgtggcctgg
cgactgggccccgcacccagccaacatatagctggcattcccatagcgacggcacagaga
aggcctccaaactatcactttaggcagcacgccccactttccaccaagccagagatcctg
cccaacagtctccgtgactacgtggaattggatttaaagcaggaaagaagtaataggttc
attgtttcccacttcctttgtgagattccttggaagttggagaaaagaacaaaggtgtgt
ggagaaggagcaaaacctggctga

>gi568815594r:73998601_74199120|GENSCAN_predicted_peptide_9|135_aa
MARLTAVVRGHAWGHGITFFFEHEAIIISGTEMAKHIQKEIQRGVESWVSLGNRRPHLSI
ILVGDNPASHTYVRNKIRAASAVGICSELILKPKDVSQEELLDVTDQLNMDPRVSGILVQ
LPLPDSRYLFHPHTQ

>gi568815594r:73998601_74199120|GENSCAN_predicted_CDS_9|405_bp
atggcaagactcacagctgttgtccggggccatgcctggggccatggaataacatttttc
tttgaacatgaagccattattatatcaggaaccgaaatggccaagcatatccagaaagaa
atacagcgaggtgtggaatcatgggtttcccttggaaacagaagacctcacctcagtata
attttagtgggagataacccagcaagccatacatatgtcaggaataagataagagctgcc
tctgctgtaggtatttgtagtgagctcattctaaaacctaaggatgtttctcaggaagaa
cttttggacgtaactgatcaattgaatatggacccaagagtcagcggtatattagttcag
ttaccactaccagacagtcgctatctctttcatcctcacactcag