GENSCAN 1.0 Date run: 3-Nov-116 Time: 23:02:48 Sequence gi568815575r:43849802_44058645 : 208844 bp : 39.62% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 2324 2363 40 -2.75 1.01 Init + 3112 3167 56 0 2 109 55 4 0.667 0.05 1.02 Intr + 4562 4706 145 2 1 91 67 71 0.922 4.66 1.03 Intr + 8220 8385 166 2 1 63 23 121 0.550 1.71 1.04 Term + 11315 11580 266 0 2 55 43 142 0.284 1.09 1.05 PlyA + 12594 12599 6 1.05 2.00 Prom + 14360 14399 40 -7.15 2.01 Init + 21188 21365 178 1 1 59 63 148 0.540 8.87 2.02 Intr + 27445 27628 184 2 1 65 -12 177 0.094 3.32 2.03 Intr + 28830 28885 56 0 2 68 100 21 0.085 -1.00 2.04 Intr + 32169 32480 312 1 0 45 -29 302 0.069 9.43 2.05 Intr + 33279 33439 161 1 2 21 -14 166 0.062 -1.51 2.06 Intr + 34012 34087 76 0 1 108 94 82 0.321 8.97 2.07 Term + 49565 49683 119 0 2 61 54 95 0.122 1.12 2.08 PlyA + 51192 51197 6 1.05 3.00 Prom + 52984 53023 40 -5.45 3.01 Init + 55263 55497 235 2 1 81 100 127 0.787 9.74 3.02 Term + 63329 63477 149 0 2 56 45 132 0.338 2.78 3.03 PlyA + 63496 63501 6 1.05 4.02 PlyA - 64439 64434 6 1.05 4.01 Sngl - 73175 72648 528 2 0 81 46 243 0.872 15.21 4.00 Prom - 73467 73428 40 -5.25 5.03 PlyA - 73847 73842 6 1.05 5.02 Term - 76546 76284 263 2 2 21 35 210 0.716 3.70 5.01 Init - 77118 76797 322 0 1 78 8 260 0.691 14.54 5.00 Prom - 85355 85316 40 -2.75 6.07 PlyA - 86960 86955 6 1.05 6.06 Term - 100225 99998 228 1 0 115 54 208 0.943 15.75 6.05 Intr - 104471 104231 241 1 1 63 94 142 0.769 8.93 6.04 Intr - 105218 105035 184 0 1 86 107 78 0.958 7.42 6.03 Intr - 105979 105826 154 1 1 46 27 65 0.689 -5.08 6.02 Intr - 108864 108671 194 1 2 90 116 123 0.486 13.59 6.01 Init - 114790 114778 13 1 1 63 97 20 0.184 0.75 6.00 Prom - 115024 114985 40 -8.85 7.00 Prom + 116369 116408 40 -9.75 7.01 Init + 116781 116909 129 2 0 84 45 45 0.452 -0.00 7.02 Intr + 119702 120049 348 1 0 14 77 264 0.333 12.53 7.03 Intr + 120453 120561 109 2 1 77 78 91 0.633 5.94 7.04 Intr + 121106 121232 127 0 1 64 38 71 0.555 -1.48 7.05 Intr + 121644 121723 80 0 2 75 54 80 0.143 1.58 7.06 Term + 125827 126128 302 2 2 46 52 153 0.031 1.90 7.07 PlyA + 126398 126403 6 1.05 8.00 Prom + 133582 133621 40 -4.05 8.01 Init + 138686 138810 125 1 2 52 76 121 0.926 6.99 8.02 Term + 140271 140370 100 0 1 53 52 125 0.961 2.02 8.03 PlyA + 140458 140463 6 1.05 9.04 PlyA - 140911 140906 6 1.05 9.03 Term - 143111 142974 138 2 0 64 52 129 0.450 3.88 9.02 Intr - 154541 154333 209 0 2 33 55 116 0.097 0.67 9.01 Init - 156165 156075 91 0 1 73 99 48 0.614 5.10 9.00 Prom - 160792 160753 40 -0.75 10.00 Prom + 167851 167890 40 -4.75 10.01 Sngl + 180251 181120 870 1 0 48 49 757 0.945 63.47 10.02 PlyA + 181200 181205 6 1.05 11.04 PlyA - 183205 183200 6 1.05 11.03 Term - 186253 185995 259 0 1 108 44 96 0.430 1.24 11.02 Intr - 189273 188991 283 1 1 73 75 118 0.461 4.55 11.01 Init - 196698 196545 154 0 1 72 48 98 0.487 4.39 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:43849802_44058645|GENSCAN_predicted_peptide_1|210_aa MKARKRLTTDIWKGSFLGRNVDEAGSHHSQQTNTGTEKQTPHVLIHKWELNNENTWTQGG KHHTPGPECGFSKDRHHFPEPSQIQVSCLHGSKDGGSSYSPTGLSVNPLRKFLPPVCEAY LQYNKLCESRDLACVISCFIPSTRENIWHVAILNKYLLLAGKNSDESINSPQKDEEQLIL QRTNASAAIRMHSSSKPDVSKTYFNVKVTV >gi568815575r:43849802_44058645|GENSCAN_predicted_CDS_1|633_bp atgaaggcaaggaagaggttaactacagatatttggaaagggagcttcctggggaggaac gtggatgaagctggaagccatcattctcagcaaactaacacaggaacagaaaaacaaaca ccacatgttcttattcataagtgggagttgaacaatgagaacacatggacacagggaggg aaacatcacacaccggggcctgaatgtggcttcagcaaagacagacaccattttcctgag ccctcgcagattcaagtcagctgcctgcatggctccaaagatggaggctcttcctattcc ccaacagggctttctgtcaacccactgaggaaattcctccctcctgtatgtgaagcatat ttacaatacaataagctctgtgaaagcagagatcttgcgtgtgttatttcctgctttatc cctagcacccgggagaatatctggcatgtagccatcctcaacaaatatttgttgctggca ggaaagaattcagacgagtcaattaattcacctcagaaggatgaagagcaactcattttg caaagaactaacgctagtgcggccatcagaatgcattcctcctcaaaacctgatgtatca aaaacgtattttaatgtcaaggtgaccgtttga >gi568815575r:43849802_44058645|GENSCAN_predicted_peptide_2|361_aa MGLHIEALSGFNMYEERKQDWAKEGVELRDASTTEKSISPTVNSQANVAFQSYPGLKQEG SLDLNTQSEEKKIVTIVNAKPSLKPALVTSEDTWGPRNCDSSGNWLYSHSPPTGPNTVSQ GTFKWVTGAQGSSPFFFTKASEGLAAFWGRSSKSGPRAATGFQPLASASGNAPKIAPKGG FSARRSGPTRALPVRGQSGDSRVSPCPTAAPRVSPRQPPVRAREEEGGRTAATNLRCRPP PRPPSQMPHLKYTLKQCSVSLVEAVEARPQETSGSREEVVDRHCGIQSPQKQQAKLPVPE AVGFLAVDTLVSPPSLQVNYVPGKSYQSEKWEGLSILLLQTKMANLNKRVRAATMCMCES A >gi568815575r:43849802_44058645|GENSCAN_predicted_CDS_2|1086_bp atgggtttgcatatagaagctttatcggggttcaacatgtatgaagaaaggaagcaggat tgggcaaaggaaggagttgaactgagagatgcaagcacaacagagaaatcaatcagtcct acagtgaactctcaggctaatgtggcctttcagagttatcctggattgaagcaagagggg agcctagacctcaatacccagtcagaggagaagaaaattgtgaccattgtcaatgccaag ccaagccttaaacctgccctggtgacttctgaggacacatgggggcctagaaattgtgac tctagtggcaactggctctactcacacagcccaccaaccggccccaacactgtgtcccag gggacattcaagtgggttacaggagcccagggaagttctccatttttcttcaccaaagca agcgaaggcctagcagccttctgggggagaagttcaaagtctggccccagagctgcaaca ggcttccagcctctggcctctgcgtccgggaacgcacccaaaattgcccccaaaggaggg ttcagtgcacggcgctctggacccactagagccctgcccgtgcgtggacagtctggggac tccagggtcagcccctgccccacggccgccccccgcgtctcccccaggcagccacctgtc cgagcgcgtgaagaggaaggagggcgcacagccgcgactaacctgagatgccgcccccca ccacgaccaccatcacaaatgccgcaccttaaatacactcttaagcaatgcagtgtgtcc ttggtggaagctgttgaggctagaccccaggagactagtggcagcagggaagaggtggtg gatcggcattgtgggattcagagcccccagaagcagcaagccaagttacctgtccctgaa gctgtgggcttcttggctgtggatactcttgtctccccacctagcctgcaagtgaattac gtgccaggaaaatcgtatcagtctgagaagtgggaaggtctgagtattctgcttttgcag acaaaaatggcaaaccttaataagagggtgcgagcagccacgatgtgtatgtgtgagtct gcatga >gi568815575r:43849802_44058645|GENSCAN_predicted_peptide_3|127_aa MKARSKKAFLVMLGMSFVLQTVAVLTLGVGWLVQTKGGVGWCIEAGIFLEIGQGFPKDNP IGRTENPDDMSRATEAQQGERYPFGPIARIFFLTLSELGGYKDKANVPEGSRRMQFHSSS MHTIQIR >gi568815575r:43849802_44058645|GENSCAN_predicted_CDS_3|384_bp atgaaggccagatcgaagaaagccttcctggtcatgctgggcatgagctttgtcctgcag acagtagcagtgctgacattgggagtgggatggttagtccaaactaaaggaggggtgggc tggtgcatagaggctggtatattccttgagatagggcagggatttccaaaggacaacccc ataggaaggacagagaacccagatgacatgtccagagccactgaggcccagcaaggtgaa aggtatcctttcggcccaattgcacgcattttctttctgactctcagtgaacttggaggg tataaagacaaagcaaatgtcccagaaggaagtagacgtatgcagtttcatagttcatct atgcacacaatccagatacgttga >gi568815575r:43849802_44058645|GENSCAN_predicted_peptide_4|175_aa MGKDFMTETPKAMTTKAKIDKRDLIKLKSFCTAKETTIRVNRQPTEWEKIFAVYPSDKGP ISRIYKQLKQIYKKKSNNPIKKWAKNRNRHFSKEDIYVAKKHMEKSSSSLVIRKMHIKTT VGYHLMPVRMAIIKKSGNNRCWRGCGEMLVGGSISSTIVEDSVAIPQGSRTRNTI >gi568815575r:43849802_44058645|GENSCAN_predicted_CDS_4|528_bp atgggcaaagacttcatgactgaaacaccaaaagcaatgacaacaaaagccaaaattgac aaacgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aacaggcaacctacagaatgggagaaaatttttgcagtctatccatctgacaaagggcca atatccagaatctacaagcaacttaaacaaatttacaagaaaaaatcaaacaaccccatc aaaaagtgggcaaagaataggaacagacacttctcaaaagaagacatttatgtggccaaa aaacatatggaaaaaagctcatcatcactggtcattagaaaaatgcacatcaaaaccaca gtgggataccatctcatgccagttagaatggcaatcattaaaaagtcaggaaacaacaga tgctggagaggatgtggagaaatgttagtgggagggtcaattagttcaaccattgtggaa gacagtgtggcaattcctcaaggatctagaaccagaaataccatttga >gi568815575r:43849802_44058645|GENSCAN_predicted_peptide_5|194_aa MRKNQRKKSEHSKNQNASSPKDHNSSLAREQNWIEIEFDELTEVGFRRWVITNSSELKEH VLTQCKEAKNLEKRLDELLTRITNLEKNINDLMELKNTAGELHEAYTTRQANIQIQEIQR TPQRYSSRSATPRHIIIRFTKVEMKEKMLRPAREKDRVTHKGKPIRLTVDLSAETLQARR EWGPVFNILKETNV >gi568815575r:43849802_44058645|GENSCAN_predicted_CDS_5|585_bp atgaggaaaaaccagcgcaaaaagtctgaacattccaaaaaccagaacgcctcttcccca aaggatcacaactcctcactagcaagggaacaaaactggatagagattgagtttgatgaa ttgacagaagtaggcttcagaaggtgggtaataacaaactcctctgagctgaaggagcat gttctaacccaatgcaaggaagctaagaatcttgaaaaaaggttagacgaattgctaact agaataaccaatttagagaagaacataaatgacctgatggagctgaaaaacacagcagga gaacttcatgaagcatacacaacaagacaggccaacattcaaattcaggaaatacagaga acaccacaaagatactcctcgagaagcgcaaccccaagacacatcatcatcagattcacc aaggttgaaatgaaggaaaaaatgttaaggccagccagagagaaagatcgggttacccac aaagggaagcccatcagactaacagtggatctctctgcagaaaccctacaagccagaaga gagtgggggccagtattcaacattcttaaagaaacgaatgtttaa >gi568815575r:43849802_44058645|GENSCAN_predicted_peptide_6|337_aa MKAVEKFFLTTMRKHVLAASFSMLSLLVIMGDTDSKTDSSFIMDSDPRRCMRHHYVDSIS HPLYKCSSKQFRKGLRGSQAEKTGHSGPPKRTVWPQIRMCLIVLSGGTSWVWVQKILGVA DEWRKEGLEGDEFWTWAREGISGMEGGKANWQGRPGKSQLGEEWGMLAPGQGMEPGHDYL NRLAQIFWRLTWVPEDERSYEAARGERMGEEAWVNLVPEVIEGVSRGAMEIRVHSVHCCE KGKILFHSVGVKSQKCSSRVGQMVLLARCEGHCSQASRSEPLVSFSTVLKQPFRSSCHCC RPQTSKLKALRLRCSGGMRLTATYRYILSCHCEECNS >gi568815575r:43849802_44058645|GENSCAN_predicted_CDS_6|1014_bp atgaaagctgtagagaagtttttccttacaacaatgagaaaacatgtactagctgcatcc ttttctatgctctccctgctggtgataatgggagatacagacagtaaaacggacagctca ttcataatggactcggaccctcgacgctgcatgaggcaccactatgtggattctatcagt cacccattgtacaagtgtagctcaaagcaattccggaaaggactgagagggagtcaggca gaaaagactggccacagtggcccaccaaagaggacagtgtggccacagattaggatgtgc ctgatagttttaagtggtggcacctcctgggtttgggtacaaaagatcctgggagtagca gatgagtggaggaaggaggggctggagggggatgagttttggacttgggccagggagggg atttcaggaatggagggtgggaaagcaaactggcagggaagacctggaaagtctcagttg ggtgaggagtgggggatgctggccccaggccagggcatggaaccagggcatgactatctg aaccgcctagcacagatcttttggaggttgacctgggtccctgaagatgagcggagttat gaggctgcccgtggggaaagaatgggagaagaagcatgggtgaatcttgttcctgaggtt atagagggtgtttcaagaggtgccatggaaataagggtgcatagtgttcactgctgcgaa aaaggcaaaatacttttccactctgtcggagtaaaatcgcaaaaatgctcttctagggtg ggacagatggtgctcctggccaggtgcgaggggcactgcagccaggcgtcacgctccgag cctttggtgtcgttcagcactgtcctcaagcaacccttccgttcctcctgtcactgctgc cggccccagacttccaagctgaaggcactgcggctgcgatgctcagggggcatgcgactc actgccacctaccggtacatcctctcctgtcactgcgaggaatgcaattcctga >gi568815575r:43849802_44058645|GENSCAN_predicted_peptide_7|364_aa MHECTFSLWKRKGARCSGDTCAETTKHLFCCCTAGCGMASGGEAFAGDVRVTVMVCSFLP TRSLQPRSRQPKGQLSHVAAPLCEWIQARDYKKPPSPISPAELVPRMTSWEYYGDGEIPW AVLPGWNRTQAVWKTGRAKSQLTKEGAQAKGKGGGLGAVHFTVCPPAFPSPLLPLAAGLV QRAWAKKQSLAVMGRGWKDLAVLAVQMTCTQDLVKKAKGESLNHKWKENPLMSENHFNVI GTRIKIENSTGTFPSVGVSCLVHSRREEFAEGDVYQLAEKSFKGSLLSLLFLVLFQVGFL FHGSPSGYLQESPSIQTPLGGWVVPKQVGDRTRLLAAEPEEAGVSPPLLRTAGGARCNQA WFTR >gi568815575r:43849802_44058645|GENSCAN_predicted_CDS_7|1095_bp atgcatgaatgtacattctccctgtggaaaagaaaaggtgccaggtgttctggggatacc tgtgctgaaacaactaaacacttgttctgctgctgtactgctggctgtggtatggcctca ggaggagaggcctttgccggagatgttcgggtcactgtgatggtatgctcgttcctgcca acccggtcactccagccccgctctcggcagcccaaagggcagctaagccatgtggctgct cctctctgtgagtggattcaggctcgggactacaaaaagccccccagtcctatcagcccg gcagagctggtgccaagaatgacatcctgggaatactatggagatggggagatcccatgg gctgttctccccggatggaacaggacccaagctgtctggaaaacaggcagagcaaagagc cagctgacaaaggaaggagctcaggcgaagggaaaagggggcgggctgggagcagtgcac ttcacagtctgtccacctgccttcccatcccctctgctcccactggctgctggcctagtg cagagggcctgggcaaagaagcagtcactggccgtcatgggaagaggctggaaagatcta gccgtgttagctgtgcaaatgacgtgtacccaggatctggtgaaaaaggccaaaggggaa tcccttaatcataaatggaaagaaaacccactgatgagtgagaatcatttcaatgtcatt ggaactcgaattaagattgaaaactccacaggcacatttccatctgtgggtgtcagctgc ttggtacacagtcggagagaagagtttgcagaaggagatgtctatcagctggctgagaaa tccttcaagggatcacttctcagccttttatttctggtcctgtttcaagttgggtttctc tttcatggtagccctagtggatatttacaggagtcacctagcattcaaacacccctgggg ggatgggtagtcccaaagcaggtgggagacaggacccgacttcttgctgcagaacctgaa gaagcaggtgtttcccctcccctcctcaggacagctggaggagccaggtgtaaccaagcc tggttcaccagatga >gi568815575r:43849802_44058645|GENSCAN_predicted_peptide_8|74_aa MNNEDTARTLIAHSVSMCSSTLLNPLVVDGTMEPVLANGQERTYTSGSPGSQAFSLEPKV TLLAFLVLRPLDLD >gi568815575r:43849802_44058645|GENSCAN_predicted_CDS_8|225_bp atgaataatgaagacacggcaagaacacttattgctcactcagtttccatgtgttcctcc acattactcaacccccttgttgtggatgggaccatggaaccagttctggccaatgggcag gaaaggacttacaccagtggatctccaggttctcaggccttcagccttgaaccaaaggtt acattattagctttcctggttctgagacctttggacttggattga >gi568815575r:43849802_44058645|GENSCAN_predicted_peptide_9|145_aa MAHEDVHILIPGTCEYFILHVNFEQHFHCSGKSEEQDRWFLMPKSGILGVQSQILLQGKP DSEHGLLGCCFLELSHPTHSHSDRGSPDCWDIWDMGIQETTSELWASWPLDSRTYTSGSP GSQAFSLEPKVTLLAFLVLRPLDLD >gi568815575r:43849802_44058645|GENSCAN_predicted_CDS_9|438_bp atggcccatgaagatgttcacatcctaattcctggaacctgcgaatattttatattacat gttaattttgagcagcatttccactgttctggaaagtcagaagagcaggacaggtggttc ctgatgccaaagtcaggaattctgggagtacagagccagatactcctccagggaaagcct gactcagagcatgggctcctgggatgctgcttcctggaactctcacaccccacccactca cactcagatcggggaagtccagactgttgggacatttgggacatgggcattcaggaaact acatcagaactctgggcttcatggcctttggactccaggacttacaccagtggatctcca ggttctcaggccttcagccttgaaccaaaggttacattattagctttcctggttctgaga cctttggacttggattga >gi568815575r:43849802_44058645|GENSCAN_predicted_peptide_10|289_aa MYKREVLDLRGFRWTHYTFNITEDTLRGFFEPFGRIESIQLTIDSETGRSKGYGFITFSD SECAKKSLEQFNGFELAGRPMKVGHVIERADASSTSSFLDSDELERTRIDLGTTGRLQLV ARLAEDTGLQIPPAAQQALQMSGSLAFGAVAEFSFVMDLQTRLSQQTEASALAAAASVQP LATQCFQLSNMFNPQTEEVGWDTEIKDDVIEECNKHGGIIHIYVDKNSAQGNVYVKCSSI AAAIAAVNALHGRWFAGKMITAAYVPLPTYHNLFPDSMTATQLLVPSRR >gi568815575r:43849802_44058645|GENSCAN_predicted_CDS_10|870_bp atgtacaaaagggaagtgctggacctaagaggctttaggtggactcattacacttttaac ataactgaagatacgcttcgtgggttctttgagcctttcggaaggattgaaagtatccag ctgacgatagacagtgaaactggtcgatccaagggatatggatttattacattttctgat tcagaatgtgccaaaaagtctttggaacaatttaatggatttgaactagcagggaggcca atgaaagttggtcatgttattgaacgtgctgatgcttcgagtactagttcatttttggac agtgatgaactggaaaggactagaattgatttgggaacaactggtcgtcttcagttagtg gcaagacttgcagaggatacaggtttgcagattccaccagcagcacagcaagctctacaa atgagtggctctttggcatttggtgctgtggcagaattttcttttgttatggatttgcaa acaagactttcccagcagactgaagcttcagctttagctgcagctgcttctgttcagcca cttgcaacacagtgtttccaactctctaacatgtttaaccctcaaacagaagaagttgga tgggatacagagattaaggacgatgtgattgaggaatgtaataaacatggaggaattatt catatttatgttgacaaaaattcagctcagggcaatgtctatgtgaagtgctcatcaatt gctgcagctattgctgctgtcaatgcattgcatggcaggtggtttgctggtaaaatgata acagcagcatatgtacctcttccaacttaccacaacctgtttcctgattctatgacagca acacaactactggttccaagtagacgatga >gi568815575r:43849802_44058645|GENSCAN_predicted_peptide_11|231_aa MPNITNDQGNANQNHNTIRPYSCKNGHNQKIKKIMDVGVDAVNREHFFYTADNHTKTHLD VSPTAFNESLINVGEETLKCPSKISGKQELTPFSETKSSKAHIQFCSFHLLNYSNTAISL NGPLRNSTSNRQALPCLSPTVINMLPSEALGRRSSENLLLGIIKRCTSGTTHLKIPKIVS GYNTSPQTSIFYNSVLLLWAFRKVKEAWAAVYQKVAFLSTGDLLATSEVVS >gi568815575r:43849802_44058645|GENSCAN_predicted_CDS_11|696_bp atgcccaacatcactaatgatcagggaaatgcaaatcaaaaccacaatacaatacgacct tactcctgcaagaatggccataatcaaaaaattaaaaaaataatggatgttggagtggat gcagtgaacagggaacacttcttctacactgctgataaccacacaaaaacacatttagat gtctcacctactgctttcaatgaaagcttaataaatgtgggggaagaaaccctcaagtgt ccttcaaaaatatccgggaaacaagagctcacacccttttctgaaacaaagtcgagtaag gcacacatccaattctgttcctttcacctgctgaactacagcaatacggccatatccctg aatggccccctaagaaactcaacaagtaacaggcaagctctgccatgtttatcacctaca gtgataaatatgctaccatctgaagctcttggcagaagatcttctgaaaatttgcttctg ggaataatcaagaggtgtacttcaggtactacccatttgaagattcccaaaattgtctca ggttataataccagccctcagacttccatattttacaactctgtgttattattatgggct ttcagaaaagtaaaggaagcttgggctgcagtttatcagaaggtggctttcctgtcaacg ggggacctgcttgccacatctgaagttgtgtcatag