GENSCAN 1.0 Date run: 2-Nov-116 Time: 19:38:47 Sequence gi568815588r:60771485_61011542 : 240058 bp : 39.30% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1497 1513 17 2 2 81 113 14 0.592 2.96 1.02 Intr + 6879 7042 164 0 2 92 -51 165 0.089 1.70 1.03 Intr + 13221 13377 157 1 1 92 91 107 0.799 9.55 1.04 Intr + 14180 14303 124 2 1 84 93 62 0.995 5.97 1.05 Intr + 20406 20569 164 2 2 51 70 127 0.997 5.05 1.06 Intr + 20664 20805 142 0 1 69 99 94 0.996 8.03 1.07 Term + 22393 22491 99 0 0 115 33 77 0.966 2.05 1.08 PlyA + 22717 22722 6 1.05 2.04 PlyA - 22799 22794 6 1.05 2.03 Term - 36014 35821 194 0 2 73 50 61 0.277 -2.60 2.02 Intr - 37109 36975 135 0 0 90 106 93 0.463 10.92 2.01 Init - 38750 38600 151 2 1 5 54 150 0.536 3.55 2.00 Prom - 45505 45466 40 -2.55 3.09 PlyA - 45931 45926 6 1.05 3.08 Term - 53652 53560 93 0 0 75 38 55 0.048 -3.95 3.07 Intr - 55388 55292 97 1 1 86 84 76 0.027 6.09 3.06 Intr - 68929 68854 76 2 1 40 70 61 0.007 -2.85 3.05 Intr - 70336 70183 154 1 1 47 91 87 0.109 3.62 3.04 Intr - 76954 76875 80 2 2 52 38 68 0.063 -3.45 3.03 Intr - 77616 77484 133 0 1 99 60 151 0.900 12.70 3.02 Intr - 84477 84296 182 1 2 43 76 102 0.443 3.17 3.01 Init - 88023 87873 151 0 1 88 68 79 0.894 6.15 3.00 Prom - 98126 98087 40 -4.55 4.12 PlyA - 99738 99733 6 1.05 4.11 Term - 100167 99998 170 1 2 51 49 227 0.816 12.16 4.10 Intr - 100806 100701 106 0 1 73 94 7 0.906 -1.33 4.09 Intr - 103009 102812 198 1 0 14 58 151 0.645 3.33 4.08 Intr - 103558 103470 89 2 2 129 72 71 0.905 8.37 4.07 Intr - 106574 106424 151 2 1 50 110 97 0.434 6.91 4.06 Intr - 114746 114628 119 0 2 65 94 150 0.368 12.56 4.05 Intr - 117701 116728 974 1 2 76 100 1018 0.808 91.49 4.04 Intr - 121511 121326 186 1 0 77 121 155 0.362 15.58 4.03 Intr - 135284 135153 132 1 0 106 77 116 0.840 11.14 4.02 Intr - 139506 139403 104 0 2 108 97 135 0.997 14.45 4.01 Init - 140046 139867 180 0 0 91 94 255 0.670 25.63 4.00 Prom - 145053 145014 40 -7.05 5.04 PlyA - 145088 145083 6 1.05 5.03 Term - 150458 150250 209 0 2 57 37 118 0.663 0.12 5.02 Intr - 151123 150981 143 0 2 57 84 97 0.406 5.28 5.01 Init - 153729 153668 62 0 2 74 116 46 0.666 6.77 5.00 Prom - 163657 163618 40 -4.75 6.08 PlyA - 164491 164486 6 1.05 6.07 Term - 167838 167590 249 0 0 118 44 110 0.633 4.32 6.06 Intr - 173552 173341 212 0 2 88 86 87 0.327 6.21 6.05 Intr - 186636 186503 134 2 2 63 98 38 0.016 1.67 6.04 Intr - 203378 203291 88 0 1 119 100 5 0.120 2.91 6.03 Intr - 218214 218099 116 2 2 94 83 30 0.086 2.27 6.02 Intr - 225083 224951 133 0 1 65 14 128 0.009 1.88 6.01 Init - 230065 229915 151 1 1 75 74 157 0.857 11.25 6.00 Prom - 231920 231881 40 -3.65 7.05 PlyA - 232680 232675 6 1.05 7.04 Term - 233284 233016 269 2 2 22 48 160 0.227 0.07 7.03 Intr - 235566 235387 180 1 0 61 82 76 0.705 3.22 7.02 Intr - 236273 235798 476 1 2 -21 77 377 0.754 17.48 7.01 Init - 236519 236374 146 2 2 79 86 137 0.973 12.24 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 180226 180148 79 1 1 105 50 92 0.973 8.37 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:60771485_61011542|GENSCAN_predicted_peptide_1|288_aa MKELVSHPGRPAQRSWALIGCFESLRATRLVNPGPFSAVSLKLLALGFKAGSWKLSGERR GTYGVVYKGRHKTTGQVVAMKKIRLESEEEGVPSTAIREISLLKELRHPNIVSLQDVLMQ DSRLYLIFEFLSMDLKKYLDSIPPGQYMDSSLVKVVTLWYRSPEVLLGSARYSTPVDIWS IGTIFAELATKKPLFHGDSEIDQLFRIFRALGTPNNEVWPEVESLQDYKNTFPKWKPGSL ASHVKNLDENGLDLLSKMLIYDPAKRISGKMALNHPYFNDLDNQIKKM >gi568815588r:60771485_61011542|GENSCAN_predicted_CDS_1|867_bp atgaaggagcttgtaagccacccgggaaggcctgcccagcgtagctgggctctgattggc tgctttgaaagtctacgggctacccgattggtgaatccggggccctttagcgcggtgagt ttgaaactgctcgcacttggcttcaaagctggctcttggaaattgagcggagagcgacgc ggtacctatggagttgtgtataagggtagacacaaaactacaggtcaagtggtagccatg aaaaaaatcagactagaaagtgaagaggaaggggttcctagtactgcaattcgggaaatt tctctattaaaggaacttcgtcatccaaatatagtcagtcttcaggatgtgcttatgcag gattccaggttatatctcatctttgagtttctttccatggatctgaagaaatacttggat tctatccctcctggtcagtacatggattcttcacttgttaaggtagtaacactctggtac agatctccagaagtattgctggggtcagctcgttactcaactccagttgacatttggagt ataggcaccatatttgctgaactagcaactaagaaaccacttttccatggggattcagaa attgatcaactcttcaggattttcagagctttgggcactcccaataatgaagtgtggcca gaagtggaatctttacaggactataagaatacatttcccaaatggaaaccaggaagccta gcatcccatgtcaaaaacttggatgaaaatggcttggatttgctctcgaaaatgttaatc tatgatccagccaaacgaatttctggcaaaatggcactgaatcatccatattttaatgat ttggacaatcagattaagaagatgtag >gi568815588r:60771485_61011542|GENSCAN_predicted_peptide_2|159_aa MCLLDGCNGDTSWGKSGSHKLKMLEHHRSGSLNDLLQESHPAIVFIVQGCPRRGTGLQEA VMMSGIWENGWEVTVEKSSNNKIKIKWILQVIRKESRMHSFCLACNLELLGPELSQIPPH ILSGALLQRPSGSGQVSYMLKDQERQQGEAVQIGKMESA >gi568815588r:60771485_61011542|GENSCAN_predicted_CDS_2|480_bp atgtgcctgctggatggctgcaatggagacacctcctggggaaagtctggaagccacaag ttgaagatgctagagcatcaccggtctggatccctgaatgatttgctgcaggaaagccat cctgccatcgtttttattgtccagggctgtcctagaagaggaacagggctacaggaagca gtgatgatgtctggcatttgggaaaatggctgggaagtaactgtggagaagagctccaac aacaaaataaaaataaaatggattctccaagtaataagaaaagaaagtagaatgcacagc ttctgccttgcctgcaatttagaactcctgggaccagaactcagccaaattcctccccat atcttatcaggagctctccttcaaaggccttcagggtctgggcaggtatcatatatgctt aaagatcaagaaaggcaacagggagaagcagtgcaaattggcaaaatggagagtgcttag >gi568815588r:60771485_61011542|GENSCAN_predicted_peptide_3|321_aa MAFKVVVSHLGDIIEKQLGWLGSIIFKLLKLLNQEGAPPGEVTILMTKTLVWPSEATSRT PLIYTYSSLWSSSQPGWCQQDTHWTRLLSFSLPCLEEVQKGPGLSGLGLQELLSPHFSKC GPWTSRISITWERARDAASQAPLRPAGPEVAFKIPGRTQQKLQTAAVPHEARARLTQVIL EQLLSYLTIPVSWRQLSLYASGRSSRTENLQVLAPPPPPAPCQQALNEAVLFSDTREMLL PIPRSVSSNSPVSEHHEILYHQNTDSPPQAACIPESETRLPVDGRVRPDFGGSEWAGKLF IINVLSNSDALGSPCSVRHQP >gi568815588r:60771485_61011542|GENSCAN_predicted_CDS_3|966_bp atggcctttaaggtggttgtttctcatctgggagatattatagagaaacagctgggttgg ttagggtcgataatcttcaaattgttgaaactgctaaatcaagagggtgctccacctgga gaggtgaccatcttaatgaccaaaaccttggtgtggcccagtgaagccacaagtcggaca cccctgatttacacttactcctccctgtggagcagcagccaacctggttggtgtcagcag gatactcactggaccaggttgctcagtttctctctcccttgccttgaggaagtgcagaaa gggccaggactctctgggttgggacttcaagagcttttaagccctcacttctcaaagtgt ggtccatggaccagccgcatcagcatcacgtgggagcgtgctagagatgcagcttctcag gccccactcagacctgctggaccagaagttgcatttaagatcccaggaagaacacagcaa aaattgcaaacagctgctgtccctcatgaggcaagagccagactgacccaggtcatcctg gagcagctactgtcctatttaacaatacctgtttcatggagacaactgtctctctacgct tctggaagaagttctaggacagaaaatctccaggtcttggccccgcccccgccccctgct ccttgtcagcaggccctcaatgaagctgtgttattctcagacacccgggaaatgcttctc cccatccctcgttctgtgtcctcaaattcacctgtatctgaacaccatgaaatcttgtat caccaaaacactgattctcctcctcaagctgcctgcattccagaatctgagaccagatta cctgtggatggaagagtaaggccagactttgggggtagcgaatgggctggaaaattgttc atcatcaatgtcctctccaattctgacgctctgggatccccctgctcagttaggcatcaa ccataa >gi568815588r:60771485_61011542|GENSCAN_predicted_peptide_4|802_aa MDYERPNVETIKCVVVGDNAVGKTRLICARACNTTLTQYQLLATHVPTVWAIDQYRVCQE VLERSRDVVDEVSVSLRLWDTFGDHHKDRRFAYGRSENLQTLNKAYVADSDNNYSLIRVP QAPGLCYVLHVSCPASLLWSDVVVLCFSIANPNSLNHVKSMWYPEIKHFCPRTPVILVGC QLDLRYADLEAVNRARRPLARPIKRGDILPPEKGREVAKELGLPYYETSVFDQFGIKDVF DNAIRAALISRRHLQFWKSHLKKVQKPLLQAPFLPPKAPPPVIKIPECPSMGTNEAACLL DNPLCADVLFILQDQEHIFAHRIYLATSSSKFYDLFLMECEESPNGSEGACEKEKQSRDF QGRILSVDPEEEREEGPPRIPQADQWKSSNKSLVEALGLEAEGAVPETQTLTGWSKGFIG MHREMQVNPISKRMGPMTVVRMDASVQPGPFRTLLQFLYTGQLDEKEKDLVGLAQIAEVL EMFDLRMMVENIMNKEAFMNQEITKAFHVRKANRIKECLSKGTFSDVTFKLDDGAISAHK PLLICSCEWMAAMFGGSFVESANSEVYLPNINKISMQAVLDYLYTKQLSPNLDLDPLELI ALANRFCLPHLVALAEQHAVQELTKAATSGVGIDGEVLSYLELAQTCPLLPEKVQDSALA SPWSNFPKAQAKLCLTFLIHRKWDDSLHPTNLTDYSENEVGENVQERALKSFHNAHQLAA WCLHHICTNYNSVCSKFRKEIKSKSADNQEYFERHRWPPVWYLKEEDHYQRVKREREKED IALNKHRSRRKWCFWNSSPAVA >gi568815588r:60771485_61011542|GENSCAN_predicted_CDS_4|2409_bp atggactacgaaagacccaacgttgaaactatcaaatgtgtggtcgtgggtgacaatgcc gtggggaagacgcgcttgatctgtgccagggcgtgcaacaccacactcacgcagtatcag ctgctggccacccacgtgccaacagtgtgggcgattgaccagtaccgcgtgtgccaggag gtcttggagcgttctcgggatgttgttgatgaagtgagtgtttctctcaggctttgggat acttttggtgatcatcacaaagacagacgctttgcatatggcaggtcagagaaccttcag acactgaataaagcctatgttgctgacagtgataacaactacagtttaatcagggtgccc caggcaccaggcctgtgctacgtgctgcatgtgtcctgtcctgcttcactcttgtggtct gatgttgtggtcctctgtttttcgattgctaatcccaattccctaaatcatgtgaaaagc atgtggtatccagaaatcaagcacttttgccctcgaacacccgttatccttgttgggtgc cagcttgatctccgctatgccgacctggaagctgttaatcgagccaggcgcccgttagca aggcccataaagagaggggatattttgcccccagaaaaaggccgagaggtagcaaaggaa cttggcttaccatactatgaaacaagcgtgtttgaccagtttggtatcaaggatgtgttt gacaatgcaatccgagcagcgctgatttcccgcaggcacctgcaattctggaaatcccac ctaaagaaagtccagaaacctttacttcaggcacccttcctacctccaaaagcccctcca ccggtcatcaaaattccagagtgtccttccatggggacaaatgaagctgcctgtttactg gacaatcctctatgtgccgatgttctgttcatccttcaggaccaggaacacatctttgca catcgaatttacctcgctacctcttcttccaaattttatgatctgtttttaatggaatgt gaagaatccccaaatgggagtgaaggagcctgtgagaaagagaagcagagcagagatttc caggggcggatattgagtgtcgacccagaggaagaaagggaggagggcccgcctaggatt cctcaggccgaccagtggaagtcttcaaacaagagcctggtggaggctctggggctggaa gccgagggtgcagttcctgagacacagactttgaccggatggagtaaggggttcattggc atgcacagggaaatgcaagtcaaccccatttcaaagcggatggggcccatgactgtggtc aggatggacgcttcagtccagccaggcccttttcggaccctgctccagtttctttatacg ggacaactggatgaaaaggaaaaggatttggtgggcctggctcagatcgcagaggtcctc gagatgttcgatttgaggatgatggtggaaaacatcatgaacaaggaagccttcatgaac caggagattacgaaagcctttcacgtaaggaaagccaatcggataaaagagtgtctcagc aagggaacgttctcggacgtgacatttaaattggacgatggagccatcagtgcccacaag ccgctgctgatctgtagctgtgagtggatggcagccatgttcggggggtcatttgtggaa agtgccaacagtgaggtgtatctcccgaacataaacaagatatcaatgcaagcagtattg gattatctctataccaagcagttgtctcctaacttggatctggacccgctggaattaatt gccttggcaaacagattttgcctgccacacttggttgcacttgcagaacagcatgccgtt caggagttgaccaaagccgccacgagtggcgtgggcattgacggagaagtgctctcttac ttggaattggctcagacttgccctcttcttccagagaaggtccaagattcagctttagct tcaccatggagtaacttccccaaagcacaagctaaactctgcctcactttcctcatccac aggaaatgggatgacagtttgcatcctaccaaccttacagattattctgagaatgaagtg ggagagaatgttcaagagcgagcactgaagagttttcacaatgcccaccagttggccgcc tggtgtttgcaccacatctgcaccaactacaacagtgtatgctccaagttccgtaaggaa atcaaatcaaaatctgcagacaaccaggaatacttcgagcggcaccgctggccccctgtg tggtacctgaaggaagaagatcactaccagcgtgtgaaaagggaacgagagaaggaagat attgcactaaataagcatcgctcaagacgaaagtggtgcttctggaattcatctccagca gtggcctga >gi568815588r:60771485_61011542|GENSCAN_predicted_peptide_5|137_aa MVLASPQLLERPQEAYNHGGRLWASVVKNVSLLPGPILFPVDMAVCWSSVVLSFGSMEVN LAEILPVSGCGLWSLVAPMVTDALPVREQWLVQCHLSNTALVMGHTDVPQGVSLTNCRLS HTLLLSRPQGLMISHQR >gi568815588r:60771485_61011542|GENSCAN_predicted_CDS_5|414_bp atggtgctggcatctcctcagcttctggagagacctcaggaagcttataatcatggtgga agattatgggcatctgtggtgaagaatgtctccctacttcctgggccaattcttttcccc gtggacatggcagtctgctggtcatcagtggtgttatcatttggctccatggaagtgaat ttggcggaaatccttccagtttctgggtgtggcctttggtctctggtggctcccatggtt actgatgcacttcctgttagggagcaatggttggtgcaatgccatctgtcaaacactgct cttgtgatgggccacacagatgtccctcaaggtgtttctctcaccaactgcaggcttagc cacacgctgctgctttcaaggccacagggcttaatgatcagtcaccagagatag >gi568815588r:60771485_61011542|GENSCAN_predicted_peptide_6|360_aa MQAGEAAEVGEGGVGGAGRGRRCPGGFTKPAPASAARRDPEPPRELPASPGCRSSRGHQV IYSGGHQVIYNGGQRGKFLRLFSLGWKGLEGDCENTKVGNLPRVGAKWTSLSQHRKQGAK AEVRLNCCELSLGEKRTNVCKKLQSVYHYPQRNKVSSAGSCNHLTLAVLCTESFHHMSRG PNYWLFNEINCGVCICSGRHSQVVSAHKLTVSLEEREVSVRRHGLSWKTKTAHGGHSQFH LLTQPFAGGYKSPLYPGRGQRCVPTPPGGTPPAKSRRKDRNWKIGKKTPKMMDVCVSMTA TFSRCPRTLLSSDTFLSSVIKYITMPLIRLYPLKNELASFTKKAMPIGLEESNYMSPLCT >gi568815588r:60771485_61011542|GENSCAN_predicted_CDS_6|1083_bp atgcaggcgggggaggcggccgaggtgggagagggcggggtcggcggggcggggcggggt cggcgctgtcccggcggctttacaaagcccgcgccagcctcggccgcccggcgggaccca gagccgccgcgggagcttccagcgtccccaggctgccgaagctctcggggacatcaggtc atctacagtgggggacatcaggtcatctacaatggtggacagcgaggcaaattcctcagg ctattttctctgggctggaaaggtctggaaggtgactgtgaaaacaccaaagttggaaac ctacccagagtaggggcaaagtggacgtcactgtctcagcacaggaaacaaggtgctaag gcagaggttagactgaactgctgtgagctgtcactgggtgaaaagagaactaatgtttgt aagaaattacagtcagtttaccactatcctcagagaaataaagtctcttctgcaggttcc tgcaaccatttgacgttggctgttttgtgtactgagtcttttcaccacatgtcccgagga cccaattactggctttttaatgagattaactgtggggtctgtatctgttctggcagacac tcacaagtggtgtctgctcacaaactcacggtcagcctggaagagagagaagtttctgtg aggaggcatgggctgtcttggaagaccaagacagcccatggaggccacagtcaatttcac ctcctcactcagccatttgcaggaggttataaatcacctctgtacccgggtcggggtcag cggtgcgtccctacgccccctggggggacccctcccgccaagtcgcggaggaaggacaga aattggaaaattggcaaaaagacgccaaaaatgatggatgtgtgtgtgagtatgactgcc actttttccagatgtcccaggacactcctgtcttcagatactttcttgagctctgtcatc aaatacatcaccatgcctctgatcaggctttatccactaaagaacgaattagcaagtttt acaaagaaagcgatgccaataggtctagaagaaagcaattacatgtctccactctgcacc tga >gi568815588r:60771485_61011542|GENSCAN_predicted_peptide_7|356_aa MRRNQSRKPENSKNQNASSHPKKHNSSPAREQSCMENEFDELTEVGFRRITSEEKSLNDL MELKTTVQELREAYTSFNSQFDQPEERVSVIEDQINEIKREDKIKNKRVKRNEQSLQEIW DYVKRPNLRLIGVPESDGENGTKLENTLQDIIQENFPNLARQANIQIQKTQRTPQRYSLR RATSRHLIVRFTKIEMKKGMLREAREKGAPEGSTKHGREQPVPGTAKTCQIVKTIDAIKK LYQLMGDVTSWHRDGGIKFTHNNINLKSDKQLQQSLRIQNQCAKITSIPIYQEQTNREPN HELTPIHNYYKENKIPRNPTYKGCEGLLQGELQTTAQQNKRRHKQMEEHPMLMDRK >gi568815588r:60771485_61011542|GENSCAN_predicted_CDS_7|1071_bp atgaggagaaaccagagcagaaagcctgaaaattccaaaaaccagaatgcctcttctcat ccaaagaaacacaactcctcaccagcaagggagcaaagctgcatggagaatgagtttgac gagctgacagaagtaggcttcagaagaataaccagtgaagagaagagcttaaatgacctg atggagctgaaaaccacagtacaagaactgcgtgaagcatacacaagcttcaatagccaa tttgatcaaccagaagaaagggtatcagtgattgaggatcaaattaatgaaataaagaga gaagacaagattaaaaataaaagagtgaaaagaaacgaacaaagcctccaagaaatatgg gactatgtgaaaagaccaaatctacgtttgattggtgtacctgaaagtgatggggagaat ggaaccaagttagaaaacactcttcaggatattatccaggagaacttccccaacctagca aggcaggccaacattcaaattcagaaaacacagagaacaccacaaagatactccttgaga agagcaacctcaagacacctaattgtcagattcaccaagattgaaatgaagaaaggaatg ttgagggaagccagagagaaaggagctcctgaaggaagcactaaacatggaagggaacaa ccagtaccaggcactgcaaaaacatgccaaattgtaaagaccatcgatgctataaagaaa ctgtatcaattaatgggtgacgtaaccagctggcatcgtgatggcgggatcaaattcaca cataacaatattaaccttaaatctgataagcaacttcagcaaagtctcaggatacaaaat caatgtgcaaaaatcacaagcattcctatataccaagaacagacaaacagagagccaaat catgagttaactcccattcacaattactacaaagagaataaaatacctaggaatccaact tacaagggatgtgaaggacttcttcaaggagaactacaaacaactgctcaacaaaataaa agacgacacaaacaaatggaagaacatcccatgctcatggataggaagtga