GENSCAN 1.0 Date run: 8-Nov-116 Time: 10:38:25 Sequence gi568815583r:78496607_78718918 : 222312 bp : 44.79% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 10990 11065 76 0 1 63 105 52 0.446 3.49 1.02 Intr + 16478 16819 342 0 0 123 82 243 0.963 22.60 1.03 Intr + 18416 18501 86 0 2 44 110 69 0.692 4.24 1.04 Intr + 30792 30957 166 2 1 15 82 163 0.843 7.93 1.05 Term + 36604 37064 461 2 2 123 42 279 0.994 22.05 1.06 PlyA + 38016 38021 6 1.05 2.00 Prom + 38521 38560 40 -4.76 2.01 Init + 42732 42804 73 2 1 85 66 65 0.787 5.23 2.02 Intr + 43814 43933 120 0 0 57 93 136 0.906 11.47 2.03 Intr + 45877 46039 163 2 1 83 80 107 0.974 8.43 2.04 Intr + 48263 48351 89 2 2 100 53 33 0.990 0.61 2.05 Intr + 49028 49158 131 0 2 67 68 89 0.996 5.21 2.06 Intr + 49969 50092 124 0 1 71 86 57 0.995 3.96 2.07 Term + 52184 52338 155 0 2 71 39 137 0.903 5.18 2.08 PlyA + 52565 52570 6 1.05 3.00 Prom + 64558 64597 40 -4.26 3.01 Init + 69114 69219 106 2 1 102 99 200 0.965 20.88 3.02 Intr + 84205 84356 152 2 2 47 66 83 0.038 1.88 3.03 Intr + 91708 91817 110 0 2 50 97 47 0.864 0.88 3.04 Intr + 93199 94030 832 1 1 63 58 424 0.301 28.30 3.05 Term + 96486 96647 162 2 0 121 48 11 0.225 -1.66 3.06 PlyA + 97494 97499 6 1.05 4.08 PlyA - 97541 97536 6 1.05 4.07 Term - 100126 99998 129 1 0 112 47 28 0.636 -0.72 4.06 Intr - 105658 104647 1012 0 1 122 94 1392 0.996 133.90 4.05 Intr - 111918 111733 186 2 0 74 39 110 0.018 3.50 4.04 Intr - 120527 120418 110 2 2 53 80 110 0.678 5.78 4.03 Intr - 122055 122011 45 0 0 122 108 58 0.999 9.81 4.02 Intr - 122309 122170 140 0 2 38 89 179 0.809 13.18 4.01 Init - 124188 124107 82 0 1 96 101 313 0.994 32.53 4.00 Prom - 126734 126695 40 -5.96 5.08 PlyA - 126802 126797 6 1.05 5.07 Term - 128685 128527 159 0 0 137 55 240 0.998 23.54 5.06 Intr - 130789 130688 102 1 0 3 30 146 0.594 0.67 5.05 Intr - 133339 132361 979 0 1 52 84 1602 0.979 147.14 5.04 Intr - 134579 134470 110 2 2 56 89 212 0.590 17.18 5.03 Intr - 134726 134682 45 2 0 112 111 70 0.998 10.31 5.02 Intr - 138981 138833 149 1 2 64 66 247 0.627 20.05 5.01 Init - 144527 144473 55 2 1 90 101 124 0.905 13.35 5.00 Prom - 152859 152820 40 -4.26 6.07 PlyA - 153407 153402 6 -0.45 6.06 Term - 154931 154705 227 0 2 -12 38 172 0.089 -0.86 6.05 Intr - 156985 156865 121 1 1 24 94 70 0.173 1.27 6.04 Intr - 157747 157557 191 2 2 56 119 32 0.239 2.40 6.03 Intr - 164662 164467 196 1 1 87 33 305 0.462 23.89 6.02 Intr - 164810 164709 102 2 0 47 93 117 0.284 8.47 6.01 Init - 186751 186635 117 1 0 87 78 61 0.589 5.21 6.00 Prom - 221209 221170 40 -3.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:78496607_78718918|GENSCAN_predicted_peptide_1|376_aa RRCAAEAALPVCGKAGSTPGRRVAADIMSSGNYQQSEALSKPTFSEEQASALVESVFGLK VSKVRPLPSYDDQNFHVYVSKTKDGPTEYVLKISNTKASKNPDLIEVQNHIIMFLKAAGF PTASVCHTKGDNTASLVSVGRPIAELPVSPQLLYEIGKLAAKLDKTLQLSSLHRENFIWN LKNVPLLEKYLYALGQNRNREIVEHVIHLFKEEVMTKLSHFRECINHGDLNDHNILIESS KSASGNAEYQVSGILDFGDMSYGYYVFEVAITIMYMMIESKSPIQVGGHVLAGFESITPL TAVEKGALFLLVCSRFCQSLVMAAYSCQLYPENKDYLMVTAKTGWKHLQQMFDMGQKAVE EIWFETAKSYESGISM >gi568815583r:78496607_78718918|GENSCAN_predicted_CDS_1|1131_bp cgccggtgcgcggccgaggccgcactacctgtctgcgggaaagcgggatccaccccagga cgtcgggtcgctgccgacataatgtcaagtggaaactatcagcagtcagaggctcttagc aaacccactttcagtgaggaacaagcctctgcgttagtggagtcagtgtttgggttgaaa gtttccaaggtccggccacttcctagctatgatgaccaaaactttcatgtctacgtttca aaaaccaaagatggcccaactgaatatgtcctcaaaataagcaacaccaaggctagcaaa aatccagacctgattgaagtgcagaatcacatcatcatgtttctgaaagccgctggattt ccaacagcctctgtgtgtcacactaaaggagacaacacagcttctctcgtgtctgtagga agacccatcgctgagcttcccgtcagcccccagctattgtatgaaattggaaaactagct gccaaattggataagacactgcagttaagtagtcttcatcgggagaacttcatctggaat ctgaaaaatgttcctcttctggagaaatacctgtatgccctgggccagaatcgaaaccga gagattgttgagcatgtcattcatctgttcaaggaggaagtaatgaccaaattaagtcat tttcgagaatgtatcaatcacggagatcttaatgaccataatattttaatagagtccagc aagtcagcctctggaaatgctgaatatcaagtgtctgggattttagactttggtgacatg agctatggctactatgtgtttgaagtggcaattaccatcatgtacatgatgattgagagc aagagtcctatacaagtaggaggccatgtccttgcagggtttgaaagcatcaccccactg acagctgtagagaagggtgctttgtttttacttgtatgcagtcgtttttgtcagtcactt gtcatggctgcatactcttgccagctatacccagagaacaaagactatctcatggttact gcaaaaaccgggtggaaacacttacagcaaatgtttgacatgggtcagaaagctgtagaa gaaatctggtttgaaactgccaaatcctatgaatctgggatctccatgtga >gi568815583r:78496607_78718918|GENSCAN_predicted_peptide_2|284_aa MDNEHRNKYDNFSKEDKMVRVYADAVLRMRGGHISSGYSVSGGGLFFRGVKGSVDISGLQ GLPSGRLYQVEYAMEAIGHAGTCLGILANDGVLLAAERRNIHKLLDEVFFSEKIYKLNEY LLQYQEPIPCEQLVTALCDIKQAYTQFGGKRPFGVSLLYIGWDKHYGFQLYQSDPSGNYG GWKATCIGNNSAAAVSMLKQDYKEGEMTLKSALALAIKVLNKTMDVSKLSAEKVEIATLT RENGKTVIRVLKQKEVEQLIKKHEEEEAKAEREKKEKEQKEKDK >gi568815583r:78496607_78718918|GENSCAN_predicted_CDS_2|855_bp atggataatgaacacagaaacaagtatgacaatttcagtaaagaagataaaatggtgcga gtatatgctgacgcggttctgcgcatgcgcgggggccatattagcagcggttattcggtg agcggtggtggtttattcttccgtggagttaagggctccgtggacatctcaggtcttcag ggtcttccatctggtcgcttataccaagttgaatatgccatggaagctattggacatgca ggcacctgtttgggaattttagcaaatgatggtgttttgcttgcagcagagagacgcaac atccacaagcttcttgatgaagtctttttttctgaaaaaatttataaactcaatgagtat ttattacagtatcaggagccaataccttgtgagcagttggttacagcgctgtgtgatatc aaacaagcttatacacaatttggaggaaaacgtccctttggtgtttcattgctgtacatt ggctgggataagcactatggctttcagctctatcagagtgaccctagtggaaattacggg ggatggaaggccacatgcattggaaataatagcgctgcagctgtgtcaatgttgaaacaa gactataaagaaggagaaatgaccttgaagtcagcacttgctttagctatcaaagtacta aataagaccatggatgttagtaaactctctgctgaaaaagtggaaattgcaacactaaca agagagaatggaaagacagtaatcagagttctcaaacaaaaagaagtggagcagttgatc aaaaaacatgaggaagaagaagccaaagctgagcgtgagaagaaagaaaaagaacagaaa gaaaaggataaatag >gi568815583r:78496607_78718918|GENSCAN_predicted_peptide_3|453_aa MAARGSGPRALRLLLLVQLVAGRCGLAGAAGGAQRGLSEPSSIAKHEDSLLKDLFQDYER WVRPVEHLNDKIKIKFGLAISQLVDVEWIDVKLRWNPDDYGGIKVIRVPSDSVWTPDIVL FDNADGRFEGTSTKTVIRYNGTVTWTPPANYKSSCTIDVTFFPFDLQNCSMKFGSWTYDG SQVDIILEDQDVDKRDFFDNGEWEIVSATGSKGNRTDSCCWYPYVTYSFVIKRLPLFYTL FLIIPCIGLSFLTVLVFYLPSNEGEKICLCTSVLVSLTVFLLVIEEIIPSSSKVIPLIGE YLVFTMIFVTLSIMVTVFAINIHHRSSSTHNAMAPLVRKIFLHTLPKLLCMRSHVDRYFT QKEETESGSGPKSSRNTLEAALDSIRYITRHIMKENDVREVVEDWKFIAQVLDRMFLWTF LFVSIVGSLGLFVPVIYKWANILIPVHIGNANK >gi568815583r:78496607_78718918|GENSCAN_predicted_CDS_3|1362_bp atggcggcgcgggggtcagggccccgcgcgctccgcctgctgctcttggtccagctggtc gcggggcgctgcggtctagcgggcgcggcgggcggcgcgcagagaggattatctgaacct tcttctattgcaaaacatgaagatagtttgcttaaggatttatttcaagactacgaaaga tgggttcgtcctgtggaacacctgaatgacaaaataaaaataaaatttggacttgcaata tctcaattggtggatgtggaatggatagatgtaaaattaagatggaaccctgatgactat ggtggaataaaagttatacgtgttccttcagactctgtctggacaccagacatcgttttg tttgataatgcagatggacgttttgaagggaccagtacgaaaacagtcatcaggtacaat ggcactgtcacctggactccaccggcaaactacaaaagttcctgtaccatagatgtcacg tttttcccatttgaccttcagaactgttccatgaaatttggttcttggacttatgatgga tcacaggttgatataattctagaggaccaagatgtagacaagagagatttttttgataat ggagaatgggagattgtgagtgcaacagggagcaaaggaaacagaaccgacagctgttgc tggtatccgtatgtcacttactcatttgtaatcaagcgcctgcctctcttttataccttg ttccttataataccctgtattgggctctcatttttaactgtacttgtcttctatcttcct tcaaatgaaggtgaaaagatttgtctctgcacttcagtacttgtgtctttgactgtcttc cttctggttattgaagagatcataccatcatcttcaaaagtcatacctctaattggagag tatctggtatttaccatgatttttgtgacactgtcaattatggtaaccgtcttcgctatc aacattcatcatcgttcttcctcaacacataatgccatggcgcctttggtccgcaagata tttcttcacacgcttcccaaactgctttgcatgagaagtcatgtagacaggtacttcact cagaaagaggaaactgagagtggtagtggaccaaaatcttctagaaacacattggaagct gcgctcgattctattcgctacattacaagacacatcatgaaggaaaatgatgtccgtgag gttgttgaagattggaaattcatagcccaggttcttgatcggatgtttctgtggactttt cttttcgtttcaattgttggatctcttgggctttttgttcctgttatttataaatgggca aatatattaataccagttcatattggaaatgcaaataagtga >gi568815583r:78496607_78718918|GENSCAN_predicted_peptide_4|567_aa MGSGPLSLPLALSPPRLLLLLLLSLLPVARASEAEHRLFERLFEDYNEIIRPVANVSDPV IIHFEVSMSQLVKVDEVNQIMETNLWLKQIWNDYKLKWNPSDYGGAEFMRVPAQKIWKPD IVLYNKQDPQLQVCWSLLEVHSRPCLPGYQQRWLQNSGGCRTVDLGDPQMLLPDRSSGSF VSEEYPAVAVGDFQVDDKTKALLKYTGEVTWIPPAIFKSSCKIDVTYFPFDYQNCTMKFG SWSYDKAKIDLVLIGSSMNLKDYWESGEWAIIKAPGYKHDIKYNCCEEIYPDITYSLYIR RLPLFYTINLIIPCLLISFLTVLVFYLPSDCGEKVTLCISVLLSLTVFLLVITETIPSTS LVIPLIGEYLLFTMIFVTLSIVITVFVLNVHYRTPTTHTMPSWVKTVFLNLLPRVMFMTR PTSNEGNAQKPRPLYGAELSNLNCFSRAESKGCKEGYPCQDGMCGYCHHRRIKISNFSAN LTRSSSSESVDAVLSLSALSPEIKEAIQSVKYIAENMKAQNEAKEIQDDWKYVAMVIDRI FLWVFTLVCILGTAGLFLQPLMAREDA >gi568815583r:78496607_78718918|GENSCAN_predicted_CDS_4|1704_bp atgggctctggcccgctctcgctgcccctggcgctgtcgccgccgcggctgctgctgctg ctgctgctgtctctgctgccagtggccagggcctcagaggctgagcaccgtctatttgag cggctgtttgaagattacaatgagatcatccggcctgtagccaacgtgtctgacccagtc atcatccatttcgaggtgtccatgtctcagctggtgaaggtggatgaagtaaaccagatc atggagaccaacctgtggctcaagcaaatctggaatgactacaagctgaaatggaacccc tctgactatggtggggcagagttcatgcgtgtccctgcacagaagatctggaagccagac attgtgctgtataacaaacaggaccctcagctgcaggtctgttggagtttgctagaggtc cactccagaccctgtttgcctgggtatcagcagcggtggctgcagaacagcggtggctgt agaacagtggatcttggtgacccacagatgctgctgcctgatcgttcctctggaagtttt gtatcagaggagtacccggccgttgctgttggggatttccaggtggacgacaagaccaaa gccttactcaagtacactggggaggtgacttggatacctccggccatctttaagagctcc tgtaaaatcgacgtgacctacttcccgtttgattaccaaaactgtaccatgaagttcggt tcctggtcctacgataaggcgaaaatcgatctggtcctgatcggctcttccatgaacctc aaggactattgggagagcggcgagtgggccatcatcaaagccccaggctacaaacacgac atcaagtacaactgctgcgaggagatctaccccgacatcacatactcgctgtacatccgg cgcctgcccttgttctacaccatcaacctcatcatcccctgcctgctcatctccttcctc actgtgctcgtcttctacctgccctccgactgcggtgagaaggtgaccctgtgcatttct gtcctcctctccctgacggtgtttctcctggtgatcactgagaccatcccttccacctcg ctggtcatccccctgattggagagtacctcctgttcaccatgatttttgtaaccttgtcc atcgtcatcaccgtcttcgtgctcaacgtgcactacagaaccccgacgacacacacaatg ccctcatgggtgaagactgtattcttgaacctgctccccagggtcatgttcatgaccagg ccaacaagcaacgagggcaacgctcagaagccgaggcccctctacggtgccgagctctca aatctgaattgcttcagccgcgcagagtccaaaggctgcaaggagggctacccctgccag gacgggatgtgtggttactgccaccaccgcaggataaaaatctccaatttcagtgctaac ctcacgagaagctctagttctgaatctgttgatgctgtgctgtccctctctgctttgtca ccagaaatcaaagaagccatccaaagtgtcaagtatattgctgaaaatatgaaagcacaa aatgaagccaaagagattcaagatgattggaagtatgttgccatggtgattgatcgtatt tttctgtgggttttcaccctggtgtgcattctagggacagcaggattgtttctgcaaccc ctgatggccagggaagatgcataa >gi568815583r:78496607_78718918|GENSCAN_predicted_peptide_5|532_aa MRRAPSLVLFFLVALCGRGNCRVANAEEKLMDDLLNKTRYNNLIRPATSSSQLISIKLQL SLAQLISVNEREQIMTTNVWLKQEWTDYRLTWNSSRYEGVNILRIPAKRIWLPDIVLYNN ADGTYEVSVYTNLIVRSNGSVLWLPPAIYKSACKIEVKYFPFDQQNCTLKFRSWTYDHTE IDMVLMTPTASMDDFTPSGEWDIVALPGRRTVNPQDPSYVDVTYDFIIKRKPLFYTINLI IPCVLTTLLAILVFYLPSDCGEKMTLCISVLLALTFFLLLISKIVPPTSLDVPLIGKYLM FTMVLVTFSIVTSVCVLNVHHRSPSTHTMAPWVKRCFLHKLPTFLFMKRPGPDSSPARAF PPSKSCVTKPEATATSTSPSNFYGNSMYFVNPASAASKSPAGSTPVAIPRDFWLRSSGRF RQDVQEALEGVSFIAQHMKNDDEDQSFLDQQHRHRLRAYRKCRVSGPNPDLLAPNLHFNQ VVEDWKYVAMVVDRLFLWVFMFVCVLGTVGLFLPPLFQTHAASEGPYAAQRD >gi568815583r:78496607_78718918|GENSCAN_predicted_CDS_5|1599_bp atgaggcgcgcgccttccctggtccttttcttcctggtcgccctttgcgggcgcgggaac tgccgcgtggccaatgcggaggaaaagctgatggacgaccttctgaacaaaacccgttac aataacctgatccgcccagccaccagctcctcacagctcatctccatcaagctgcagctc tccctggcccagcttatcagcgtgaatgagcgagagcagatcatgaccaccaatgtctgg ctgaaacaggaatggactgattaccgcctgacctggaacagctcccgctacgagggtgtg aacatcctgaggatccctgcaaagcgcatctggttgcctgacatcgtgctttacaacaac gccgacgggacctatgaggtgtctgtctacaccaacttgatagtccggtccaacggcagc gtcctgtggctgccccctgccatctacaagagcgcctgcaagattgaggtgaagtacttt cccttcgaccagcagaactgcaccctcaagttccgctcctggacctatgaccacacggag atagacatggtcctcatgacgcccacagccagcatggatgactttactcccagtggtgag tgggacatagtggccctcccagggagaaggacagtgaacccacaagaccccagctacgtg gacgtgacttacgacttcatcatcaagcgcaagcctctgttctacaccatcaacctcatc atcccctgcgtgctcaccaccttgctggccatcctcgtcttctacctgccatccgactgc ggcgagaagatgacactgtgcatctcagtgctgctggcactgacattcttcctgctgctc atctccaagatcgtgccacccacctccctcgatgtgcctctcatcggcaagtacctcatg ttcaccatggtgctggtcaccttctccatcgtcaccagcgtctgtgtgctcaatgtgcac caccgctcgcccagcacccacaccatggcaccctgggtcaagcgctgcttcctgcacaag ctgcctaccttcctcttcatgaagcgccctggccccgacagcagcccggccagagccttc ccgcccagcaagtcatgcgtgaccaagcccgaggccaccgccacctccaccagcccctcc aacttctatgggaactccatgtactttgtgaaccccgcctctgcagcttccaagtctcca gccggctctaccccggtggctatccccagggatttctggctgcggtcctctgggaggttc cgacaggatgtgcaggaggcattagaaggtgtcagcttcatcgcccagcacatgaagaat gacgatgaagaccagagtttcttggaccagcagcaccggcatcgcctgcgagcttatcgg aaatgcagagtctcaggccccaatccagacctgctagctccaaacctgcattttaaccag gtcgttgaggactggaagtacgtggctatggtggtggaccggctgttcctgtgggtgttc atgtttgtgtgcgtcctgggcactgtggggctcttcctaccgcccctcttccagacccat gcagcttctgaggggccctacgctgcccagcgtgactga >gi568815583r:78496607_78718918|GENSCAN_predicted_peptide_6|317_aa MAPTKRDTDPRGVAEEVEQLSPTLQRTPVLGPLEHTAWEMIRKMKLPGRENKMAVVLGTI MDDVRVQEVLNLKGGAGGKILTFDQLALDSPKGSSTIPLFGSLKGQEVYRHFGKAPGTPH SHTKAYVRSKGRKFKRARGVSIENTSNKGNRRKSDPIRADLKINICPYVEHPAMLPQFSL SIRLSQQRAKTLVQSIRLLLSLLGGLNISAVVTVTKGKKNTQQVFRQHCQPLPGCQWTAG EPASGGTTKGKALERSLGSCALQSSHICGSAQNAYGRLRVSAGHMVSSRGEELDSGQSED KLAFASTTTSVTAETKG >gi568815583r:78496607_78718918|GENSCAN_predicted_CDS_6|954_bp atggcacccacaaagagggacactgacccccggggtgtggcagaggaggtggaacagctg agccccacacttcagagaacccctgtcctgggcccactggagcacacagcatgggagatg atccggaagatgaagcttcctggccgggaaaacaaaatggccgtggtcttggggaccata atggatgatgtgcgggttcaggaggttctcaacctgaagggcggggcagggggcaagatc ctcacttttgaccagctggccctggactcccccaagggcagcagcaccattccgctcttt ggttctctcaagggccaagaggtgtaccggcatttcggcaaggccccaggaacgccgcac agccacaccaaagcctacgtccgctccaagggccggaagttcaagcgcgccagaggtgtc tccattgaaaatacttccaacaaaggaaacagaaggaaatcagaccccatcagggcagac ttaaaaatcaacatctgcccttacgtggagcacccagcaatgttacctcaattctctttg agtattaggctttcccagcaacgtgccaaaaccctggtacagagcattaggcttcttctc tccctgctaggaggtttgaatatctctgcagtggtcacagtcacaaaagggaagaaaaac acccagcaggttttccgccagcactgccagcctttgcctggctgtcagtggacagctggg gagccagcttctggtgggaccaccaagggcaaggccctggagaggtctctgggcagctgt gccttgcagagctcccacatctgtggatctgcacaaaatgcctatggtagacttagggtc agtgcaggtcatatggtcagcagtcggggagaagagctggactcagggcagagtgaggac aagctggcatttgccagcaccactacatctgtcactgctgaaaccaaggggtga