GENSCAN 1.0    Date run: 4-Nov-116    Time: 23:36:09

Sequence gi568815581r:35772380_35980305 : 207926 bp : 44.89% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -   1025    801  225  0  0   85   53   329 0.942  27.06
 1.04 Intr -   6696   6509  188  2  2   95  105   144 0.996  16.13
 1.03 Intr -   6944   6865   80  2  2   72   91   135 0.995  10.55
 1.02 Intr -  13631  13522  110  0  2   82   93    70 0.961   6.90
 1.01 Init -  22998  22860  139  0  1   91   74   342 0.805  31.40
 1.00 Prom -  25595  25556   40                              -5.86

 2.02 PlyA -  25772  25767    6                               1.05
 2.01 Sngl -  37343  37185  159  2  0   99   43   178 0.780   6.98
 2.00 Prom -  39047  39008   40                              -4.46

 3.00 Prom +  46620  46659   40                              -1.26
 3.01 Init +  47755  47869  115  0  1   35   47    88 0.287  -0.23
 3.02 Intr +  47953  48058  106  2  1  101  116    90 0.998  12.37
 3.03 Intr +  50261  50454  194  2  2   51  116   156 0.998  13.84
 3.04 Intr +  62187  62219   33  1  0  135   95     2 0.893   3.79
 3.05 Intr +  63753  63862  110  1  2   97  113    23 0.961   5.70
 3.06 Intr +  66045  66174  130  2  1  101   68   116 0.979  11.17
 3.07 Intr +  69988  70080   93  2  0   70   94    39 0.664   2.64
 3.08 Intr +  71698  71765   68  2  2  101   -7    35 0.329  -6.08
 3.09 Term +  72098  72670  573  1  0  105   39   794 0.186  70.65
 3.10 PlyA +  72871  72876    6                               1.05

 4.16 PlyA -  73053  73048    6                               1.05
 4.15 Term -  83031  82684  348  0  0   45   41   218 0.808   7.29
 4.14 Intr -  83371  83285   87  1  0   68   84   102 0.985   7.97
 4.13 Intr -  83845  83794   52  0  1  100   87    33 0.994   3.31
 4.12 Intr -  84426  84353   74  0  2  120   69    63 0.994   5.80
 4.11 Intr -  85940  85821  120  2  0   67   89    94 0.992   8.09
 4.10 Intr -  86146  86054   93  1  0   96   99    56 0.995   7.66
 4.09 Intr -  86691  86509  183  0  0  107   70   149 0.979  14.98
 4.08 Intr -  90746  90616  131  0  2  104   55   103 0.208   9.01
 4.07 Intr -  91180  91123   58  1  1   45   92   122 0.928   6.76
 4.06 Intr -  91923  91867   57  0  0  109   89    38 0.954   5.08
 4.05 Intr -  92511  92379  133  2  1   67   32    26 0.333  -4.45
 4.04 Intr -  93017  92836  182  2  2   89   75   154 0.962  12.87
 4.03 Intr -  94394  94345   50  0  2   65  110    18 0.918   0.10
 4.02 Intr -  96392  96276  117  0  0  121  100   102 0.933  15.14
 4.01 Init -  96974  96902   73  2  1   65   47    19 0.105  -5.27
 4.00 Prom -  97148  97109   40                              -5.76

 5.04 PlyA -  97221  97216    6                               1.05
 5.03 Term - 100085  99998   88  1  1  124   45   132 0.977   9.63
 5.02 Intr - 106260 106149  112  1  1  116   67   153 0.916  15.44
 5.01 Init - 107926 107851   76  1  1   88  103   136 0.999  14.36
 5.00 Prom - 108015 107976   40                              -3.26

 6.11 PlyA - 108343 108338    6                               1.05
 6.10 Term - 123002 122934   69  2  0  105   47    53 0.157   0.84
 6.09 Intr - 134307 134200  108  0  0   57   99    41 0.072   2.58
 6.08 Intr - 137359 137079  281  2  2   33  105   114 0.647   4.80
 6.07 Intr - 137853 137690  164  2  2   47   76    95 0.308   3.82
 6.06 Intr - 142888 142770  119  1  2   59   50    88 0.271   1.36
 6.05 Intr - 147893 147784  110  0  2   55   44   102 0.406   2.50
 6.04 Intr - 152393 152225  169  2  1   39   66    93 0.800   1.72
 6.03 Intr - 153258 153136  123  0  0   65   92    81 0.991   6.98
 6.02 Intr - 157876 157697  180  1  0   74   98    83 0.967   7.96
 6.01 Init - 158348 158253   96  2  0   91   58    96 0.970   7.21
 6.00 Prom - 158431 158392   40                              -7.66

 7.00 Prom + 159525 159564   40                              -3.86
 7.01 Init + 159772 159790   19  0  1   81  117     6 0.255   3.24
 7.02 Intr + 162426 162556  131  1  2   73   97    13 0.275   1.11
 7.03 Term + 163975 164112  138  0  0   85   48   131 0.336   6.76
 7.04 PlyA + 164144 164149    6                               1.05

 8.05 PlyA - 164208 164203    6                              -1.95
 8.04 Term - 164454 164372   83  1  2  129   55    55 0.641   4.16
 8.03 Intr - 165537 165379  159  1  0   97   47   182 0.904  14.96
 8.02 Intr - 166978 166839  140  0  2   80   74   145 0.422  12.41
 8.01 Init - 180827 180682  146  2  2   79   98   123 0.807  11.99
 8.00 Prom - 182761 182722   40                              -3.26

 9.00 Prom + 188951 188990   40                              -6.36
 9.01 Init + 189206 189283   78  1  0   83   39    93 0.687   4.96
 9.02 Intr + 199443 199480   38  2  2   77   67    70 0.540   0.86
 9.03 Term + 201130 201202   73  1  1  111   41    92 0.858   4.18
 9.04 PlyA + 204959 204964    6                               1.05

10.03 PlyA - 205066 205061    6                               1.05
10.02 Term - 205352 205187  166  1  1   95   45   220 0.994  15.79
10.01 Intr - 205905 205764  142  1  1   49   77    73 0.480   1.71

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:35772380_35980305|GENSCAN_predicted_peptide_1|248_aa
MVARVGLLLRALQLLLWGHLDAQPAERGGQELRKEAEVRRVYLAESAIKLHEHRGKNNIV
NGATVIKLSQTIFLDVDDAPARKAFLEKYGYLNEQVPKAPTSTRFSDAIRAFQWVSQLPV
SGVLDRATLRQMTRPRCGVTDTNSYAAWAERISDLFARHRTKMRRKKRFAKQGNKWYKQH
LSYRLVNWPEHLPEPAVRGAVRAAFQLWSNVSALEFWEAPATGPADIRLTFFQGDHNDGL
GNAFDGPX

>gi568815581r:35772380_35980305|GENSCAN_predicted_CDS_1|744_bp
atggtcgcgcgcgtcggcctcctgctgcgcgccctgcagctgctactgtggggccacctg
gacgcccagcccgcggagcgcggaggccaggagctgcgcaaggaggcggaggtacgccgt
gtgtacctggcggagagcgcaataaagctccatgaacatagaggtaaaaataacattgtt
aatggggcaactgtcataaagctttcccagaccatcttcttagatgttgatgatgctcct
gcgagaaaggcattcctagagaagtacggatacctcaatgaacaggtccccaaagctccc
acctccactcgattcagcgatgccatcagagcgtttcagtgggtgtcccagctacctgtc
agcggcgtgttggaccgcgccaccctgcgccagatgactcgtccccgctgcggggttaca
gataccaacagttatgcggcctgggctgagaggatcagtgacttgtttgctagacaccgg
accaaaatgaggcgtaagaaacgctttgcaaagcaaggtaacaaatggtacaagcagcac
ctctcctaccgcctggtgaactggcctgagcatctgccggagccggcagttcggggcgcc
gtgcgcgccgccttccagttgtggagcaacgtctcagcgctggagttctgggaggcccca
gccacaggccccgctgacatccggctcaccttcttccaaggggaccacaacgatgggctg
ggcaatgcctttgatggcccagnn

>gi568815581r:35772380_35980305|GENSCAN_predicted_peptide_2|52_aa
MAAAAAAPPPQASLRAGGFSSGPSGGREDSPTPPGPSRRCRCSLKKSPTRHD

>gi568815581r:35772380_35980305|GENSCAN_predicted_CDS_2|159_bp
atggcggcggcagcggccgcccccccaccccaagcctccctcagggccggaggcttctcc
tcagggccctccggtgggcgggaagacagcccaaccccgccaggaccttctcgtcgctgc
cgctgctcgctgaagaagtcacctacccgacatgactaa

>gi568815581r:35772380_35980305|GENSCAN_predicted_peptide_3|473_aa
MKLEYLPKFQSYSGYGQTTDSSYGQNYSGYSSYGQSQSGYSQSYGGYENQKQSSYSQQPY
NNQGQQQNMESSGSQGGRAPSYDQPDYGQQDSYDQQSGYDQHQGSYDEQSNYDQQHDSYS
QNQQSYHSQRENYSHHTQGHRDYGPRTDADSESDNSDNNTIFVQGLGEGVSTDQVGEFFK
QIGIIKTNKKTGKPMINLYTDKDTGKPKGEATVSFDDPPSAKAAIDWFDGKEFHGNIIKV
SFATRRPEFMRGGGSGGGRRGRGGYRGRGGFQGRGGDPKSGDWISGGEATVERGATEVVG
AEVETEAAMVETEVGVAMVETEAAVVATAEIEVGAAMVETEVGVAMVGTEAAAMVGTEEA
AMEETEEVAMEEIEVAMEETEVEAMVETEEAMEEIEEVTEEIEEVMEEIEEAMEETEAGG
AMEETVVVAVATVETEVEAMEETGVVAAMEETEVGATEETEVAMEAKWEEGEY

>gi568815581r:35772380_35980305|GENSCAN_predicted_CDS_3|1422_bp
atgaaacttgagtatttacctaaatttcagagctattctggctatgggcaaacgactgat
tcctcttatggacagaactacagcggttactccagttatggacaaagtcagtcaggttat
tcacagtcctatggtggttatgagaatcaaaagcagagctcatatagccagcaaccatat
aataaccagggacagcagcaaaacatggaatcatcaggaagccaaggtggaagagcacct
tcctatgaccagccagactatggtcaacaagattcatatgaccagcagtcaggctatgat
caacatcaaggctcatatgatgagcagtcaaattatgatcagcagcatgattcctatagt
caaaaccagcagtcctatcattcacaaagggaaaactacagccaccacacacaaggtcac
agggattatggacccagaacagatgctgattcagaatctgataattcagataacaacaca
atctttgtgcaaggacttggggagggtgtgtctacagatcaagttggggagttctttaaa
caaataggaattatcaagacaaataagaagaccggaaaaccaatgataaatctttataca
gacaaggacacaggaaagccaaagggggaggcaacagtgtcatttgatgaccctccttca
gctaaggcagccattgactggtttgatggaaaagaattccatggcaacatcattaaagtg
tcctttgccactagaagacctgaattcatgagaggaggtggaagtggaggtgggcggcga
ggccgtggaggatatagaggtcgtggaggctttcaagggagaggtggagaccccaaaagt
ggggattggatttccgggggagaggctacggtggagagaggggctacagaggtcgtgggg
gcagaggtggagaccgaggcggctatggtggagacagaagtgggggtggctatggtggag
acagaagcagcggtggtggctacagcggagatagaagtgggggcggctatggtggagaca
gaagtgggggtggctatggtggggacagaggcggcggctatggtggggacagaggaggcg
gctatggaggagaccgaggaggtggctatggaggagatcgaggtggctatggaggagacc
gaggtggaggctatggtggagaccgaggaggctatggaggagatcgaggaggttacggag
gagatcgaggaggttatggaggagatcgaggaggctatggaggagacagaagccgggggg
gctatggaggagaccgtggtggtggcagtggctacggtggagaccgaagtggaggctatg
gaggagacaggagtggtggcggctatggaggagaccgaggtgggggctacggaggagacc
gaggtggctatggaggcaaaatgggaggaaggtgagtattag

>gi568815581r:35772380_35980305|GENSCAN_predicted_peptide_4|585_aa
MHLLSQLLGRLTQENRLNLGGGDCSSLWPHQEKKMAYEKSTDISDVSRSMFLYPWLEYPD
KTKELRKAMAPVHLPLSCYQMPKEEFPPSPECWRQHPSKPNSVPYCYFKKPEIYTHWHDL
YDQREEREAEKMLRKMRDDCRYIKEVHQTHIKMFHLPMSKLTIKSEMRSRPLEPTQDPLK
WQRLRALGCLRISDKFVMEALQQVAQTGPEKVKYEAYRTLAILGCLNKHVIRALIKQLKE
KNEGQRMETLTGLRMALNSWAAVSKDKRTQVGDEGKLVPVLQTLIKKSSSEASLEAALCL
GFLRPCSNMVQEFLLQCLCQGLKTQRMKALRMLVKVMHVHSAPVIKAILDQLCSSSVLED
RFEATQMLKTIGLEQIQAQGLEELTFNLLRRKTHNEPFLAVRQAVAQTVEELKLKPTMMN
LVEAQLMNPDATARQEAVISLGVLGIRSPQVFHLLLDLLDAENHQAVKKSLQETLILCAS
IDPWIQNKLKNKVLSVYEAPKTNVKAEPTRFQKEPENPEELTIQDFRLAKLNPLFIAKSI
TKVGQKKTPAFPPCCSKPRKHRPQVIGPWQPRIKKQLRVLAEIAK

>gi568815581r:35772380_35980305|GENSCAN_predicted_CDS_4|1758_bp
atgcacctgttgtcccagctacttgggaggctgacgcaggagaatcgcttgaacctggga
ggcggagactgcagttccctctggccccaccaggagaagaagatggcctatgaaaaatca
actgatatctctgatgtctccaggtcaatgttcctgtacccatggctggaatatccagac
aagaccaaagaactcagaaaagccatggctcctgttcatctgcccttgtcctgctaccag
atgccaaaggaagagtttcccccaagtccagagtgctggaggcagcatccgagcaagcca
aactcagtcccgtactgctacttcaagaaacctgagatctacacgcactggcacgacctg
tatgatcagcgagaggaaagggaggctgagaagatgttgaggaaaatgagagatgactgt
aggtacatcaaagaggtacatcaaacccacatcaaaatgttccatctcccaatgagcaag
ctgactataaaatctgagatgcgatccaggcccttagagcctacccaggaccccctgaag
tggcaaagattaagggctctgggatgcttacgcatcagtgacaagtttgtcatggaggca
ctacagcaggtggcccaaactggtccagagaaagtgaagtacgaggcctaccgaaccctg
gccatcctgggttgcctgaataagcatgtgatccgggctctcatcaaacagctgaaggag
aaaaatgagggtcaaaggatggagactttgacggggctacgaatggctcttaactcctgg
gctgctgtctctaaagacaagaggactcaagtcggggatgagggcaagctggtgcctgta
ctacagacactgatcaagaagtcgtccagtgaagcatctctggaggcagccctgtgcctg
ggtttcctgaggccttgcagcaacatggtccaagagttcttgttgcagtgcctgtgccaa
ggactcaagacccagcggatgaaggcacttaggatgctggtcaaggtgatgcacgtgcac
tcagccccagtcatcaaggccatcctagaccagctgtgttcttccagtgtccttgaggac
cgctttgaagccacccaaatgctcaagaccattgggctggaacagatccaggcacagggg
ctagaggaactcacatttaacctgctcaggaggaagacgcataatgaacccttccttgct
gtgaggcaggctgtggctcaaactgtggaagagctcaagttgaagcctacgatgatgaac
ttggtggaggcacaactgatgaacccagatgccactgcacgccaggaagcagtcatctct
ttgggtgtcctggggatccgcagtccacaagtgttccacttgctcctggacttactagat
gcagaaaaccaccaggctgtgaagaagagtctacaagaaacattaatcctttgtgcctca
attgatccctggatccaaaacaagctgaaaaacaaggttctctctgtatatgaggcacct
aagaccaatgtgaaggcagagcccacaaggttccagaaagagcctgagaacccagaagag
ttaactattcaagactttcgacttgcaaagctgaaccccttgtttattgcaaagtccatc
accaaagtaggccaaaagaaaacgcctgctttcccaccgtgctgctcgaaaccacgaaaa
cataggccacaggtcatagggccctggcagccaaggatcaagaaacagctccgggtcctt
gctgaaattgccaaataa

>gi568815581r:35772380_35980305|GENSCAN_predicted_peptide_5|91_aa
MKVSAAALAVILIATALCAPASASPYSSDTTPCCFAYIARPLPRAHIKEYFYTSGKCSNP
AVVFVTRKNRQVCANPEKKWVREYINSLEMS

>gi568815581r:35772380_35980305|GENSCAN_predicted_CDS_5|276_bp
atgaaggtctccgcggcagccctcgctgtcatcctcattgctactgccctctgcgctcct
gcatctgcctccccatattcctcggacaccacaccctgctgctttgcctacattgcccgc
ccactgccccgtgcccacatcaaggagtatttctacaccagtggcaagtgctccaaccca
gcagtcgtctttgtcacccgaaagaaccgccaagtgtgtgccaacccagagaagaaatgg
gttcgggagtacatcaactctttggagatgagctag

>gi568815581r:35772380_35980305|GENSCAN_predicted_peptide_6|472_aa
MAELVPFAVPIESDKTLLVWELSSGPTAEALHHSLFTAFSQFGLLYSVRVFPNAAVAHPG
FYAVIKFYSARAAHRAQKACDRKQLFQKSPVKVRLGTRHKAVQHQALALNSSKCQELANY
YFGFNGCSKRIIKLQELSDLEERENEDSMVPLPKQSLKFFCALEVVLPSCDCRSPGIGLV
EEPMDKVEEESGKIAVEYRPSEDIVGVRCEEELHGLIQVCEDKNSGQFQHLKDQQEMIIQ
QLNTPENDELPPVPQEPTTQSPAQTLAPSGSGTLSNSAKLSSSDSIPPMEAEPSPNQQEA
TVQASEPPKNIELSSQQMVPENIFPPTMENSNQLPEPPTEVVAQLPPRYEVTIPTQGQDQ
AQLSTLASVTLQPLDLGFIITPESTTEIELSPTMQETPTQPPKEFVPQPPVYQESHRKRL
DLYQFNRRLQLNLQNLLKMRTPLQYSRRLQGIQSQQYDISGTLTSPSQEASK

>gi568815581r:35772380_35980305|GENSCAN_predicted_CDS_6|1419_bp
atggcggagttggtaccttttgcggttcccatcgagagtgacaaaaccttgctagtgtgg
gagctgagctccggacccacggccgaggctttgcatcattctctgttcacagcattttct
cagtttggccttctgtattcagtccgggtcttcccaaatgctgcagtggcccatcctggt
ttctatgccgtcattaagttttattctgcaagggctgcccacagagcccaaaaggcatgc
gaccggaagcagctttttcagaaatctccagtcaaggttcgtcttggcaccagacataag
gcagttcaacatcaagcccttgccctgaacagttccaaatgccaagaactggcgaattac
tactttggtttcaatgggtgttccaaaaggatcatcaagcttcaggagctttctgacctt
gaagaaagggaaaatgaagatagcatggtgccacttccgaagcaaagcctgaagttcttc
tgtgctttagaagtggtgttgccatcctgtgattgcaggagtcctggcattggcttggtg
gaggagcctatggataaggtggaggaagaaagtggtaaaatagctgtggagtacagaccc
agtgaagacatcgtaggtgtcagatgcgaagaagaactacacggtttaattcaagtatgt
gaagataaaaactcagggcagtttcagcatttgaaagaccagcaagaaatgattattcag
cagctaaatacccctgaaaatgatgaacttcctccagtccctcaagagcccacaactcag
tcaccagctcagactttagctccctcaggaagtggaaccctctctaactcagcaaaactt
tccagctcagactccataccccctatggaggcagagccttctccaaaccagcaggaggcc
acagttcaggcttcagagccccccaagaatatagaactttcaagccagcagatggtccca
gagaatatatttcctccaaccatggagaactcaaatcaacttccagaaccacctacggag
gttgtagctcaacttccacctcgttatgaggtgacaattccaacacaaggtcaggatcaa
gctcagctttcaacactggccagtgtcacacttcaacctttggacctggggtttatcatc
actccagaatccactacagaaattgaactttctccaaccatgcaggagaccccaactcag
cctcctaaggaatttgtaccccaacctccagtatatcaagagagtcatcggaagagactg
gacctttaccagttcaacaggagacttcagctgaatctccagaacctactaaagatgaga
acccctctccaatacagtaggaggctgcagggtatacagtctcagcagtacgacatatca
gggacgctaacttctcccagccaagaagcaagcaaatga

>gi568815581r:35772380_35980305|GENSCAN_predicted_peptide_7|95_aa
MGIVGSGGILSASRRVAGLNSATSILPCGEKRHGSCLTLSNSSTHMTQFLTVEQRKSAER
EVKQMHQEKLKPAMDSVLTISSPGSMLFLRPSCSP

>gi568815581r:35772380_35980305|GENSCAN_predicted_CDS_7|288_bp
atgggcatagtgggctccggcggcatcctgtcagccagtagaagagtggccggcctgaac
agtgcaacctccattctaccctgcggagaaaaaagacatggttcttgcctcacactcagt
aactcttccacacacatgacccagttcttgactgtggagcagagaaagtctgcggagaga
gaagtgaagcaaatgcaccaagagaagctgaaacctgccatggacagtgtgctgacaatt
tccagccctggttccatgctcttcctgaggcccagctgcagcccttga

>gi568815581r:35772380_35980305|GENSCAN_predicted_peptide_8|175_aa
MRKNQCKNAENSKNQNASSPPNDRNTAPARAQNWMENEIDKLTEVGFRRMTKALLIYLVS
SFLALNQASLISRCDLAQVLQLEDLDGFEGYSLSDWLCLAFVESKFNISKINENADGSFD
YGLFQINSHYWCNDYKSYSENLCHVDCQDLLNPNLLAGIHCAKRIVSGARGMNNW

>gi568815581r:35772380_35980305|GENSCAN_predicted_CDS_8|528_bp
atgaggaaaaaccaatgcaaaaacgctgaaaattccaaaaaccagaatgcctcttctcct
ccaaatgatcgcaacaccgctccagcaagggcacaaaactggatggagaatgagattgac
aaattgacagaagtaggcttcagaaggatgacaaaggcgctactcatctatttggtcagc
agctttcttgccctaaatcaggccagcctcatcagtcgctgtgacttggcccaggtgctg
cagctggaggacttggatgggtttgagggttactccctgagtgactggctgtgcctggct
tttgtggaaagcaagttcaacatatcaaagataaatgaaaatgcagacggaagctttgac
tatggcctcttccagatcaacagccactactggtgcaacgattataagagttactcggaa
aacctttgccacgtagactgtcaagatctgctgaatcccaaccttcttgcaggcatccac
tgcgcaaaaaggattgtgtccggagcacgggggatgaacaactggtga

>gi568815581r:35772380_35980305|GENSCAN_predicted_peptide_9|62_aa
MKNQIEKQYDIDETQKRTDRNFQENKILYQKLAVVTAALPLNSDSLALTLVLAKDFETSC
RI

>gi568815581r:35772380_35980305|GENSCAN_predicted_CDS_9|189_bp
atgaagaatcaaatcgagaaacaatatgacatagatgaaacacagaagagaactgacagg
aacttccaggagaacaagatcctgtaccagaagctggcagttgtcacagcagccctgcct
ttgaactctgactccctggctttgaccctggtgctggctaaggactttgaaacttcttgc
cgcatctga

>gi568815581r:35772380_35980305|GENSCAN_predicted_peptide_10|102_aa
XCNVIPSEVPEWVNTPSTCCLKYYEKVLPRRLVVGYRKALNCHLPAIIFVTKRNREVCTN
PNDDWVQEYIKDPNLPLLPTRNLSTVKIITAKNGQPQLLNSQ

>gi568815581r:35772380_35980305|GENSCAN_predicted_CDS_10|309_bp
ncttgcaatgtgattccttcagaagttcctgagtgggtgaacaccccatccacctgctgc
ctgaagtattatgagaaagtgttgccaaggagactagtggtgggatacagaaaggccctc
aactgtcacctgccagcaatcatcttcgtcaccaagaggaaccgagaagtctgcaccaac
cccaatgacgactgggtccaagagtacatcaaggatcccaacctacctttgctgcctacc
aggaacttgtccacggttaaaattattacagcaaagaatggtcaaccccagctcctcaac
tcccagtga