GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:01:50 Sequence gi568815592r:35475838_35742824 : 266987 bp : 45.50% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 Intr - 255 82 174 0 0 106 92 289 0.999 31.24 1.10 Intr - 568 444 125 2 2 94 72 78 0.162 7.10 1.09 Intr - 1535 1474 62 1 2 81 92 85 0.735 6.58 1.08 Intr - 2734 2571 164 1 2 83 46 138 0.655 7.87 1.07 Intr - 3479 3468 12 2 0 144 119 12 0.964 4.88 1.06 Intr - 4285 4223 63 1 0 146 102 12 0.990 7.31 1.05 Intr - 4519 4475 45 1 0 17 95 110 0.518 3.21 1.04 Intr - 8787 8723 65 1 2 91 80 27 0.364 0.64 1.03 Intr - 10874 10624 251 1 2 106 86 437 0.551 42.38 1.02 Intr - 12024 11999 26 0 2 120 44 37 0.629 -0.68 1.01 Init - 17324 17190 135 2 0 69 48 122 0.635 6.44 1.00 Prom - 17456 17417 40 -7.76 2.14 PlyA - 22059 22054 6 1.05 2.13 Term - 22623 22490 134 1 2 90 46 389 0.998 33.15 2.12 Intr - 24315 24144 172 0 1 73 84 309 0.979 28.52 2.11 Intr - 27820 27722 99 1 0 102 56 207 0.916 19.21 2.10 Intr - 28011 27900 112 2 1 102 45 202 0.967 17.68 2.09 Intr - 30016 29904 113 1 2 141 59 109 0.999 12.48 2.08 Intr - 30336 30166 171 0 0 78 98 251 0.991 25.24 2.07 Intr - 33475 33372 104 2 2 62 53 120 0.923 5.79 2.06 Intr - 33913 33797 117 2 0 46 40 101 0.737 1.54 2.05 Intr - 34091 33990 102 0 0 112 105 12 0.977 5.45 2.04 Intr - 35173 35024 150 2 0 39 47 238 0.973 14.93 2.03 Intr - 35969 35811 159 0 0 47 93 148 0.940 11.16 2.02 Intr - 36433 36343 91 1 1 109 64 41 0.960 3.37 2.01 Init - 52219 52061 159 1 0 79 48 107 0.130 5.62 2.00 Prom - 67046 67007 40 -5.96 3.03 PlyA - 67148 67143 6 1.05 3.02 Term - 69088 68933 156 1 0 89 48 165 0.619 10.53 3.01 Init - 73488 73486 3 0 0 87 53 0 0.148 -3.60 3.00 Prom - 77450 77411 40 -7.86 4.00 Prom + 78530 78569 40 -4.16 4.01 Init + 80036 80273 238 1 1 117 62 289 0.877 27.37 4.02 Intr + 81327 81491 165 1 0 -5 87 113 0.441 1.83 4.03 Term + 93464 93627 164 0 2 97 47 53 0.325 0.20 4.04 PlyA + 96487 96492 6 1.05 5.09 PlyA - 96503 96498 6 1.05 5.08 Term - 100105 99998 108 1 0 115 48 134 0.874 10.61 5.07 Intr - 101396 101157 240 2 0 27 93 375 0.999 29.65 5.06 Intr - 104384 104199 186 2 0 54 87 149 0.993 11.29 5.05 Intr - 111280 111197 84 1 0 54 93 83 0.549 5.42 5.04 Intr - 115383 115293 91 2 1 94 100 54 0.978 7.20 5.03 Intr - 121567 121411 157 2 1 110 121 131 0.925 17.67 5.02 Intr - 143373 143259 115 0 1 119 106 108 0.964 15.72 5.01 Init - 147946 147944 3 1 0 108 81 0 0.934 1.30 5.00 Prom - 151440 151401 40 -3.76 6.04 PlyA - 153189 153184 6 1.05 6.03 Term - 154291 154230 62 2 2 140 43 -39 0.195 -5.23 6.02 Intr - 161321 161177 145 2 1 106 116 69 0.986 11.36 6.01 Init - 166987 166883 105 1 0 62 100 93 0.980 8.12 6.00 Prom - 205835 205796 40 -1.46 7.04 PlyA - 206378 206373 6 1.05 7.03 Term - 212236 212121 116 2 2 62 48 93 0.842 1.43 7.02 Intr - 212544 212257 288 1 0 75 58 122 0.889 5.12 7.01 Init - 212955 212895 61 0 1 117 36 123 0.980 9.42 7.00 Prom - 220852 220813 40 -5.76 8.00 Prom + 230915 230954 40 -4.46 8.01 Init + 236876 236885 10 1 1 103 91 2 0.415 2.56 8.02 Intr + 255651 255748 98 1 2 75 73 49 0.000 1.83 8.03 Intr + 259626 259772 147 2 0 68 96 30 0.002 2.23 8.04 Intr + 261257 261434 178 1 1 24 86 223 0.006 15.19 8.05 Intr + 262190 262335 146 0 2 18 94 142 0.962 7.80 8.06 Intr + 262547 262681 135 1 0 85 99 175 0.988 19.06 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 251745 251646 100 2 1 84 50 66 0.801 -0.00 S.002 Intr - 252423 252368 56 0 2 95 93 87 0.882 7.68 S.003 Init + 261272 261434 163 1 1 88 86 191 0.887 18.84 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:35475838_35742824|GENSCAN_predicted_peptide_1|374_aa MRVTVNEQDSGCDATVYASERKGAEETYLFLDDTVTLMRALTEWLHAFGSCVTLLSRPVG SGPEPRATSTIASNSWNASSSPGEAREDGPEGLDKGLDNDAEGVWSPDIEQSFQEALAIY PPCGRRKIILSDEGKMYGRNELIARYIKLRTGKTRTRKQVLARKKVREYQVGIKVSSHLQ VLARRKSREIQSKLKAMNLDQVSKDKALQSMASMSSAQIVSASVLQNKFSPPSPLPQAVF STSSRVHGSQEVGCIKPFAQPAYPIQPPLPPTLSTAASVPVWQDRTIASSRLRLLEYSAF MEVQRDPDTVTCLGVRYSKHLFVHIGQTNPAFSDPPLEAVDVRQIYDKFPEKKGGLKELY EKGPPNAFFLVKFW >gi568815592r:35475838_35742824|GENSCAN_predicted_CDS_1|1122_bp atgcgggtaacagtgaatgagcaggacagtggctgtgacgcaactgtgtatgcgagcgag aggaagggcgcagaggaaacctatttgttcttagatgacaccgtgactctcatgagggca ctgactgagtggctgcatgcctttggcagctgcgtgaccttgctcagccgcccagtgggc tcaggcccagagcccagagcaaccagcacaatagcgtccaacagctggaacgccagcagc agccccggggaggcccgggaggatgggcccgagggcctggacaaggggctggacaacgat gcggagggcgtgtggagcccggacatcgagcagagcttccaggaggccctggccatctac ccgccctgcggccggcggaagatcatcctgtcagacgagggcaagatgtacggccgaaat gagttgattgcacgctatattaaactgaggacggggaagactcggacgagaaaacaggtt ctagctcggaagaaggtgcgggagtaccaggttggcatcaaggtctctagccacttgcaa gttctagccaggcggaaatctcgggagattcagtctaagctgaaggccatgaacctggac caggtctccaaggacaaagcccttcagagcatggcgtccatgtcctctgcccagatcgtc tctgccagtgtcctgcagaacaagttcagcccaccttcccctctgccccaggccgtcttc tccacttcctcgcgggtacatgggtcccaggaggtgggctgcatcaagccctttgcacag ccagcctaccccatccagccgcccctgccgccgacgctcagcactgctgcctctgtgcct gtgtggcaggaccgtaccattgcctcctcccggctgcggctcctggagtattcagccttc atggaggtgcagcgagaccctgacacggtaacgtgtttgggggtgaggtacagcaaacac ctgtttgtgcacatcggccagacgaaccccgccttctcagacccacccctggaggcagta gatgtgcgccagatctatgacaaattccccgagaaaaagggaggattgaaggagctctat gagaaggggccccctaatgccttcttccttgtcaagttctgg >gi568815592r:35475838_35742824|GENSCAN_predicted_peptide_2|560_aa MESASAESDYPYGGKLDFPQRKSGYHYQEKRKKMLGQKNWELRVGSPNGVKEERPAPAQR LRKKRTEAPESPCPTGSKPRKPGAGRTGRPREEPSPDPAQARAPQTVYARFLRDPEAKKR DPRETFLVARAPDAEDEEEEEEEDEEDEEEEAEEKKEKILLPPKKPLREKSSADLKERRA KAQGPRGDLGSPDPPPKPLRVRNKEAPAGEGTKMRKTKKKGSGEADKDPSGSPASARKSP AAMFLVGEGSPDKKALKKKGTPKGARKEEEEEEEAATVIKKSNQKGKAKGKGKKKEERAP SPPVEVDEPREFVLRPAPQGRTVRCRLTRDKKGMDRGMYPSYFLHLDTEKKVFLLAGRKR KRSKTANYLISIDPTNLSRGGENFIGKLRSNLLGNRFTVFDNGQNPQRGYSTNVASLRQE LAAVIYETNVLGFRGPRRMTVIIPGMSAENERVPIRPRNASDGLLVRWQNKTLESLIELH NKPPVWNDDSGSYTLNFQGRVTQASVKNFQIVHADDPDYIVLQFGRVAEDAFTLDYRYPL CALQAFAIALSSFDGKLACE >gi568815592r:35475838_35742824|GENSCAN_predicted_CDS_2|1683_bp atggagtcagcctcagctgaatctgattatccatatggggggaaactggattttcctcaa aggaaaagtgggtaccattaccaagagaaaaggaaaaagatgcttgggcagaagaactgg gagctacgggttggaagtcccaatggggtcaaagaagagcgacccgccccggcacagagg ctaaggaagaagaggacggaggcccccgaatccccctgccccacgggatccaagccccgg aagcccggagctgggcggacggggaggccgcgggaggagccttccccagacccagcccag gcccgggcgccgcagacggtctacgccaggttcctcagggaccccgaggccaagaagcgc gacccccgggaaacctttctggtagcccgtgccccagacgcggaggacgaggaggaggag gaagaggaggacgaggaggacgaggaagaggaggcagaggaaaagaaagagaaaatcctt ctgcctcccaagaagcccctgagagagaagagctccgcagacctgaaggagaggagggcc aaggcccagggcccaaggggagacctgggaagccctgaccccccaccgaaacctctgcgt gttaggaataaggaagctccagcaggggaggggaccaagatgagaaagaccaagaagaaa gggtctggggaggccgacaaggacccctcagggagcccagccagtgcgaggaagagccca gcagccatgtttctggttggggaaggcagtcctgacaagaaagccctgaagaagaaaggc actcccaaaggcgcgaggaaggaggaagaagaggaggaggaggcagctacggtgataaag aagagcaatcaaaagggcaaagccaaaggaaaaggcaaaaagaaggaggagagggccccg tctccccccgtggaggtggacgaaccccgggagtttgtgctccggcctgccccccagggc cgcacggtgcgctgccggctgacccgggacaaaaagggcatggatcgaggcatgtatccc tcctacttcctgcacctggacacggagaagaaggtgttcctcttggctggcaggaaacga aaacggagcaagacagccaattacctcatctccatcgaccctaccaatctgtcccgagga ggggagaatttcatcgggaagctgaggtccaacctcctggggaaccgcttcacggtcttt gacaacgggcagaacccacagcgtgggtacagcactaatgtggcaagccttcggcaggag ctggcagctgtgatctatgaaaccaacgtgctgggcttccgtggcccccggcgcatgacc gtcatcattcctggcatgagtgcggagaacgagagggtccccatccggccccgaaatgct agtgacggcctgctggtgcgctggcagaacaagacgctggagagcctcatagaactgcac aacaagccacctgtctggaacgatgacagtggctcctacaccctcaacttccaaggccgg gtcacccaggcctcagtcaagaacttccagattgtccacgctgatgaccccgactatatc gtgctgcagttcggccgcgtggcggaggacgccttcaccctagactaccggtacccgctg tgcgccctgcaggccttcgccatcgccctctccagtttcgacgggaagctggcctgcgag tga >gi568815592r:35475838_35742824|GENSCAN_predicted_peptide_3|52_aa MMASSRWRGDAIQVQTDMDAPAGEKRLSLASPETHAGASNNIRRTLELNNHE >gi568815592r:35475838_35742824|GENSCAN_predicted_CDS_3|159_bp atgatggcgtcctcgagatggagaggcgatgccatccaggtgcagacagacatggacgca cctgcaggggagaaaaggttgtcactggccagccctgagacccacgcaggagcaagcaac aacatccggaggacactggaactgaacaaccatgaataa >gi568815592r:35475838_35742824|GENSCAN_predicted_peptide_4|188_aa MVCVNVLADALKSINNAEKRGKRPVLIRPCSKLIVRFLTVMMKHGYIGEFEIIDDRRAGK IVANLTGRLNKCGVISPRFENPSDFKQEKDSLEGYWVVHKITERAEEPRSKTKPFLQLSP SPSAGTAIACQLWADVSSRWQMLCDDAAICMPHFSLVCESLSPGGPILHLSGTVPSSQQM GRKCLGAG >gi568815592r:35475838_35742824|GENSCAN_predicted_CDS_4|567_bp atggtgtgcgtgaacgtcctggccgatgctctcaagagcatcaacaatgccgaaaagaga ggcaaacgcccagtgcttattaggccgtgctccaaactcatcgtccggtttctcactgtg atgatgaagcatggttacattggcgaatttgaaatcattgatgatcgcagagctgggaaa attgttgcgaacctcacaggcaggctaaacaagtgtggagtgatcagccccagattcgaa aatccttctgactttaagcaggaaaaggattcattggaaggatattgggtagttcacaaa atcactgagagggctgaggaaccaagatccaagactaaaccatttctgcagttaagcccc agcccttcggctggcacagccatcgcctgccagctctgggcagatgtcagtagcagatgg cagatgctctgtgatgatgcagccatctgcatgcctcatttttcacttgtctgcgagtcg ctcagcccagggggccccatccttcacctcagtggaacagtacccagttcacaacagatg ggccgtaaatgtttgggggctggctga >gi568815592r:35475838_35742824|GENSCAN_predicted_peptide_5|327_aa MIELLDFKGEDLFEDGGIIRRTKRKGEGYSNPNEGATVEIHLEGRCGGRMFDCRDVAFTV GEGEDHDIPIGIDKALEKMQREEQCILYLGPRYGFGEAGKPKFGIEPNAELIYEVTLKSF EKAKESWEMDTKEKLEQAAIVKEKGTVYFKGGKYMQAVIQYGKIVSWLEMEYGLSEKESK ASESFLLAAFLNLAMCYLKLREYTKAVECCDKALGLDSANEKGLYRRGEAQLLMNEFESA KGDFEKVLEVNPQNKAARLQISMCQKKAKEHNERDRRIYANMFKKFAEQDAKEEANKAMG KKTSEGVTNEKGTDSQAMEEEKPEGHV >gi568815592r:35475838_35742824|GENSCAN_predicted_CDS_5|984_bp atgattgagctccttgatttcaaaggagaggatttatttgaagatggaggcattatccgg agaaccaaacggaaaggagagggatattcaaatccaaacgaaggagcaacagtagaaatc cacctggaaggccgctgtggtggaaggatgtttgactgcagagatgtggcattcactgtg ggcgaaggagaagaccacgacattccaattggaattgacaaagctctggagaaaatgcag cgggaagaacaatgtattttatatcttggaccaagatatggttttggagaggcagggaag cctaaatttggcattgaacctaatgctgagcttatatatgaagttacacttaagagcttc gaaaaggccaaagaatcctgggagatggataccaaagaaaaattggagcaggctgccatt gtcaaagagaagggaaccgtatacttcaagggaggcaaatacatgcaggcggtgattcag tatgggaagatagtgtcctggttagagatggaatatggtttatcagaaaaggaatcgaaa gcttctgaatcatttctccttgctgcctttctgaacctggccatgtgctacctgaagctt agagaatacaccaaagctgttgaatgctgtgacaaggcccttggactggacagtgccaat gagaaaggcttgtataggaggggtgaagcccagctgctcatgaacgagtttgagtcagcc aagggtgactttgagaaagtgctggaagtaaacccccagaataaggctgcaagactgcag atctccatgtgccagaaaaaggccaaggagcacaacgagcgggaccgcaggatatacgcc aacatgttcaagaagtttgcagagcaggatgccaaggaagaggccaataaagcaatgggc aagaagacttcagaaggggtcactaatgaaaaaggaacagacagtcaagcaatggaagaa gagaaacctgagggccacgtatga >gi568815592r:35475838_35742824|GENSCAN_predicted_peptide_6|103_aa MTTDEGAKNNEESPTATVAEQGEDITSKKDRGVLKIVKRVGNGEETPMIGDKVYVHYKGK LSNGKKFDSSHDRNEPFVFSLGKETCSVIGAGMQWHHHNSLQH >gi568815592r:35475838_35742824|GENSCAN_predicted_CDS_6|312_bp atgactactgatgaaggtgccaagaacaatgaagaaagccccacagccactgttgctgag cagggagaggatattacctccaaaaaagacaggggagtattaaagattgtcaaaagagtg gggaatggtgaggaaacgccgatgattggagacaaagtttatgtccattacaaaggaaaa ttgtcaaatggaaagaagtttgattccagtcatgatagaaatgaaccatttgtctttagt cttggcaaagagacttgctctgttattggggctggaatgcaatggcaccatcataactca ctacagcattga >gi568815592r:35475838_35742824|GENSCAN_predicted_peptide_7|154_aa MATEMGRPAAAPREPNALLPGGRTLSRHRCGREPNFAFGSGRRDNRTPGRAERRDAAGAG RIRPPGGAVKGMVAGWPRSGCGRAVPAFTCRLGTSVGEPHRRCRPLVEESAQRRLTGHTE AAERCARAPYALLFWDVKTSSALLVVQKGAPVTV >gi568815592r:35475838_35742824|GENSCAN_predicted_CDS_7|465_bp atggccacggagatggggcggccggccgcggcgccccgggagccgaacgccctccttcca ggcggccggaccctcagccggcaccgctgcgggcgggagccaaacttcgctttcgggtcc gggcgccgggacaaccggaccccagggcgagccgagcgccgggacgccgcgggcgcgggg cggattcgcccacctggcggggctgtgaaggggatggtggcggggtggccgcgctccggg tgtggacgggcggtccccgccttcacctgccgcctcgggacaagcgttggggaaccgcat cgccgctgtcggccgcttgtggaggagtccgcgcagcgccgtctaacaggccataccgaa gcggctgaacgctgtgccagggctccttatgctcttctgttctgggacgtgaaaacgtct tccgccttattggtcgtccaaaaaggtgcaccggtcacagtataa >gi568815592r:35475838_35742824|GENSCAN_predicted_peptide_8|238_aa MQGVALTSVTSRGAWPLSIVSWCPGPLRIRTWGEDEGVLGQGSEWAGLAGRSIIGDAVSL PPSILGAAQDPEPSVPVLPKRDQGQGNTEDMGKSIPQYLGQLDIRKSVVSLATGAGAIYL LYKAIKAGIKCKPPLCSNSPICIARLAVERERHGRDSGELRRLLNSLECKQDEYAKSMIL HSITRCVYLLEAEASACTTDDIVLLGYMLDDKDNSVKTQALNTLKAFSGIRKFRLKIQ >gi568815592r:35475838_35742824|GENSCAN_predicted_CDS_8|714_bp atgcagggtgttgccttgacatcagtgacgtcgcgaggggcgtggcctctctccatcgtc tcctggtgccctgggcccctccgcatccgaacctggggggaggatgagggtgtactgggc caaggctctgagtgggcaggcctggctggccgcagcatcattggagatgctgtttcccta cctccttccatcctgggtgcagcccaggacccagagccttctgtcccagttctcccaaaa agggatcagggccagggcaacactgaagacatgggcaagagcatcccccaatacctgggg caactggacatccgcaaaagcgtagtcagcctggccacaggcgccggggcgatctacctg ctctacaaggccatcaaggctggcataaaatgcaaaccacccctctgtagcaactcaccc atctgcatcgcccgcctggcagtcgagcgagagcggcacgggcgggactcaggtgagctc cggaggctcctcaactctttggagtgcaaacaggatgagtatgccaagagcatgatcctg cacagtatcactcgctgtgtgtacttgctggaggctgaggcctctgcttgtactacggat gacatcgtgttgctgggctacatgctggatgacaaggacaacagtgtcaaaacccaagct ctgaatacacttaaagctttctctggcatcagaaaattcaggctcaaaatccag