GENSCAN 1.0 Date run: 4-Nov-116 Time: 21:08:48 Sequence gi568815581r:11877960_12092829 : 214870 bp : 41.11% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2119 2241 123 1 0 81 107 125 0.952 12.48 1.02 Intr + 3250 3454 205 1 1 57 100 258 0.999 22.18 1.03 Intr + 5627 5791 165 1 0 82 93 158 0.999 14.94 1.04 Intr + 8866 9006 141 0 0 79 57 197 0.741 15.33 1.05 Intr + 13818 13988 171 2 0 123 98 142 0.992 17.92 1.06 Intr + 16415 16537 123 1 0 43 86 156 0.716 10.76 1.07 Intr + 24760 24953 194 0 2 44 58 239 0.054 13.87 1.08 Intr + 27702 27850 149 0 2 80 100 115 0.909 10.86 1.09 Intr + 45855 45982 128 1 2 116 91 82 0.874 10.78 1.10 Intr + 51907 52134 228 0 0 68 96 237 0.671 19.64 1.11 Intr + 53998 54246 249 0 0 41 98 96 0.488 2.61 1.12 Intr + 55921 56112 192 0 0 94 94 140 0.915 13.97 1.13 Intr + 59393 59563 171 1 0 61 93 116 0.877 8.62 1.14 Intr + 64344 64526 183 2 0 94 105 186 0.804 19.96 1.15 Intr + 72451 72529 79 0 1 89 56 14 0.698 -3.49 1.16 Term + 74833 74966 134 2 2 141 55 95 0.287 8.77 1.17 PlyA + 76083 76088 6 1.05 2.00 Prom + 78236 78275 40 -5.95 2.01 Init + 80569 80655 87 0 0 21 81 132 0.265 6.49 2.02 Intr + 83908 84297 390 0 0 105 103 392 0.379 36.39 2.03 Intr + 91341 91528 188 2 2 94 39 215 0.353 14.77 2.04 Intr + 91563 91662 100 0 1 24 107 20 0.177 -3.31 2.05 Intr + 94194 94376 183 2 0 10 131 89 0.303 4.46 2.06 Term + 96933 97415 483 2 0 74 33 255 0.813 12.36 2.07 PlyA + 98153 98158 6 1.05 3.07 PlyA - 98527 98522 6 1.05 3.06 Term - 100785 99998 788 1 2 105 42 599 0.999 49.50 3.05 Intr - 105445 105338 108 2 0 74 95 70 0.573 5.64 3.04 Intr - 106238 106154 85 2 1 105 78 56 0.739 4.77 3.03 Intr - 112591 112503 89 2 2 78 99 12 0.729 0.07 3.02 Intr - 113204 113015 190 2 1 51 66 202 0.884 12.54 3.01 Init - 114870 114484 387 0 0 68 94 468 0.623 41.05 3.00 Prom - 118591 118552 40 -5.55 4.02 PlyA - 119765 119760 6 1.05 4.01 Sngl - 133093 132830 264 1 0 111 49 161 0.448 9.46 4.00 Prom - 137833 137794 40 -5.25 5.04 PlyA - 138347 138342 6 1.05 5.03 Term - 139567 139311 257 2 2 42 47 178 0.838 3.86 5.02 Intr - 142782 142621 162 1 0 32 107 131 0.914 8.43 5.01 Init - 143239 142987 253 1 1 40 7 282 0.401 10.85 5.00 Prom - 155323 155284 40 -5.35 6.05 PlyA - 155501 155496 6 1.05 6.04 Term - 171428 171264 165 2 0 50 32 136 0.479 1.13 6.03 Intr - 185047 184985 63 1 0 66 116 33 0.013 1.90 6.02 Intr - 191300 191264 37 1 1 76 108 24 0.037 0.55 6.01 Intr - 197073 196900 174 2 0 67 105 104 0.312 8.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 22586 22581 6 2 0 98 65 0 0.879 -0.37 S.002 Term - 39723 39564 160 2 1 95 43 123 0.976 4.93 S.003 Intr + 90621 90676 56 1 2 84 103 55 0.899 4.38 S.004 Term - 116993 116928 66 2 0 127 39 115 0.963 7.46 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:11877960_12092829|GENSCAN_predicted_peptide_1|878_aa XYLQIIEQALEAGAVVLIENLEESIDPVLGPLLGREVIKKGRFIKIGDKECEYNPKFRLI LHTKLANPHYQPELQAQATLINFTVTRDGLEDQLLAAVVSMERPDLEQLKSDLTKQQNGF KITLKTLEDSLLSRLSSASGNFLGETVLVENLEITKQTAAEVEKKVQEAKVTEVKINEAR EHYRPAAARASLLYFIMNDLSKIHPMYQFSLKAFSIVFQKAVERAAPDESLRERVANLID SITFSVYQYTIRGLFECDKLTYLAQLTFQILLMNREVNAVELDFLLRSPVQTGTASPVEF LSHQAWGAVKVLSSMEEFSNLDRDIEGSAKSWKKFVESECPEKEKLPQEWKNKTALQRLC MLRAMRPDRMTYALRDFVEEKLGSKYVVGRALDFATSFEESGPATPMFFILSPGVDPLKD VESQGRKLGYTFNNQNFHNVSLGQGQEVVAEAALDLAAKKGHWVILQNIHLVAKWLSTLE KKLEEHSENSHPEFRVFMSAEPAPSPEGHIIPQGILENSIKITNEPPTGMHANLHKALDN FTQPAPYKRSCVRTLKATLSFQDTLEMCSRETEFKSILFALCYFHAVVAERRKFGPQGWN RSYPFNTGDLTISVNVLYNFLEANAKVPYDDLRYLFGEIMYGGHITDDWDRRLCRTYLGE FIRPEMLEGELSLAPGFPLPGNMDYNGYHQYIDAELPPESPYLYGLHPNAEIGFLTQTSE KLFRTVLELQPRDSQARDGAGATREEKVKALLEEILERVTDEFNIPELMAKVEERTPYIV VAFQECGRMNILTREIQRSLRELELGLKRWNLSLLPRLECGGVIMAQEIFPPQPRLYESG TAAVTKYRRRCGLNNRYLFLTVVEAQSLSSSTGKFGFW >gi568815581r:11877960_12092829|GENSCAN_predicted_CDS_1|2637_bp nnctaccttcaaatcatagagcaggccctggaagctggagctgtggtgctgattgaaaat ctagaggagtccattgatcctgttctgggacccctgcttgggagagaagtcattaaaaaa ggacgattcattaaaattggagacaaagaatgtgaatacaatcccaagttccggctcatc ctccacaccaagctggctaatcctcactaccagcctgagctgcaggctcaggccaccctg atcaacttcaccgtgaccagggatggcctggaggaccagttgctggccgctgtggtcagc atggagaggccagacttggagcagctgaagtccgatctcacaaagcagcagaatggattc aaaattaccctgaaaacgttggaagacagtcttctctctcgcctctcctccgcctctggg aacttcctgggagaaacagtgctggtggaaaacctagagatcaccaagcagactgctgcc gaagttgagaaaaaggtccaggaggccaaggtgactgaagtgaaaatcaacgaggcccga gagcactaccggccagcagctgccagggcctcactgctctacttcatcatgaacgacctc agcaagatccatccaatgtaccagttttctctcaaggccttcagtatcgtcttccagaag gctgtggagagggctgctcctgacgaaagcctcagggagcgggtggccaacctaatagac agcataaccttctctgtgtaccagtacaccatccgcgggctctttgagtgtgataagctg acctaccttgcccagctcacctttcagattctcctcatgaaccgagaagtcaatgcagtg gagttggatttcctgcttcgatctccagtgcagacgggcaccgccagccccgtggagttc ctctcccatcaggcgtggggagctgtcaaggtactttcatcaatggaagaattctctaat ctggatcgggacatagagggatctgctaagagctggaaaaagtttgtggagtccgaatgt cctgagaaagagaagctcccacaggagtggaagaacaagacagccctgcagcgcctctgc atgctgagagccatgcggcccgaccggatgacctatgctttgcgagattttgttgaagag aagttaggaagcaaatacgtggtgggaagagccctagattttgcaacctcatttgaagaa tcgggaccagccactcctatgtttttcatcctgtctccaggggtggacccactgaaggat gtagaaagtcaaggaagaaaacttggatacaccttcaacaatcagaactttcacaacgtg tctttggggcaaggacaggaagtggtggctgaggctgcgctggacctcgctgccaagaaa ggtcactgggttattttgcagaacattcacctggtggccaagtggctcagcaccctggag aagaagctggaggagcacagtgagaacagccacccagagttcagggtcttcatgagtgca gagccagcaccctcccctgagggccacatcatcccccagggcatcctggagaactccatt aagatcaccaatgagccccccacgggcatgcatgccaacctgcacaaggccctggacaac ttcactcagccagccccgtataagagaagttgtgtgcgaaccttaaaagcgacactctca tttcaggacactctggagatgtgttctcgggagacggagtttaagagcatcctctttgct ctttgttacttccatgcggtggtggcagaaagacgaaaatttgggccccagggatggaat cgctcatacccctttaacactggagacctcactatctctgtgaatgtcctctacaacttc ctggaggccaacgcaaaggtcccctatgatgatttgcgctacctgtttggagagatcatg tatggaggccatatcacagatgactgggacagaagactctgcagaacctacctgggggaa ttcattcgaccagaaatgttagaaggagaactgtctttggccccagggttcccactccca ggcaacatggactacaatggttatcatcagtacatcgatgctgagctgcccccagaatcc ccctacctctatggcctccacccgaacgcagagattggcttcctgacccaaacctcagaa aagctcttccgcactgtgctggagctgcagcctcgggacagccaggccagagacggagcg ggcgccacaagagaagaaaaggtcaaggcacttctggaagaaatattggagcgggtgaca gacgagtttaacatcccagaactgatggccaaagtggaggagcgcaccccttacattgta gttgccttccaggagtgtggccggatgaatatcctcaccagagagattcagcgctcactg agggagctggagctcggcttaaagagatggaatctttctctattgcccagattggagtgt ggtggtgtgatcatggctcaagagatcttcccacctcagccacggctgtatgagtcaggt acagctgctgtaacaaaataccgtagacggtgtggcttaaacaacagatatttatttctc acagttgtggaggctcaaagtctaagctcaagcaccggcaaatttggtttctggtga >gi568815581r:11877960_12092829|GENSCAN_predicted_peptide_2|476_aa MYNAKGKPYGKLRTLSDNEVNAGASLVTKGELTMTSHMENLQNALYFDMVPESWARRAYP STAGLAAWFPDLLNRIKELEAWTGDFTMPSTVWLTGFFNPQSFLTAIMQSTARKNEWPLD QMALQCDMTKKNREEFRSPPREGAYIHGLFMEGACWDTQAGIITEAKLKDLTPPMPVMFI KAIPADKQDCRSVYSCPVYKTSQRGPTYVWTFNLKTKENPSKFSILQSHRENKKAGLGGC LEGQVGEGSPQTLRTPVWWSLHTDAHEIWCRDSDRGTSLGRSILRPPALCSMRKIHLRPQ VLRLISPRNISPISNPHALKGLKPVITRLLQHGLLKPINSPYNSPILPVLKPDKPYKLVQ DLCLINQIVLPIHPMMPNPYTLLSSIPPSTIHYSVLDLKRAFFTTPLYLSSQPLFAFTWT DPDTHQVQQITWAVLPQGFTDSPHYFSQAQISSSSVTYLGIILVKTHELSLLIVSD >gi568815581r:11877960_12092829|GENSCAN_predicted_CDS_2|1431_bp atgtacaacgcaaagggcaaaccctacggtaaactacggactctgagtgataatgaggtg aatgcaggtgcatcacttgtaacaaagggggagctgactatgaccagccacatggagaac ttacagaatgccctgtacttcgatatggtgccagagtcctgggctagacgagcctaccct tccacagcaggcctggcagcctggtttccagacctcctcaacagaatcaaggagctagag gcttggacgggtgactttacaatgccctccactgtgtggctgacaggcttcttcaacccc cagtcgttcctgactgccatcatgcagtccacggctcgcaagaatgagtggccactggac cagatggccctgcaatgtgacatgacgaagaagaacagagaagagtttaggagtcctcct cgggaaggggcctacatccatggcctcttcatggaaggtgcctgctgggacacacaggct gggatcattacagaggcaaagctgaaggatctgacaccccctatgcctgtgatgttcatc aaggccattcctgcagataagcaggactgccgcagtgtctattcctgtcctgtgtacaag actagtcagcggggacccacctacgtgtggactttcaacctgaagactaaggaaaaccca tccaaatttagcatcctgcagagccaccgagaaaataaaaaagctgggcttggaggctgc ctagagggacaggtgggtgaagggtcaccacagacacttagaacgcctgtttggtggtct cttcacacggacgcacatgaaatttggtgccgtgactcggatcgggggacctcccttggg agatcaatcctccgtcctcctgctctttgctccatgagaaagatccacctacgacctcag gtcctcagactgatcagcccaagaaacatctcaccaatctcaaatccgcatgctttgaaa ggattaaagcctgttatcactcgcctgctacagcatggccttttaaagcctataaactct ccttacaattcccccattttacctgtcctaaaaccagacaagccttacaagttagttcag gatctatgccttatcaaccaaattgttttgcctatccaccctatgatgccaaacccatat actctcctatcctcaatacctccctccacaatccattattctgttctggatctcaaacgt gctttctttactactcctttgtacctgtcatcccagcctctcttcgctttcacttggact gaccctgacactcatcaggttcagcaaattacctgggctgtactgccgcaaggcttcaca gacagcccccattacttcagtcaagcccaaatttcatcctcatctgttacctatctcggc ataattctcgtaaaaacacacgagctctccctgctgatcgtgtctgattaa >gi568815581r:11877960_12092829|GENSCAN_predicted_peptide_3|548_aa MPVDLGQALGLLPSLAKAEDSQFSESDAALQEELSSPETARQLFRQFRYQVMSGPHETLK QLRKLCFQWLQPEVHTKEQILEILMLEQFLTILPGEIQMWVRKQCPGSGEEAVTLVESLK GDPQRLWQWISIQVLGQDILSEKMESPSCQVGEVEPHLEVVPQELGLENSSSGPGELLSH IVKEESDTEAELALAASQPARLEERLIRDQDLGASLLPAAPQEQWRQLDSTQKEQYWDLM LETYGKMVSGGISHPKSDLTNSIEFGEELAGIYLHVNEKIPRPTCIGDRQENDKENLNLE NHRDQELLHASCQASGEVPSQASLRGFFTEDEPGCFGEGENLPEALQNIQDEGTGEQLSP QERISEKQLGQHLPNPHSGEMSTMWLEEKRETSQKGQPRAPMAQKLPTCRECGKTFYRNS QLIFHQRTHTGETYFQCTICKKAFLRSSDFVKHQRTHTGEKPCKCDYCGKGFSDFSGLRH HEKIHTGEKPYKCPICEKSFIQRSNFNRHQRVHTGEKPYKCSHCGKSFSWSSSLDKHQRS HLGKKPFQ >gi568815581r:11877960_12092829|GENSCAN_predicted_CDS_3|1647_bp atgcccgttgacttggggcaggccctaggcctgctgccatcgctggcgaaggccgaggac tcccagttctcagaatcagatgctgcccttcaagaggaactctccagccctgagaccgca cgccagcttttcaggcagttccgttaccaggtgatgtctgggcctcatgagaccttgaag caacttcggaagctctgtttccagtggctacagccagaggttcacaccaaagagcagatc ctagagatcctcatgttggagcagtttctgaccatcctgcctggggagatccagatgtgg gtgcggaaacagtgtccaggaagtggagaagaggcagtgacccttgtggaaagcttgaag ggggacccccagagactgtggcaatggatcagtatccaggttctaggacaggacatctta tcagagaagatggaatctccaagctgccaagtgggggaagtggagccccatcttgaagtg gtgcctcaggagttgggacttgagaattcatcctcagggcctggggagcttctgagccac atcgtgaaagaggaatctgacacagaagcagaactagccctggctgcctcccagcctgcc cgactggaggaaaggctgatcagagaccaggacctcggagcctcactgctcccagcagca cctcaggaacagtggagacaactggattccactcaaaaggagcaatactgggatctcatg ctggagacctatgggaaaatggtctcaggaggcatttcccatcccaaatctgacctgact aattcaatagaatttggggaagagctggcaggaatataccttcatgtcaatgagaagatc ccaagacccacctgcataggagatagacaagagaatgacaaggagaacctaaatttggag aatcacagggaccaggagctcctgcatgcttcctgtcaagcttcaggagaggttccttct caggcttccttgaggggcttcttcactgaggatgagccaggatgctttggagaaggagag aatctccctgaggctctgcaaaacattcaggatgagggaacaggggaacagctgtctcct caagaaaggatttctgagaaacaactaggtcagcatttgcctaatcctcattcaggagaa atgtccaccatgtggcttgaggagaagagagagacctcccagaaggggcagccaagagcc cccatggcccagaagctccccacctgcagggagtgtgggaagaccttttataggaattct cagcttatttttcaccaaagaactcacaccggagagacatactttcagtgcaccatctgc aaaaaagcctttctgcggagttcagactttgtgaagcatcagagaactcacacgggagag aagccctgtaaatgtgattactgtgggaaaggctttagtgacttctcaggattgcgccac cacgagaaaatccacacaggagagaaaccctataaatgtcctatctgtgagaaaagtttc attcagagatcaaactttaatagacatcagagggttcacactggagagaaaccttataaa tgttcgcactgtgggaaaagtttcagctggagctcgagccttgacaaacatcaaagatcc cacttaggaaagaagccctttcaatag >gi568815581r:11877960_12092829|GENSCAN_predicted_peptide_4|87_aa MAKLEESTMFPSMLLAFVVNKQVKGKILAKRINVCIGHVKHVKSQDSFLKQVKENYQKTK EDKEKSTWGSTEVPACTAQKSTLRENQ >gi568815581r:11877960_12092829|GENSCAN_predicted_CDS_4|264_bp atggctaaactggaagagtctacaatgttccccagcatgctgttggcatttgttgtaaac aaacaagttaagggcaagattcttgccaagagaattaatgtgtgtattgggcatgttaag cacgtgaagagccaagatagcttcctgaaacaagtgaaggaaaattatcagaaaactaaa gaagacaaagagaaaagtacctggggttcaactgaagtgccagcctgcactgcccagaaa agcacacttcgtgagaaccagtag >gi568815581r:11877960_12092829|GENSCAN_predicted_peptide_5|223_aa MRAGARIQAGPEAAGSGAGSRGGREGEAGGCRGFLSRVRGRPLGQAATTRVAALGGWDLG AAAFLTLHAADGRVAWRRGPYGAGEPDAQPGPDAELQNRPALGARPVKQTAVLDQGCGPA GPKVVCRIKGGPAFVRKEGPQFQPLGKKQGQTPLFGWNSHQIFHIALAIVFEQNAHPQIA KGSGPGHPVTDVASSGRELQAVQVSGCHSSTEFNWKYRACEKL >gi568815581r:11877960_12092829|GENSCAN_predicted_CDS_5|672_bp atgcgggccggagcccggatccaagccgggccggaggcggccgggagcggggcgggaagc cggggagggagagagggagaagccggcggctgccgtggcttcctcagccgggtccggggg cgtccgctggggcaggccgcgacgacgcgggttgccgcgctagggggctgggatctcggc gcggccgcgttccttaccctgcatgctgctgacggccgggtggcctggcgccggggaccc tacggggccggggagcccgacgcgcagcccgggccagatgctgagctgcagaaccgcccg gctctgggtgcgcgacctgtgaagcagacagctgtcttggaccagggctgcgggcccgcg gggcctaaagtggtttgcagaattaaaggtgggcctgcatttgtgcggaaagaaggaccc caatttcagcctctgggaaagaaacaaggtcagacacctctatttggatggaacagtcac cagatattccatatagccttagcaattgtctttgagcaaaacgcccaccctcaaattgcc aagggctcaggaccaggccacccagtgacggatgttgcctcttcaggcagagagctgcaa gctgtgcaggtttctggatgtcattcgagcactgaatttaattggaaatacagggcctgt gagaaactttga >gi568815581r:11877960_12092829|GENSCAN_predicted_peptide_6|146_aa XECQCGTSTCPTMKYQETDLPSYLKQLKKTDRIYRQSGRHWKSSGNKGQQTLKDRKYVRP QFPVAAKTSSQNERVVSRLKESTESKLQGKRKGTPIDTTGYSLEPPHNKEKPGTAVYKSF IEYLQEYWYLTYKYSEHHIAERGAVI >gi568815581r:11877960_12092829|GENSCAN_predicted_CDS_6|441_bp nntgagtgtcaatgtggcacctccacttgtcctactatgaagtatcaggagacagactta ccctcatacctaaaacaactgaaaaaaacagacagaatatataggcaatctggaagacac tggaaatcatcaggaaacaaagggcagcagacactgaaagaccggaaatatgtaaggccc caatttccagtggcagctaaaacctcttcccaaaatgaaagagtggtttctagattgaaa gagtccactgagagcaaactacaaggaaaaagaaagggaacacccattgacaccacaggg tattccctagaaccaccacacaataaagagaaacctggaactgcagtgtacaagtcattc atagaataccttcaagaatattggtatttgacttacaagtattcagaacatcacatcgca gagaggggagctgtcatttaa