GENSCAN 1.0	Date run: 5-Nov-116	Time: 00:02:35

Sequence gi568815596f:126556264_126796139 : 239876 bp : 43.91% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    190    185    6                               1.05
 1.03 Term -   1935   1366  570  0  0   77   50   427 0.941  32.24
 1.02 Intr -   9912   9865   48  0  0  124   93   -20 0.362   0.98
 1.01 Init -  20286  20194   93  0  0   51  100    35 0.003   1.38
 1.00 Prom -  41453  41414   40                              -2.16

 2.00 Prom +  50734  50773   40                               0.24
 2.01 Init +  67663  67745   83  0  2   79   94    31 0.404   3.24
 2.02 Intr +  81903  82030  128  0  2   64   63    71 0.034   2.52
 2.03 Intr +  87315  87388   74  1  2  117   59    22 0.629   1.33
 2.04 Term +  87946  88011   66  0  0   94   51    58 0.614   0.64
 2.05 PlyA +  88954  88959    6                               1.05

 3.00 Prom +  89574  89613   40                              -3.76
 3.01 Init +  99820 100049  230  0  2   26  109   152 0.213   6.94
 3.02 Intr + 108513 108666  154  0  1  112   45    45 0.119   2.67
 3.03 Intr + 119369 119486  118  1  1   47  109    36 0.015   1.64
 3.04 Intr + 133992 134048   57  1  0  126  109    35 0.705   8.26
 3.05 Intr + 137601 137684   84  1  0  126   99    82 0.965  12.89
 3.06 Term + 139683 139879  197  1  2  145   55   353 0.980  35.27
 3.07 PlyA + 140383 140388    6                               1.05

 4.07 PlyA - 143385 143380    6                               1.05
 4.06 Term - 164109 164035   75  0  0   64   51    59 0.039  -2.36
 4.05 Intr - 166348 166245  104  2  2   67  105    56 0.042   5.09
 4.04 Intr - 176911 176790  122  0  2   64   11   109 0.216   0.94
 4.03 Intr - 180147 180044  104  0  2   84   71   117 0.447   8.57
 4.02 Intr - 185502 185444   59  1  2   18   97    32 0.123  -4.30
 4.01 Init - 186259 186211   49  1  1   86   89    40 0.365   3.01
 4.00 Prom - 189438 189399   40                              -5.26

 5.00 Prom + 193679 193718   40                              -3.66
 5.01 Init + 196865 196930   66  1  0   73  103    28 0.579   2.19
 5.02 Intr + 197429 197553  125  1  2  137   72    57 0.623   8.38
 5.03 Intr + 203712 203800   89  0  2   81   81     3 0.194  -1.49
 5.04 Intr + 203853 204103  251  1  2    4  105   164 0.442   6.86
 5.05 Intr + 209458 209646  189  0  0   25   87    73 0.599   0.68
 5.06 Intr + 210807 211080  274  2  1  119   44   151 0.699  10.91
 5.07 Intr + 218560 218619   60  2  0   82   75    45 0.417   1.31
 5.08 Intr + 222194 222293  100  0  1   60   62    89 0.384   2.67
 5.09 Intr + 226574 226661   88  2  1   73   42    56 0.036  -0.53
 5.10 Term + 238061 238261  201  1  0   97   32   140 0.376   6.69
 5.11 PlyA + 238288 238293    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  14889  15031  143  0  2  130   76    70 0.877  10.00
S.002 Term +  52814  52921  108  1  0   80   54    83 0.835   2.61
S.003 Intr - 128460 128353  108  0  0   94  106    83 0.963  11.18

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:126556264_126796139|GENSCAN_predicted_peptide_1|236_aa
MHLKPEAFLEPPPACKYPGSRVGMIKHALNKCCPGLLKSKFYHLALLPPPTPDTEHPVTD
KNELVQKAKLAEQAEQYDDMAACMKSVTKQGAELSNEERNLLSVAYKNVVGARKSSWRVV
SSIEQKTEGAEKKQQMAREHREKIETELRDICNDVLSLLEKFLIPNASQAESKVFYLKMK
GDYYRYLTEVTAGDDKIGIVDQSQQAYQEAFEISKKEMQPTHPVRLGLALNFCVLL

>gi568815596f:126556264_126796139|GENSCAN_predicted_CDS_1|711_bp
atgcatctgaagcctgaagccttccttgagcctcctccagcctgcaagtaccctggcagc
agagtaggaatgataaaacatgctctaaacaagtgctgcccaggcctgttgaaaagcaaa
ttttatcacttggcgcttctgccaccacccactccggacacagaacatccagtcacggat
aaaaatgagctggttcagaaggccaaactggccgagcaggctgagcaatatgatgacatg
gcagcctgcatgaagtctgtaactaagcaaggagctgaattatccaatgaggagaggaat
cttctctcagttgcttataaaaatgttgtaggagcccgtaagtcatcttggagggtcgtc
tcaagtattgaacaaaaaacggaaggtgctgagaaaaaacagcagatggctcgagaacac
agagagaaaattgagacggagctaagagatatctgtaatgatgtattgtctcttttggaa
aagttcttgatccccaatgcttcacaagcagagagcaaagtcttctatttgaaaatgaaa
ggagattactaccgttacttgactgaggttactgctggtgatgacaagatagggattgtg
gatcagtcacaacaagcataccaagaagcttttgaaatcagcaaaaaagaaatgcaacca
acacatcctgtcagattgggtctggcccttaacttctgtgttctattatga

>gi568815596f:126556264_126796139|GENSCAN_predicted_peptide_2|116_aa
MKLQSSTFQKYIGSVQKSGKNSKWQQERGPMRRRKWSVLANMPRNCEIRLRFMWILKGKR
CAGEKSTKLQGVQHESPPCSCAVAPGVHGDMGMWERCPLGSFQKGAILDAERSTLE

>gi568815596f:126556264_126796139|GENSCAN_predicted_CDS_2|351_bp
atgaaacttcaatcaagtacatttcagaaatacattggttcagtccagaaaagtgggaaa
aattcaaagtggcagcaggagagggggcctatgagacggaggaaatggtcggtgctggcg
aacatgccaaggaactgtgaaatcagactaaggtttatgtggatcctgaaaggcaaacgt
tgtgctggagagaaaagcacgaagctgcaaggtgtccagcatgaatcacccccatgcagc
tgtgccgtggcaccaggagtccatggggacatggggatgtgggagagatgcccattagga
agcttccagaagggggccattctggatgctgaaagatctacccttgaatga

>gi568815596f:126556264_126796139|GENSCAN_predicted_peptide_3|279_aa
MKSAYLRARAPPLGPRGRSVTQVPLPLAAEGQEPGSATLPRPGLARPGQSPRSLPGLTPR
NVVDEKPQQHGVASQPRALILLYVGSPGSLAVLSVTGGLPGLGICRGSLYLSLWDFTNTA
GVDAREAQRIPFSMTSSPVVAVRHYPRKRRLLSLIFRILHCPFLTQAEPDPGMASASTTM
HTTTIAEPDPGMSGWPDGRMETSTPTIMDIVVIAGVIAAVAIVLVSLLFVMLRYMYRHKG
TYHTNEAKGTEFAESADAALQGDPALQDAGDSSRKEYFI

>gi568815596f:126556264_126796139|GENSCAN_predicted_CDS_3|840_bp
atgaagtctgcctatctccgggccagagcccctcccctcggcccgcgcgggaggagtgtg
acccaggtgccgcttcctctcgccgccgagggtcaggagcccgggagcgcgaccctcccc
cggcccggcctggcccggcctggccagtccccgcggtctctgcccgggctgacgcccagg
aatgtggtcgacgagaagccccaacagcacggcgtggcctctcagcctcgggctctgatc
ctcctctacgttggcagtcctggttccttagcagttctgtctgtcacgggaggactccct
gggctgggcatctgcaggggcagcctctacctctccctgtgggacttcactaacacagct
ggagtagatgctagggaggctcagagaatccccttctctatgacctcttcacctgtcgtg
gctgttcgtcactacccgagaaaacgcagacttcttagtctgatcttcaggatccttcat
tgccctttcctcacccaagcagagcctgatccggggatggcctctgcctccaccacaatg
catactaccaccattgcagagcctgatccagggatgtctggatggccggatggcagaatg
gagacctccacccccaccataatggacattgtcgtcattgcaggtgtgattgctgctgtg
gccatcgtcctagtctccctcctcttcgtcatgctgcgctacatgtaccggcacaagggc
acgtaccacaccaatgaggccaagggcacggagtttgctgagagtgcagatgcagccctg
cagggagaccctgccctccaagatgctggtgatagcagcagaaaggagtactttatttga

>gi568815596f:126556264_126796139|GENSCAN_predicted_peptide_4|170_aa
MGFHHVGQAGLELLTSVFGHYEQSCYKHSQAGFYVIVIADSWKAYAVMVVVVVVVIVARR
TALHRHADNSGSPLQDLQSRMSALGAPRLQGLLEVLAASSLKVFLLLPGPAGAWMQRHLH
PASQPDSGNPQGIIGPLNVWSTEEEKAHAGIYFPQVDMLKLTSMDPDSGV

>gi568815596f:126556264_126796139|GENSCAN_predicted_CDS_4|513_bp
atggggtttcaccatgtgggccaggctggtctcgaactcctgacctcagtttttggtcat
tatgaacaaagctgctataaacattcacaagcaggcttttatgtcatcgtcatagcagat
tcatggaaggcatatgcggtcatggtggtagtggtggttgtggtcattgtggcgaggagg
actgcccttcacagacatgcagacaacagtgggagtcccctccaggacctgcagagccgt
atgtctgccctgggtgctccacgtctgcagggcctcttggaggtgctggctgcttcatct
ttgaaagtcttcctcctccttcctggacctgctggagcatggatgcagaggcatctgcat
cctgcttcacagccagactcggggaatccccagggaatcattggaccactgaatgtctgg
tccactgaggaagagaaggctcacgctggcatttatttcccccaagtggacatgctgaag
ctgacctctatggatcctgattcgggggtctga

>gi568815596f:126556264_126796139|GENSCAN_predicted_peptide_5|480_aa
MGQALGSTAGKQGAQACSAGPQVRSPDFHLELLKMRRQLQALLWRKRQLPGVSRIFASSL
TPAGSGVHHCHRHTQQQAESLHWSPGLWSEGYDVNQNQFHTPGEKDLEYAGVVIPTTSPF
NVPIRPVQKTDGSWRMTVDYHKLNQVATPIVAAVPDVASFLRQINTSPGTCSAAIDLLIS
WQQLGVPFNIRHHRTGTKQTGNGHGIKELVLLGQLLFTHLGRRQYISKIFLYEIITQCEI
DPLGFAGLPDVDPWTLQSSAPCSSERSDVTALGGSIGGVPSAVIHMGILGLCPTPEEDTA
SQRGVCEDPQPQCLCQEQPPIAGPRASSFYAGCHVSEPKADGTSKPLVVGAALFPTTTRT
SNRHIPSVLGVWHIRGAQDVAEEQRVWLKFAKAVVPTPHQLLRATHYPSIGAVIGREAGT
GRGQDENSPTCLLLRSSHLGSAATSEQGNSDLRDYTALPGTGARGEIATALITRQSIDIP

>gi568815596f:126556264_126796139|GENSCAN_predicted_CDS_5|1443_bp
atgggccaggcacttgggagcacagcaggtaagcaaggagcccaggcctgcagcgcggga
ccccaggtgcgcagccctgactttcacctggagcttctgaagatgcggcgccagctgcaa
gcccttctctggaggaagaggcagcttccgggtgtctccaggatatttgcctcaagcctc
acacctgctggttctggagttcatcattgccacagacatactcagcagcaggcagaatcc
ctgcattggtcccctggtctgtggagtgagggctatgatgtaaaccaaaatcaattccac
actcctggagagaaggacttggaatatgcaggggtggtgattcccaccacatccccattc
aacgtgcccatccggcctgtgcagaagacagatggatcttggagaatgacagtggattat
cataagcttaaccaggtggcgactccaattgtagctgctgttccagatgtggcttcgttt
cttagacaaattaacacatcccctggtacctgctctgcagccattgatctgctgatcagc
tggcagcagctaggagttccattcaacatcaggcatcacagaacagggacaaagcaaact
ggcaatggccatggcatcaaggagttggtcctgctgggtcaactcctgttcactcacctg
ggaagaaggcagtacatttcaaaaatatttctatatgaaattattacacagtgtgagatt
gatcctctaggcttcgctggcctccctgatgtggatccctggactctgcagagctcagct
ccctgcagctctgagcgctctgatgtgacggccttgggagggagtataggcggagtgccc
agtgctgtgatccacatggggatcctaggtctgtgccccaccccagaggaggacacagca
agccagcgtggtgtctgtgaggacccccaaccccagtgcctatgccaggaacagcccccg
atagcaggacctagagccagcagcttctacgcaggttgccacgtttccgagcccaaggca
gatggcacttccaagccccttgtggttggggcagccttattccccaccaccaccaggaca
tctaatcggcacatccccagcgtcttaggtgtttggcacataagaggtgctcaggacgta
gctgaggaacaaagggtatggttgaagtttgctaaagctgtggtccccaccccccaccag
ttgctgagagccacccactatcccagcattggtgctgtgattggaagagaagctgggaca
gggagaggacaggatgagaactccccaacctgcctcctgctgcggagcagccacctgggg
tcagcagccacctctgagcaaggcaacagcgacttgcgggactacacggccctgcctggc
actggagcacgcggagaaatagccacagcgctaattaccaggcagagcattgatatccca
taa