GENSCAN 1.0	Date run: 7-Nov-116	Time: 00:46:38

Sequence gi568815584f:64886954_65161309 : 274356 bp : 44.75% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  27462  27581  120  2  0   53   88   101 0.193   6.79
 1.02 Intr +  37038  37173  136  2  1   58   97    56 0.197   3.64
 1.03 Intr +  39057  39127   71  1  2   70   99    16 0.082  -0.20
 1.04 Term +  46134  46286  153  2  0  107   54    46 0.147   1.02
 1.05 PlyA +  48385  48390    6                               1.05

 2.03 PlyA -  48750  48745    6                               1.05
 2.02 Term -  52885  52535  351  1  0  114   37   507 0.999  42.69
 2.01 Init -  55773  55714   60  0  0   95   71    86 0.996   8.85
 2.00 Prom -  57409  57370   40                              -6.06

 3.12 PlyA -  57952  57947    6                               1.05
 3.11 Term -  61559  61401  159  2  0   78   47   211 0.935  13.94
 3.10 Intr -  61780  61715   66  1  0  105   78    93 0.767   9.10
 3.09 Intr -  63461  63372   90  2  0  102  111    87 0.796  12.59
 3.08 Intr -  64198  64121   78  1  0  107   75    78 0.956   8.15
 3.07 Intr -  64710  64650   61  2  1   98  115    60 0.999   8.44
 3.06 Intr -  65618  65558   61  0  1  138   93    92 0.999  12.49
 3.05 Intr -  68494  68408   87  2  0   88   82    28 0.512   2.14
 3.04 Intr -  70416  70392   25  0  1   77   97    12 0.398  -1.40
 3.03 Intr -  71036  70915  122  0  2   44   61    92 0.171   2.31
 3.02 Intr -  85147  85000  148  1  1   66   99   360 0.960  34.71
 3.01 Init -  85807  85757   51  1  0   69   88    70 0.634   6.31
 3.00 Prom -  91600  91561   40                              -8.76

 4.00 Prom +  92673  92712   40                              -7.06
 4.01 Init +  96814  96898   85  0  1   45   93    50 0.473   2.18
 4.02 Intr + 100062 100144   83  1  2   67   94   122 0.447  10.06
 4.03 Intr + 103067 103156   90  1  0  -23   99   104 0.135   0.59
 4.04 Intr + 117296 117360   65  1  2   80  121    77 0.420   7.72
 4.05 Intr + 122166 122184   19  0  1   75  105   -21 0.263  -5.09
 4.06 Intr + 124663 124844  182  0  2  101   29   177 0.307  11.77
 4.07 Intr + 125364 125436   73  0  1   68   82    46 0.565   1.41
 4.08 Intr + 128672 128763   92  1  2   97   70    70 0.696   4.89
 4.09 Intr + 140500 140646  147  1  0   69   92   192 0.998  17.05
 4.10 Intr + 140745 140828   84  0  0  111  105    61 0.989   9.04
 4.11 Intr + 145657 145743   87  1  0  102  100   100 0.992  11.69
 4.12 Intr + 153837 153966  130  0  1  105   83    86 0.977  10.50
 4.13 Intr + 157358 157490  133  1  1   88  105   113 0.999  13.22
 4.14 Intr + 166285 166396  112  2  1  105   89   102 0.987  11.44
 4.15 Intr + 167622 167736  115  0  1   92   99   103 0.995  12.25
 4.16 Term + 174228 174359  132  2  0  117   44   128 0.959   9.39
 4.17 PlyA + 175359 175364    6                               1.05

 5.00 Prom + 177104 177143   40                              -5.86
 5.01 Init + 178315 178379   65  0  2   97   82    -1 0.513   0.34
 5.02 Intr + 178910 179080  171  2  0   29  100    89 0.450   3.26
 5.03 Term + 183048 183156  109  0  1   70   42   123 0.529   3.88
 5.04 PlyA + 186661 186666    6                               1.05

 6.09 PlyA - 188195 188190    6                               1.05
 6.08 Term - 189710 189523  188  0  2   68   46   266 0.993  18.05
 6.07 Intr - 191083 190960  124  1  1   80  105   151 0.923  16.16
 6.06 Intr - 206862 206755  108  0  0  115  113    58 0.941  11.48
 6.05 Intr - 215389 215351   39  1  0   42   86    90 0.192   2.62
 6.04 Intr - 223021 222886  136  0  1   48   75    32 0.166  -1.53
 6.03 Intr - 231668 231557  112  0  1   89   82    46 0.608   3.54
 6.02 Intr - 238593 238476  118  0  1   77   58    61 0.665   2.04
 6.01 Init - 243799 243683  117  1  0  110   87    45 0.562   4.85
 6.00 Prom - 246579 246540   40                              -6.66

 7.04 PlyA - 247148 247143    6                               1.05
 7.03 Term - 248104 247935  170  2  2  134   53   112 0.962  10.34
 7.02 Intr - 248925 248893   33  1  0  109   82    40 0.678   3.69
 7.01 Init - 262647 262578   70  0  1  101   35    42 0.250   1.41
 7.00 Prom - 273408 273369   40                              -1.66

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  19918  19835   84  1  0   82   89    46 0.860   4.92

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584f:64886954_65161309|GENSCAN_predicted_peptide_1|159_aa
MRQPYLSSREVSSSRKRWRTFPVDCVAMCGDCVEKEYPNRGNTCLENGSFLLNFTGCAVC
SKRDFMLITNKSLKEEDGEEIVTYDHLCKNCHHVIARHEYTFSIMDEFQNSEPKSDQEKN
ITQIPIEKHSLKYLTSTSQNYQGHKKQGKSETWSQPRGA

>gi568815584f:64886954_65161309|GENSCAN_predicted_CDS_1|480_bp
atgcggcaaccgtatctcagttctcgcgaggtttcgtcttcccggaagcgttggaggaca
ttccctgttgactgcgtcgcgatgtgtggcgactgtgtggagaaggaatatcccaaccgg
ggtaatacctgcctggagaatggatctttcttactgaactttacaggctgtgcagtgtgc
agtaagcgggattttatgctgatcacaaacaaatccttgaaagaagaagatggagaagaa
atagttacctatgatcatttgtgtaagaattgtcatcatgtaatagccagacatgagtat
acattcagtatcatggatgaatttcagaactcagaacccaagtctgatcaggagaaaaac
atcacacaaattccaattgagaagcattctttaaaatacctaaccagtacttctcaaaac
tatcaaggccataaaaaacaaggaaaatctgagacatggtcacagccaagaggagcctga

>gi568815584f:64886954_65161309|GENSCAN_predicted_peptide_2|136_aa
MAFIAKSFYDLSAISLDGEKENCQNEEILNSLKYVRPGGGYQPTFTLVQKCEVNGQNEHP
VFAYLKDKLPYPYDDPFSLMTDPKLIIWSPVRRSDVAWNFEKFLIGPEGEPFRRYSRTFP
TINIEPDIKRLLKVAI

>gi568815584f:64886954_65161309|GENSCAN_predicted_CDS_2|411_bp
atggctttcattgccaagtccttctatgacctcagtgccatcagcctggatggggagaag
gagaactgtcagaatgaggagatcctgaacagtctcaagtatgtccgtcctgggggtgga
taccagcccaccttcacccttgtccaaaaatgtgaggtgaatgggcagaacgagcatcct
gtcttcgcctacctgaaggacaagctcccctacccttatgatgacccattttccctcatg
accgatcccaagctcatcatttggagccctgtgcgccgctcagatgtggcctggaacttt
gagaagttcctcatagggccggagggagagcccttccgacgctacagccgcaccttccca
accatcaacattgagcctgacatcaagcgcctccttaaagttgccatatag

>gi568815584f:64886954_65161309|GENSCAN_predicted_peptide_3|315_aa
METGVRIQILLKAFLPRFPAPLAPVMAKQYDVLFRLLLIGDSGVGKTCLLCRFTDNEFHS
SHISTIAWATEEDSVKKKEKRHQVCQGVREKASGMPGRVGKGIGYAREAFIQQREVLSSG
KNQQPRREREGATMGTGYWGQGLTGVDFKMKTIEVDGIKVRIQIWDTAGQERYQTITKQY
YRRAQGIFLVYDISSERSYQHIMKWVSDVDEYAPEGVQKILIGNKADEEQKRQVGREQGQ
QLAKEYGMDFYETSACTNLNIKESFTRLTELVLQAHRKELEGLRMRASNELALAELEEEE
GKPEGPANSSKTCWC

>gi568815584f:64886954_65161309|GENSCAN_predicted_CDS_3|948_bp
atggagacgggtgttcgaatccagatcctgctcaaggccttcctacctcggttcccggcc
ccgctggccccagtcatggcgaagcagtacgatgtgctgttccggctgctgctgatcggg
gactccggggtgggcaagacctgcctgctgtgccgcttcaccgacaacgagttccactcc
tcgcacatctccaccatcgcctgggcgacagaggaagactccgtcaagaaaaaagagaaa
aggcatcaggtatgccagggtgtgcgggaaaaggcatcgggtatgccagggcgtgtggga
aaaggcatcgggtatgccagggaagcctttattcagcagagggaagtgctcagcagtgga
aaaaaccaacagccacgcagggagagggaaggagccacgatgggcacgggttactggggc
cagggcttgactggtgttgactttaagatgaagaccatagaggtagacggcatcaaagtg
cggatacagatctgggacactgcagggcaggagagataccagaccatcacaaagcagtac
tatcggcgggcccaggggatatttttggtctatgacattagcagcgagcgctcttaccag
cacatcatgaagtgggtcagtgacgtggatgagtacgcaccagaaggcgtccagaagatc
cttattgggaataaggctgatgaggagcagaaacggcaggtgggaagagagcaagggcag
cagctggcgaaggagtatggcatggacttctatgaaacaagtgcctgcaccaacctcaac
attaaagagtcattcacgcgtctgacagagctggtgctgcaggcccataggaaggagctg
gaaggcctccggatgcgtgccagcaatgagttggcactggcagagctggaggaggaggag
ggcaaacccgagggcccagcgaactcttcgaaaacctgctggtgctga

>gi568815584f:64886954_65161309|GENSCAN_predicted_peptide_4|542_aa
MYYYTVSQDIKGNNSFRHTACMSENRENEPLYSLRPEHARERLQDDSVETVTSIEQMREE
AIMDLVQEYSGQRKQQAQRTKAAECQAKVEEKIQEVFSSYKFNHLVPRKDFWIESHRLFC
QTKKEVQPLCFARPVQVAMGGTFHLKASIADLKAKDSDWLQNTVLWPGLWVTRGRLVLQR
EKHFHYLKRGLRQLTDAYECLDASRPWLCYWILHSLELLDEPIPQIVATDVCQFLELCQS
PEGGFGGGPGQYPHLAPTYAAVNALCIIGTEEAYDIINREKLLQYLYSLKQPDGSFLMHV
GGEVDVRSAYCAASVASLTNIITPDLFEGTAEWIARCQNWEGGIGGVPGMEAHGGYTFCG
LAALVILKRERSLNLKSLLQWVTSRQMRFEGGFQGRCNKLVDGCYSFWQAGLLPLLHRAL
HAQGDPALSMSHWMFHQQALQEYILMCCQCPAGGLLDKPGKSRDFYHTCYCLSGLSIAQH
FGSGAMLHDVVLGVPENALQPTHPVYNIGPDKVIQATTYFLQKPVPGFEELKDETSAEPA
TD

>gi568815584f:64886954_65161309|GENSCAN_predicted_CDS_4|1629_bp
atgtattactacacagtttcacaagatattaaaggaaacaattcattccgacatacggct
tgtatgagtgaaaacagggagaacgagccgctgtacagtctgaggcccgagcacgcgcga
gagcggttgcaggacgactcggtggaaacagtcacgtccatagaacagatgagggaggaa
gccatcatggatctggtgcaagagtattctgggcagagaaagcagcaagcacaaaggacc
aaggcagcagaatgccaggcaaaagtagaagaaaagatccaagaggtcttcagttcttac
aagttcaaccaccttgtaccaagaaaagatttttggatagagtcacacaggctcttctgc
cagaccaagaaagaagtgcagcctctgtgctttgcccggcctgtgcaggtggccatgggc
ggcacattccatcttaaagccagtattgctgacttgaaagccaaggacagcgactggctg
cagaacacggtcctctggccagggctgtgggtcacacgtgggaggcttgttttgcagagg
gagaagcacttccattatctgaaaagaggccttcgacaactgacagatgcctatgagtgt
ctggatgccagccgcccatggctctgctattggatcctgcacagcttggaactgctagat
gaacccatcccccagatagtggctacagatgtgtgtcagttcctggagctgtgtcagagc
ccagaaggtggctttggaggaggacccggtcagtatccacaccttgcacccacatatgca
gcagtcaatgcattgtgcatcattggcaccgaggaggcctatgacatcattaacagagag
aagcttcttcagtatttgtactccctgaagcaacctgacggctcctttctcatgcatgtc
ggaggtgaggtggatgtgagaagcgcatactgtgctgcctccgtagcctcgctgaccaac
atcatcactccagacctctttgagggcactgctgaatggatagcaaggtgtcagaactgg
gaaggtggcattggcggggtaccagggatggaagcccatggtggctataccttctgtggc
ctggccgcgctggtaatcctcaagagggaacgttccttgaacttgaagagcttattacaa
tgggtgacaagccggcagatgcgatttgaaggaggatttcagggccgctgcaacaagctg
gtggatggctgctactccttctggcaggcggggctcctgcccctgctccaccgcgcactg
cacgcccaaggtgaccctgcccttagcatgagccactggatgttccatcagcaggccctg
caggagtacatcctgatgtgctgccagtgccctgcgggggggcttctggataaacctggc
aagtcgcgtgatttctaccacacctgctactgcctgagcggcctgtccatagcccagcac
ttcggcagcggagccatgttgcatgatgtggtcctgggtgtgcccgaaaacgctctgcag
cccactcacccagtgtacaacattggaccagacaaggtgatccaggccactacatacttt
ctacagaagccagtcccaggttttgaggagcttaaggatgagacatcggcagagcctgca
accgactag

>gi568815584f:64886954_65161309|GENSCAN_predicted_peptide_5|114_aa
MALITWLPRPTWPLDFQLYETMWVKQAYYLPFTTGETGWRAYVNHPRFPSCQVVVPEPEP
VSAASKAHALPPPQASEGRAVDGTSRQETMKLSSTVHGSHGLPGCQAASRIPDE

>gi568815584f:64886954_65161309|GENSCAN_predicted_CDS_5|345_bp
atggccctgatcacatggctcccgaggcccacgtggcctctggattttcagttatatgag
acaatgtgggtgaagcaggcgtattacctgccttttacaactggagaaacaggttggaga
gcttacgtaaatcacccaaggtttcccagctgtcaagtggtggtgcctgagcccgagccc
gtatctgctgcctccaaagcccatgcccttccaccaccgcaggcctctgaaggaagagct
gttgatggcaccagcagacaggaaacaatgaagctctcaagcactgtgcatgggagccat
gggctgccgggctgccaggctgccagccggattccggatgagtga

>gi568815584f:64886954_65161309|GENSCAN_predicted_peptide_6|313_aa
MPGRNLMVLKCGALPLLLSRSLLLPSKTCFAYPLPLAIILWLCDVPPPPLPSTMIVFPEA
SPEAQQMPASRVLYSLQNRQLAREGRPDRLEEQTSSSSCQRPKDTVFLGPLTKPARRYQV
LWMNGGLQGRSELDDHQHIHDSGHEHLEERTEPQISPATNQEMSDNDDIEVESDADKRAH
HNALERKRRDHIKDSFHSLRDSVPSLQGEKASRAQILDKATEYIQYMRRKNHTHQQDIDD
LKRQNALLEQQVRALEKARSSAQLQTNYPSSDNSLYTNAKGSTISAFDGGSDSSSESEPE
EPQSRKKLRMEAS

>gi568815584f:64886954_65161309|GENSCAN_predicted_CDS_6|942_bp
atgcctggcagaaatctgatggttttaaagtgtggggcacttcccctcttgctgtctcgc
tccctcctgctgccctctaagacatgctttgcctaccctttaccattggccattattctc
tggctatgcgatgtgcctcctcctcctttgccttccactatgattgtgtttcctgaggcc
tctccagaagcccagcagatgccagcatcacgtgtcctgtacagcttgcagaaccgtcag
ctagccagagagggtcggcctgaccgactagaagaacagacatcttccagcagctgtcag
cgtccaaaggacactgtgttccttggacctcttaccaaaccagccaggagatatcaagtg
ctatggatgaatggagggcttcaaggaaggtctgagctagatgaccatcaacatatacat
gacagtggacatgaacatctagaggaaagaactgaaccccaaatatctccagcaaccaat
caggaaatgagcgataacgatgacatcgaggtggagagcgacgctgacaaacgggctcat
cataatgcactggaacgaaaacgtagggaccacatcaaagacagctttcacagtttgcgg
gactcagtcccatcactccaaggagagaaggcatcccgggcccaaatcctagacaaagcc
acagaatatatccagtatatgcgaaggaaaaaccacacacaccagcaagatattgacgac
ctcaagcggcagaatgctcttctggagcagcaagtccgtgcactggagaaggcgaggtca
agtgcccaactgcagaccaactacccctcctcagacaacagcctctacaccaacgccaag
ggcagcaccatctctgccttcgatgggggctcggactccagctcggagtctgagcctgaa
gagccccaaagcaggaagaagctccggatggaggccagctaa

>gi568815584f:64886954_65161309|GENSCAN_predicted_peptide_7|90_aa
MRAAYTVYCTNPEVAMCMHFSVNENLLSVLEEMGDHVVLDPETTSAYLQEAECNLSVMNT
GMKMSVPSNPQRFSYFSVLGTVGFSSGRHD

>gi568815584f:64886954_65161309|GENSCAN_predicted_CDS_7|273_bp
atgagggctgcatatactgtgtactgcacaaatccagaagttgccatgtgcatgcacttc
agtgtaaatgagaacctgctgagtgtcctggaggaaatgggagaccatgtggtcctggac
ccagaaactaccagtgcttacctccaagaagcagaatgtaatctgagtgtaatgaacaca
ggcatgaagatgtctgtgccttccaacccacagaggttctcctacttcagtgtgctgggg
actgtgggattctcatcaggcagacacgactga