GENSCAN 1.0    Date run: 4-Nov-116    Time: 20:55:51

Sequence gi568815582r:58064074_58297736 : 233663 bp : 44.78% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   7579   7696  118  2  1  120   91     6 0.036   3.62
 1.02 Intr +  29712  29888  177  0  0  139   65     4 0.588   2.23
 1.03 Intr +  29953  30040   88  1  1  120   16    65 0.367   2.47
 1.04 Intr +  32675  32734   60  1  0   58   94    44 0.161   0.93
 1.05 Intr +  34464  34623  160  2  1   76   36    83 0.234   1.46
 1.06 Term +  34788  34966  179  1  2  103   41    86 0.281   3.25
 1.07 PlyA +  36829  36834    6                               1.05

 2.08 PlyA -  37393  37388    6                               1.05
 2.07 Term -  49957  49952    6  1  0  122   36     0 0.461  -3.93
 2.06 Intr -  50847  50737  111  0  0   28  121    85 0.934   6.38
 2.05 Intr -  51384  51196  189  0  0  114   84   310 0.999  32.98
 2.04 Intr -  52079  51968  112  1  1   78  111    88 0.998  10.48
 2.03 Intr -  52878  52799   80  0  2  103  100    96 0.999  10.65
 2.02 Intr -  56004  55951   54  0  0  115   64    17 0.613   1.18
 2.01 Init -  65042  64959   84  2  0   68   58   113 0.864   7.12
 2.00 Prom -  80294  80255   40                              -6.26

 3.13 PlyA -  91081  91076    6                               1.05
 3.12 Term -  94280  94100  181  1  1  147   42   143 0.937  12.68
 3.11 Intr -  96992  96821  172  0  1   93    9    57 0.254  -2.70
 3.10 Intr - 101635 101487  149  0  2   91   65   188 0.974  16.68
 3.09 Intr - 102611 102511  101  2  2   73   98   163 0.999  14.71
 3.08 Intr - 103235 103134  102  2  0   88   99    79 0.997   9.37
 3.07 Intr - 103722 103612  111  0  0   75   97    94 0.989   9.58
 3.06 Intr - 104620 104537   84  1  0   91   76    17 0.634   0.82
 3.05 Intr - 110437 110378   60  1  0   64  106    86 0.838   6.93
 3.04 Intr - 122783 122682  102  2  0   41   66   127 0.954   6.17
 3.03 Intr - 132771 132660  112  2  1   93  113   127 0.987  16.08
 3.02 Intr - 133765 133560  206  1  2   98   55   116 0.698   7.30
 3.01 Init - 136120 136043   78  1  0   74   82    36 0.370   2.66
 3.00 Prom - 146042 146003   40                              -2.86

 4.00 Prom + 146904 146943   40                              -5.56
 4.01 Init + 163717 163773   57  0  0   85  106    21 0.465   4.91
 4.02 Term + 171221 171349  129  1  0   64   41   124 0.175   3.48
 4.03 PlyA + 171518 171523    6                               1.05

 5.00 Prom + 181587 181626   40                              -3.86
 5.01 Init + 185942 186042  101  1  2   63   86   205 0.973  17.53
 5.02 Intr + 188654 188780  127  2  1   48   86    88 0.720   5.28
 5.03 Intr + 189925 190086  162  0  0   78   71    89 0.877   6.37
 5.04 Intr + 191004 191291  288  2  0   39    4   220 0.973   6.14
 5.05 Intr + 194295 194450  156  2  0   71  110   170 0.999  17.71
 5.06 Intr + 198312 198467  156  2  0   11   78   150 0.750   6.51
 5.07 Intr + 199432 199500   69  0  0   97   80    47 0.905   4.18
 5.08 Intr + 203406 203513  108  2  0   64   75    77 0.813   4.48
 5.09 Intr + 214409 214564  156  1  0   79   83   198 0.943  18.61
 5.10 Term + 215618 215704   87  1  0  122   42    40 0.811   0.46
 5.11 PlyA + 218377 218382    6                               1.05

 6.08 PlyA - 219464 219459    6                              -0.45
 6.07 Term - 219819 219796   24  0  0  107   43    -6 0.206  -4.98
 6.06 Intr - 220648 220517  132  1  0   48   91    90 0.690   6.14
 6.05 Intr - 222122 221954  169  1  1  100    2   167 0.400   9.25
 6.04 Intr - 226723 226644   80  1  2   18   75    25 0.546  -7.45
 6.03 Intr - 227063 226846  218  0  2   90   60   240 0.906  19.62
 6.02 Intr - 229345 229268   78  2  0  113   39    38 0.640   0.92
 6.01 Init - 229707 229659   49  0  1   76   81   100 0.925   7.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 178019 177890  130  2  1   85   92    78 0.886   8.21

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582r:58064074_58297736|GENSCAN_predicted_peptide_1|260_aa
XHWPVAAASSSSPDTLWILVLLFLPLESPNTPHSQLTQLRLSSTRSLIPILRASPQVVPP
APQTTPHPPCSSPIAPSPIGVSSSPLHCSSQNPGFILSWSPLAAFFLPLLLAPIFPKNHL
LRADMGPWHLICQKGQYHLSSRCICEEQTLSHFCSRAPAALLFFISVFLFKFQREESNSR
GLALASPGDHHAIVFSQQPRAGASVFPPGEDFFISPLLAIKPFYAAETQLGAQEGGQPQG
PHEDPKTMTTTVTMTTLTVD

>gi568815582r:58064074_58297736|GENSCAN_predicted_CDS_1|783_bp
ngccattggccagtggcagcagcctcctcatcctctccagacacactctggatcttggtt
ctgctcttcttgcctctggagagccccaacactccccattcccagttgactcagctcagg
ctctccagcactcgtagccttatacccatcctgcgggcatctccccaagttgttccacca
gctcctcagactaccccccacccaccctgctcctccccaattgctcccagccccattggt
gtctcctccagccccctccattgctcaagtcagaacccagggttcatcctgagctggtcc
ccgctggctgctttcttcctgcccctgctcctggctcccatcttccccaaaaatcacctc
ctaagggccgacatgggaccttggcacctcatctgtcaaaaaggacagtaccacctgtcc
agcaggtgcatctgtgaggaacagactctttctcacttctgctccagagcccccgcggct
ctcctcttcttcatctctgttttcctcttcaaattccagagagaggaatccaacagccgt
ggcctggcactggccagcccaggagatcatcatgcaatcgtcttcagccagcagcccagg
gcaggagcctcagtcttcccacctggtgaagatttcttcatttctcctctcctcgccata
aaacccttctatgcagctgagacacagttgggggcccaagaaggaggccaaccacaagga
ccacatgaggaccctaagacaatgacaactacagttaccatgacaacactcactgtggac
tga

>gi568815582r:58064074_58297736|GENSCAN_predicted_peptide_2|211_aa
MFKNTFQSGFLSILYSIGSKPLQIWDKKKIQEEYRHLGWEHEMASEVRNGHIKRITDNDI
QSLVLEIEGTNVSTTYITCPADPKKTLGIKLPFLVMIIKNLKKYFTFEVQVLDDKNVRRR
FRASNYQSTTRVKPFICTMPMRLDDGWNQIQFNLLDFTRRAYGTNYIETLRVQIHANCRI
RRVYFSDRLYSEDELPAEFKLYLPVQNKAKQ

>gi568815582r:58064074_58297736|GENSCAN_predicted_CDS_2|636_bp
atgttcaaaaacacgttccagagcggcttcctctccatcctctacagcatcggcagcaag
cctctgcaaatctgggacaaaaagaaaatacaggaggaatatcgccacttaggttgggaa
catgagatggcgagtgaggtacggaatggccacatcaaaagaatcactgataatgacatc
cagtccctggtgctagagattgaagggacaaatgtaagcaccacatatatcacatgccct
gcagaccccaagaagacgctgggaattaaacttcctttccttgtcatgattatcaaaaac
ctgaagaagtattttaccttcgaagtgcaggtactagatgacaagaatgtgcgtcgtcgc
tttcgggcaagtaactaccagagcaccacccgggtcaaacccttcatctgcaccatgccc
atgcggctggatgacggctggaaccagattcagttcaacttgctagacttcacacggcga
gcatacggcaccaattacatcgagaccctcagagtgcagatccatgcaaattgtcgcatc
cgacgggtttacttctcagacagactctactcagaagatgagctgccggcagagttcaaa
ctgtatctcccagttcagaacaaggcaaagcaataa

>gi568815582r:58064074_58297736|GENSCAN_predicted_peptide_3|485_aa
MGEVSLLSASHQRISWFPTDEKFGPWRPAARCLLRPTPRVPRRAAAATLCAPRRPPVPPA
MPGPAAGSRARVYAEVNSLRSREYWDYEAHVPSWGNQDDYQLVRKLGRGKYSEVFEAINI
TNNERVVVKILKPVKKKKIKREVKILENLRGGTNIIKLIDTVKDPVQLYQILTDFDIRFY
MYELLKALDYCHSKGIMHRDVKPHNVMIDHQQKKLRLIDWGLAEFYHPAQEYNVRVASRY
FKGPELLVDYQMYDYSLDMWSLGCMLASMIFRREPFFHGQDNYDQLVRIAKVLGTEELYG
YLKKYHIDLDPHFNDILGQHSRKRWENFIHSENRHLVSPEALDLLDKLLRYDHQQRLTAK
EAMEHPYFCRVGGWVGEAIECMTQTKLSWAFSPTREALMSFIEALKHQSLSQASSQTFCN
MSALTRSVAVLPLFHKQNKNQIKRLNAYREITFREQTQNGGRFGEHELDQAKGSPPPYIK
PHFRM

>gi568815582r:58064074_58297736|GENSCAN_predicted_CDS_3|1458_bp
atgggggaggtgtcccttctcagtgcatcacatcagaggatcagttggttccctactgat
gagaagtttggaccatggcgcccggcggcccgctgcctcctccgcccgacgccccgcgtc
ccccgccgcgccgccgccgccaccctctgcgccccgcgccgccccccggtcccgcccgcc
atgcccggcccggccgcgggcagcagggcccgggtctacgccgaggtgaacagtctgagg
agccgcgagtactgggactacgaggctcacgtcccgagctggggtaatcaagatgattac
caactggttcgaaaacttggtcggggaaaatatagtgaagtatttgaggccattaatatc
accaacaatgagagagtggttgtaaaaatcctgaagccagtgaagaaaaagaagataaaa
cgagaggttaagattctggagaaccttcgtggtggaacaaatatcattaagctgattgac
actgtaaaggaccccgtgcaactctaccagatcctgacagactttgatatccggttttat
atgtatgaactacttaaagctctggattactgccacagcaagggaatcatgcacagggat
gtgaaacctcacaatgtcatgatagatcaccaacagaaaaagctgcgactgatagattgg
ggtctggcagaattctatcatcctgctcaggagtacaatgttcgtgtagcctcaaggtac
ttcaagggaccagagctcctcgtggactatcagatgtatgattatagcttggacatgtgg
agtttgggctgtatgttagcaagcatgatctttcgaagggaaccattcttccatggacag
gacaactatgaccagcttgttcgcattgccaaggttctgggtacagaagaactgtatggg
tatctgaagaagtatcacatagacctagatccacacttcaacgatatcctgggacaacat
tcacggaaacgctgggaaaactttatccatagtgagaacagacaccttgtcagccctgag
gccctagatcttctggacaaacttctgcgatacgaccatcaacagagactgactgccaaa
gaggccatggagcacccatacttctgcagggtaggtggctgggtaggagaagctattgag
tgtatgacgcagaccaagctttcctgggccttctcaccaactagagaagcgctgatgtcg
ttcattgaggcacttaaacaccagtcactcagccaggcctcctcccagacattctgtaac
atgtcagcactcacaaggtctgttgcggttctcccacttttccataagcagaacaagaac
caaatcaaacgtcttaacgcgtatagagagatcacgttccgtgagcagacacaaaacggt
ggcaggtttggcgagcacgaactagaccaagcgaagggcagcccaccaccgtatatcaaa
cctcacttccgaatgtaa

>gi568815582r:58064074_58297736|GENSCAN_predicted_peptide_4|61_aa
MADSRFGRGKYRCTSSLPKSGFVEKTEQCGISLDLNFGPTPNGYVTLDKLLNFSEPQSVY
E

>gi568815582r:58064074_58297736|GENSCAN_predicted_CDS_4|186_bp
atggctgattccagatttggacgagggaaatacagatgtacatctagcttgccaaaaagt
ggatttgtggaaaagacagagcaatgtgggataagcctggatttgaatttcggccccacc
ccgaatggctacgtgaccttggacaagttgcttaacttctctgaacctcagtctgtgtat
gagtaa

>gi568815582r:58064074_58297736|GENSCAN_predicted_peptide_5|469_aa
MTDDESESVLSDSHEGSELELPVIQLCGLVEELSYVNSALKTETEMFEKYYAKLEPRDQR
PPRLSEIKISAADYAQFRGRRRSKSRTGMDRGVGLTADQKLELVQKEVADMKDDLRHTRA
NAERDLQHHEAVKREKPVVAQSKSESFKTREANGAAFGPWPKALEPPGSCWYKSEGLKAK
NLESDVRGQEEQKQLSGMGRRGRARRLSTQASSTCFALAALAANWMAIIEEAEIRWSEVS
REVHEFEKDILKAISKKKGSILATQKVMKYIEDMNRRRKEEVSEALHDVDFQQLKIENAQ
FLETIEARNQELTQLKLSSGNTLQVLNAYKKSFADPSHSAQVAMVAVGLGSSQSKLHKAM
EIYLNLDKEILLRKELLEKIEKETLQVEEDRAKAEAVNKRLRKQLAEFRAPQVMTYVREK
ILNADLEKSIRMWERKVEIAEMSLKGHRKAWNRMKITNEQLQADYLAGK

>gi568815582r:58064074_58297736|GENSCAN_predicted_CDS_5|1410_bp
atgactgatgatgagtccgagagcgtcctctccgactcccatgaagggtcggagctggag
ctgcctgttatccagctgtgcgggctggtggaggagctcagctatgtaaactctgctctc
aaaactgagactgagatgtttgagaaatattacgctaaactggagcccagggatcagcga
cctccacgattatcagaaattaaaatatcagcagcagattatgcacagtttcgaggcagg
cgtagatccaaatcccggacaggtatggaccgtggggtaggcctgactgccgaccaaaaa
cttgagctggtacaaaaagaggttgcggacatgaaggatgacttacgacacacaagggca
aatgcggaacgcgacctgcagcatcacgaggctgtgaaaagagagaagccggtagtggct
cagtccaagtctgaaagcttcaaaaccagggaagccaatggtgcagccttcggtccgtgg
ccgaaggccctagagcccccaggaagctgctggtacaagtccgagggtctaaaggcaaag
aacctggagtctgatgtccgagggcaggaggagcagaagcaattgtcgggcatgggaaga
agagggagagccagaagactcagtacacaagcttcttccacctgctttgctctagctgct
ctggcagccaattggatggcgatcattgaggaggctgaaattcgatggagtgaagtttcg
agagaagtgcatgagtttgaaaaagatattctaaaagccatatccaagaagaaagggagt
attttggccactcagaaagtgatgaaatacattgaggacatgaaccgccggaggaaggaa
gaggtgagtgaggcccttcacgatgttgattttcagcagttgaagatagagaacgctcaa
tttcttgagacaattgaagcaaggaatcaagaactgacccagctaaagctgtcatctgga
aacactctgcaggttctcaatgcctacaaaaaaagttttgccgacccctcgcatagtgct
caggtggcaatggtggctgtgggcttaggtagctcgcagagcaagcttcacaaggcaatg
gaaatatacctcaatctggacaaggagatcttgctgagaaaagagctacttgaaaaaatt
gaaaaagaaacactacaagtagaggaggaccgggccaaagccgaggcagtgaataagagg
ctccggaagcagctggccgagttccgggcaccacaggtgatgacttacgtccgggagaag
atcttaaatgcggacctggagaagagcatcaggatgtgggaaaggaaagtggagatagca
gagatgtccttaaaaggccatcgtaaggcttggaatcgaatgaaaataaccaatgagcag
ttgcaggcagattaccttgctgggaagtag

>gi568815582r:58064074_58297736|GENSCAN_predicted_peptide_6|249_aa
MRGVLLVLLGLLYSSTREYCSFANYNPQLQGWSREQGASTAAGCGVQKASVFYGPDPKEG
LVSSMEFPWVVSLQDSQYTHLAFGCILSEFWVLSIASAIQNRPVPLPLGPATSPGLPSLI
SAVRTAGLATEKCNDESSIQCRKDIVVIVGISNMDPSKIAHTEYPVNTIIIHEDFDNNSM
SNNIALLKTDTAMHFGNLTGNHMTMSVLRKIFVKDLDMCPLYKLQKTECGSHTKEETKTA
CLDPERQDF

>gi568815582r:58064074_58297736|GENSCAN_predicted_CDS_6|750_bp
atgcgaggggtgctcctggtgctgctcggccttctctattcttccaccagagaatactgc
tcttttgcaaactacaacccccaactgcagggctggtcccgggagcagggggcttctact
gctgcaggttgtggcgtccagaaagcttccgttttctacggtcctgaccccaaggagggc
ttggtcagcagcatggagttcccgtgggtggtgtcgctgcaggactcccagtacacacac
ctggctttcggctgcatcctgagcgagttctgggtcctcagcatcgcatccgccattcag
aacaggccagtgcccctgcctttggggcccgcaacatcgccggggttgccaagcctgatc
tctgctgtgaggactgcaggccttgcaacggaaaaatgcaatgatgagtcctccattcaa
tgcaggaaggacattgtcgttatagtgggtataagtaacatggatcctagcaagattgct
cacacagagtatccagtcaataccatcatcatccatgaggactttgataacaactccatg
agcaacaacatagccctcctgaagacagacacagcgatgcattttggcaacctgacagga
aatcacatgacgatgagtgtcctgaggaaaatcttcgtgaaagatcttgacatgtgtccc
ctatacaaactccagaagacagaatgcggcagccacacgaaagaggaaaccaagactgcc
tgcttggatccagaaaggcaggatttttag