GENSCAN 1.0	Date run: 3-Nov-116	Time: 18:03:18

Sequence gi568815591r:97887047_98090104 : 203058 bp : 46.34% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  10211  10281   71  1  2   91  104    36 0.025   6.02
 1.02 Intr +  13614  13720  107  0  2   22   93    66 0.004   0.36
 1.03 Intr +  36658  36831  174  2  0   24   84   154 0.656   8.51
 1.04 Term +  37176  37240   65  1  2   86   42    67 0.668  -0.05
 1.05 PlyA +  38104  38109    6                               1.05

 2.03 PlyA -  38245  38240    6                              -1.75
 2.02 Term -  39510  39345  166  2  1   96   41   177 0.882  11.19
 2.01 Init -  41254  40974  281  1  2  109   37   157 0.827   9.21
 2.00 Prom -  48119  48080   40                              -8.86

 3.09 PlyA -  50828  50823    6                               1.05
 3.08 Term -  51880  51360  521  2  2   52   55   479 0.993  35.46
 3.07 Intr -  58569  58462  108  1  0   56   62    61 0.293   0.56
 3.06 Intr -  60136  59936  201  2  0   55   72    94 0.288   3.76
 3.05 Intr -  63473  63378   96  0  0  116   75    36 0.633   5.08
 3.04 Intr -  64916  64752  165  0  0   66   55    98 0.636   4.23
 3.03 Intr -  65483  65220  264  0  0   62   76   124 0.381   6.08
 3.02 Intr -  68838  68737  102  1  0  -22   68   133 0.470   0.45
 3.01 Init -  72608  72524   85  2  1   90  101    -3 0.619   2.25
 3.00 Prom -  74012  73973   40                              -9.36

 4.05 PlyA -  75152  75147    6                               1.05
 4.04 Term -  79996  79612  385  0  1   88   42   321 0.748  21.86
 4.03 Intr -  81752  81718   35  2  2   97   34   -14 0.384  -8.88
 4.02 Intr -  82006  81959   48  1  0   89  121    81 0.796  10.38
 4.01 Init -  85208  85089  120  2  0   94   69   182 0.799  15.39
 4.00 Prom -  97409  97370   40                              -7.16

 5.05 PlyA -  97458  97453    6                               1.05
 5.04 Term -  97937  97912   26  0  2  130   42     2 0.567  -1.81
 5.03 Intr - 100110 100001  110  2  2   82   75   137 0.761  11.73
 5.02 Intr - 101502 101370  133  1  1   69   71   163 0.852  12.40
 5.01 Init - 103058 102998   61  2  1   49   75   103 0.956   6.51
 5.00 Prom - 104092 104053   40                              -3.56

 6.00 Prom + 105650 105689   40                              -6.56
 6.01 Init + 110585 110668   84  1  0   65   64    56 0.060  -0.28
 6.02 Intr + 134751 134961  211  2  1  128   95    45 0.942   7.69
 6.03 Term + 137030 137208  179  0  2   59   43   141 0.981   4.55
 6.04 PlyA + 138113 138118    6                               1.05

 7.03 PlyA - 140189 140184    6                               1.05
 7.02 Term - 164473 164233  241  0  1   65   34   163 0.110   4.10
 7.01 Intr - 199463 199337  127  0  1   37   91    83 0.204   3.24

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  10211  10372  162  1  0   91   48   178 0.970   8.69

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:97887047_98090104|GENSCAN_predicted_peptide_1|138_aa
MAYRLKCKTHNYKNAEDNLGNTIRTHGSYCKWSLNDFISCSEKKSPSAPIVEEEVWIGPA
TSANESNVNFLSLPDTVSPEIRLDHSPPVPDRSVSPLEHIPRIFPKPGTGLHTSTPSSNR
NVLHVFFLLKGLSTNMRR

>gi568815591r:97887047_98090104|GENSCAN_predicted_CDS_1|417_bp
atggcctacagacttaaatgtaaaacccataactataaaaacgctgaagacaacctaggc
aataccatccgtacacatggctcctactgcaaatggtctctaaatgacttcatcagttgc
tcagaaaaaaaatcaccctctgctccaatcgtggaggaagaagtatggattggacctgca
acctcagccaatgagtccaacgtcaacttcctgtccttgcctgacactgtcagccctgag
atcagacttgaccattcacctccagtacctgataggtccgtcagtcctttggaacatatc
ccacgaatattccctaaaccaggcaccggactccacacatcaacaccgtcatctaacagg
aatgtgcttcatgtcttcttcctgctcaaaggactgtccaccaacatgcgcaggtag

>gi568815591r:97887047_98090104|GENSCAN_predicted_peptide_2|148_aa
MASPSGSSKATGKPRGRDGRPRREEDDVPPEEKRLRLLLEGGSAQPQDCEDGEDAPRPGR
EETGTQTGGDGRGVSDAGAGVRGCRERGGAGDAGFYGNTLFEEPMKKCQAGVRISSDFFE
EESLFLHSQAMSEWIKKNRVPFYEILSV

>gi568815591r:97887047_98090104|GENSCAN_predicted_CDS_2|447_bp
atggccagcccttccggcagctccaaagccactggcaagccccgaggcagggatggccgg
cccaggagggaggaggacgacgtccctcccgaagagaagaggctgcggctcttgctggag
gggggaagcgcacagccccaggactgcgaggacggggaggacgcgccgcggccgggcagg
gaggagaccggcacccagacaggtggcgacggcagaggagtaagtgacgcgggcgcgggg
gtccgggggtgccgggagcgcgggggtgctggggacgcgggtttctatgggaatactctc
tttgaagaacccatgaagaagtgtcaggctggtgtgaggatcagcagtgatttctttgag
gaggagagcctgtttcttcactcacaggccatgtctgagtggatcaagaagaacagagtg
cccttttatgagattttgtctgtgtag

>gi568815591r:97887047_98090104|GENSCAN_predicted_peptide_3|513_aa
MRAVISPPSPTLDITIHIAGGRGCPPRCDDIRINITGWVYTYCDIERHIMLSPSLDIRNN
ITESPVEHDCVKLLDSVDSSRPDIRDQPWASVDWELYVHGSSFFNPQGERVAGYAVITLD
TVVEARSLPQATSAQKAELIAFIRALELSEAPDLRPAYSKEEKDFLQVEGQVMEEGWIRL
PDGRVAVPQLRGAAVVLAVQETTHRDIRKNVTGDVNTSAILRVVSSSPPLHIGNNITGIG
MYLTSAVSPPPSNGVVASVMYAAVTPMLNPFIYSLRNRDIQSALRRVLSRTVEFHDLFHP
FSCVEICFNEETVVYTPNEVLFSLKRKKLLSAADKMDEIAGCSKNRGTISNHLFPQRCDE
SKSQVAPYIQLKNIKSYARGLQLELWRRQLSLGLARGVDLGTPQKGPMWRLTERRQKAHR
MLKLYNGLSEGEAVGLPAGPDPLDPTDLNGAHFDPEVYLDKLPRECSLAQLMDSETDMVQ
QIRALDSDMQTLVYENYDKFIPATEIDKQHKTL

>gi568815591r:97887047_98090104|GENSCAN_predicted_CDS_3|1542_bp
atgcgggcagtaatatcgcccccctctcccaccctggatattacgatccacatcgcaggg
gggcgagggtgcccaccgagatgcgatgatattcggatcaatatcaccgggtgggtgtac
acctactgcgatattgaacgtcatatcatgctctctccctccctggacattaggaacaat
atcactgagagccctgtcgagcatgattgtgtaaaattgttggactcagttgactctagc
agacctgacatccgggaccagccttgggcatcagtagactgggaactatacgtgcatggg
agcagcttcttcaacccccaaggagagagagttgcagggtatgcagtgataaccctggac
actgttgttgaagccagatcgttgccccaggccacttcagcccagaaagctgaactcatt
gctttcattcgggccttagaactcagtgaggcacctgatcttagacctgcttattctaaa
gaagaaaaggactttctccaggtagagggacaagtgatggaagaaggatggattcggtta
ccagatgggagagtagctgtgccacagctgcgaggagctgcagttgtactggctgtgcaa
gaaaccacccatcgagatattaggaaaaatgtcactggggatgtgaacacctctgcgata
ttgagagtagtatcatcctctccccccttgcatattgggaacaatatcacaggcattggc
atgtacctgacttcagctgtgtcaccaccccccagcaatggtgtagtggcgtcagtgatg
tatgctgcggtcactcccatgctgaaccctttcatctacagcctgagaaacagggacata
caaagtgccctgcggagggtgctcagcagaacagtcgaatttcatgatctgttccatcct
ttttcttgtgtggaaatttgtttcaatgaagaaactgtggtatacacacccaatgaagta
ttattcagcctaaaaaggaagaaactcctctccgctgcagacaaaatggatgagattgca
ggttgcagtaaaaacaggggtaccataagcaaccacctctttcctcaacgatgtgatgaa
agcaaaagccaagtagctccatatatccaacttaaaaatataaaaagttacgcccgtggg
ctgcagttggagctatggcggcggcagctgtcactgggcctagcccggggtgtggacctg
gggactccccaaaagggcccgatgtggaggctcacggagcgtcggcagaaggcgcacagg
atgctaaagctttacaacggcctctcggaaggggaggcggtgggactccccgcggggccc
gaccccctggaccccactgatctgaacggggcgcatttcgacccggaagtttacctagac
aagctgcctagagagtgctctctggcccagctgatggacagtgagacggacatggtgcag
cagatccgggctctagacagcgacatgcaaaccctggtctatgagaactacgataagttc
atcccagccacagaaattgacaaacagcataaaactctatga

>gi568815591r:97887047_98090104|GENSCAN_predicted_peptide_4|195_aa
MAAAAAGAGSGPSAAQEKQFLPALLSFFIYNPRFWPREGEDKVYSSVLQQCYSMYKPYSR
NVSPLMKKQRCPNYADPQNLTDVSIFLLLEVSGDPELQPVLAGLFLSMCLVTVLGNLLII
LAISPDSHLHTPMYFFLSNLSLPDIGFTSTTVPKMIVDIQSHSRVISYAGCLTQMSLFAI
FGGMEERHAPECDGL

>gi568815591r:97887047_98090104|GENSCAN_predicted_CDS_4|588_bp
atggctgcagcggcggccggggctgggagcgggccctcggcggcccaggagaagcagttc
ctgccggcgctgctgagtttcttcatctacaacccgcgcttctggccgcgggaaggagag
gacaaggtttatagctcggtgctgcagcagtgctacagcatgtacaagccttatagtagg
aatgtgtccccattgatgaaaaagcaaaggtgtccaaattatgcagacccacagaatcta
acagatgtctctatattcctcctcctagaagtctcaggggatccagaactgcagccagtc
cttgctgggctgttcctgtccatgtgcctggtcacggtgctggggaacctgctcatcatc
ctggccatcagccctgactcccacctccacacccccatgtacttcttcctctccaacctg
tccttgcctgacatcggtttcacctccaccacggtccccaagatgattgtggacatccag
tctcacagcagagtcatctcctatgcaggctgcctgactcagatgtctctctttgccatt
tttggaggcatggaagagagacatgctcctgagtgtgatggcctatga

>gi568815591r:97887047_98090104|GENSCAN_predicted_peptide_5|109_aa
MSITDVLSADDIAAALQECQDPDTFEPQKFFQTSGLSKMSASQVKDVFRFIDNDQSGYLD
EEELKFFLQKFESGARELTESETKSLMAAADNDGDGKIGAEEFQEMVHS

>gi568815591r:97887047_98090104|GENSCAN_predicted_CDS_5|330_bp
atgagcatcacggacgtgctcagtgctgatgacattgcagcagcgctccaggaatgccaa
gacccagacacttttgaaccccaaaaattcttccagacgtcaggcctctccaagatgtca
gccagtcaggtgaaggatgttttccggttcatagacaacgaccagagcgggtatctggat
gaagaagagcttaagtttttcctccagaagtttgagagtggtgccagagaactgaccgag
tcagaaaccaagtccttgatggctgcggcggataatgatggagatgggaaaattggagca
gaggaattccaggaaatggtgcattcttaa

>gi568815591r:97887047_98090104|GENSCAN_predicted_peptide_6|157_aa
MRLWFTAVLTSWAQAILPSWPPNVVVLQAGPPLSHVTHSAITPCSHPSYVLTFTQDEKVP
HRVQATSPCPMTYVPSLPKGSFSPYLAPFTRPKGSPCAGKADLTLHVDGVTVLNVGKESL
SSQLRQQKPETGAKKKDPAGLGWKVLDAQLFTPTQCM

>gi568815591r:97887047_98090104|GENSCAN_predicted_CDS_6|474_bp
atgcgattgtggttcaccgcagtcttgacctcctgggctcaagccatcctcccttcttgg
cctcccaatgtggtggtattacaggccgggcctccactttctcatgtcacccactctgct
attactccatgctcccatccttcctacgtccttactttcacccaggatgagaaagtgccc
caccgtgtccaggccacatcaccttgtcccatgacctatgtcccttccctccccaaggga
agcttcagcccataccttgcgccatttacccggcccaagggctctccctgcgcaggaaag
gccgacctcacactgcatgtggatggcgtgacagtcctgaatgtggggaaagagagcttg
tcctcgcagctacgtcaacagaagccagagactggggccaagaagaaagaccctgcgggg
ttgggctggaaagtcctggacgcccagctcttcacacccactcagtgcatgtga

>gi568815591r:97887047_98090104|GENSCAN_predicted_peptide_7|122_aa
XTLLATRLLPQAHKTEGVTCCVSFYQASWNRLGELVPDWHMGRVADAVTRDPVGQDNAVL
YIQEVPPLIHSWGPLAERAIREATGGAPELVVATGPALRRRPGYHGYSRAGAEPNRIPEV
GN

>gi568815591r:97887047_98090104|GENSCAN_predicted_CDS_7|369_bp
nccactctgctggccacacgacttttgccccaagcccataaaaccgagggagtcacctgc
tgcgtcagtttctaccaggccagctggaacaggctgggcgagttggtgccagactggcac
atgggcagggtggcagatgcagttacaagggacccagtgggacaggacaacgctgttctt
tatattcaagaagtgccccctttgattcatagctggggacccctcgccgagcgggccatc
agagaggccaccggtggcgccccggagctcgtcgtggcgactggtccagccttgcgaagg
agacctggttaccatggatacagcagggcgggggcggagccaaaccggatcccggaagtg
ggtaattag