GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:00:55 Sequence gi568815587f:70103460_70306470 : 203011 bp : 51.25% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 540 691 152 2 2 99 84 111 0.982 11.27 1.02 Intr + 2206 2329 124 1 1 40 87 80 0.526 4.19 1.03 Intr + 4894 4945 52 0 1 105 94 57 0.996 6.97 1.04 Intr + 5219 5398 180 0 0 25 92 76 0.796 2.06 1.05 Intr + 7667 7732 66 0 0 50 115 26 0.643 0.87 1.06 Intr + 8248 8303 56 2 2 114 95 57 0.553 8.19 1.07 Intr + 12999 13040 42 2 0 92 106 71 0.945 8.22 1.08 Intr + 20891 20955 65 1 2 124 110 103 0.998 14.11 1.09 Intr + 22602 22736 135 0 0 79 96 217 0.895 21.79 1.10 Intr + 27232 27329 98 1 2 61 46 94 0.292 2.65 1.11 Intr + 28234 28429 196 2 1 -1 57 116 0.506 -1.41 1.12 Intr + 28460 28620 161 2 2 84 86 136 0.775 13.14 1.13 Intr + 39367 39512 146 2 2 122 20 47 0.054 1.81 1.14 Intr + 44258 44390 133 1 1 87 18 56 0.223 -0.78 1.15 Intr + 46251 46333 83 1 2 79 78 221 0.645 20.05 1.16 Intr + 49598 49669 72 1 0 101 64 50 0.314 4.00 1.17 Intr + 53488 53562 75 0 0 89 115 41 0.969 7.11 1.18 Intr + 57702 57903 202 2 1 96 109 401 0.999 42.38 1.19 Intr + 58163 58274 112 0 1 93 70 121 0.999 10.74 1.20 Intr + 59824 59881 58 1 1 109 76 74 0.963 7.68 1.21 Intr + 62011 62111 101 0 2 90 86 147 0.900 14.11 1.22 Intr + 63281 63361 81 2 0 88 38 77 0.780 1.95 1.23 Intr + 63783 63928 146 0 2 102 61 342 0.994 33.34 1.24 Intr + 67428 67580 153 1 0 51 109 271 0.907 26.06 1.25 Intr + 76545 76705 161 1 2 126 16 143 0.398 11.12 1.26 Intr + 78420 78461 42 2 0 50 110 45 0.518 1.82 1.27 Intr + 79043 79227 185 1 2 85 84 447 0.829 43.10 1.28 Intr + 82131 82236 106 0 1 103 80 270 0.978 28.42 1.29 Term + 84279 84545 267 2 0 103 37 531 0.999 45.33 1.30 PlyA + 85562 85567 6 1.05 2.00 Prom + 86566 86605 40 -0.81 2.01 Init + 100001 100286 286 1 1 106 86 586 0.951 55.44 2.02 Term + 102674 103014 341 0 2 115 45 355 0.556 28.95 2.03 PlyA + 106358 106363 6 1.05 3.00 Prom + 111514 111553 40 -2.11 3.01 Init + 113203 113271 69 0 0 68 56 39 0.007 -0.28 3.02 Intr + 114086 114196 111 1 0 18 109 81 0.010 4.28 3.03 Intr + 123773 123849 77 1 2 100 83 51 0.776 4.61 3.04 Term + 126816 126927 112 0 1 125 44 32 0.672 0.83 3.05 PlyA + 127056 127061 6 1.05 4.04 PlyA - 127133 127128 6 1.05 4.03 Term - 127421 127362 60 2 0 72 38 41 0.062 -4.40 4.02 Intr - 132976 132876 101 2 2 45 110 100 0.407 8.23 4.01 Init - 138485 138467 19 2 1 41 106 12 0.132 -1.50 4.00 Prom - 139076 139037 40 -0.91 5.03 PlyA - 139105 139100 6 1.05 5.02 Term - 140850 140635 216 0 0 71 47 98 0.285 1.47 5.01 Init - 148899 148771 129 0 0 84 59 80 0.406 4.81 5.00 Prom - 156417 156378 40 -0.11 6.00 Prom + 163188 163227 40 -0.81 6.01 Init + 163882 163897 16 0 1 82 26 17 0.184 -4.57 6.02 Intr + 167095 167455 361 2 1 64 116 208 0.555 16.04 6.03 Intr + 167678 167860 183 2 0 8 85 116 0.639 2.72 6.04 Intr + 168162 168312 151 0 1 69 -4 56 0.744 -4.82 6.05 Intr + 168714 168977 264 2 0 62 80 234 0.611 18.25 6.06 Term + 170572 170640 69 0 0 52 53 53 0.513 -3.57 6.07 PlyA + 172895 172900 6 -0.45 7.03 PlyA - 173730 173725 6 1.05 7.02 Term - 175720 175453 268 0 1 12 43 283 0.897 11.50 7.01 Init - 182670 182660 11 0 2 76 110 16 0.521 1.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 113203 113242 40 0 1 68 91 55 0.915 4.10 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:70103460_70306470|GENSCAN_predicted_peptide_1|1149_aa MYHINETRGLLKKINSVLQKITDPIQPKVAEHRPQTMKRLSYPFSREKQHLALKPRELPA LVNCVLFPSRYFHRFDLSDKDSFFDSKTRSTIVYEILKRTTCTKAKYSMGWKQTSVRDDR GIPGPLLTLTVQWEVIGVPAGAASPCGCSGPLKAQLGLLKAAPPDQAPPGQGEGRKKDSA LLSKRRKCGKYGITSLLANGVYAAAYPLHDGDYNGENVEFNDRKLLYEEWARYGVFYKYQ PIDLVRKYFGEKIGLYFAWLGVYTQMLIPASIVGIIVFLYGCATMDENIPSSWDLAPGSP VLSKAEACTERDPSSLAGPPGFPRLVQQHGSHACEAALDGKEGQGMAKHPAYVHYGMDLR DPRQPPAWAQRFLPGQLGTQEVLHHGPTSMEMCDQRHNITMCPLCDKTCSYWKMSSACAT ARASHLFDNPATVFFSVFMALWGGKGRDRKPVLCCLLFQGYESHHEDPTLMTSAKPDQPP KAPSSNTITWGTLTSPGKPFLRDPSFFWAAPISGSSIGAEHLLLSCHPETDASSCSATFM EHWKRKQMRLNYRWDLTGFEEEEDHPRAEYEARVLEKSLKKESRNKETDKVKLTWRDRFP AYLTNLVSIIFMIAVTFAIVLGVIIYRISMAAALAMNSSPSVRSNIRVTVTATAVIINLV VIILLDEVYGCIARWLTKIEVPKTEKSFEERLIFKAFLLKFVNSYTPIFYVAFFKGRFVG RPGDYVYIFRSFRMEECAPGGCLMELCIQLSIIMLGKQLIQNNLFEIGIPQQSHDTCPLP FGERAEEIIDKEMMANKKMKKLIRYLKLKQQSPPDHEECVKRKQRYEVDYNLEPFAGLTP EYMEMIIQFGFVTLFVASFPLAPLFALLNNIIEIRLDAKKFVTELRRPVAVRAKDIGIWY NILRGIGKLAVIINVSDIRDLGRMEVPAELPGGRGPVEKELGMGLLPLAMTALEGEDCHG PILQAFVISFTSDFIPRLVYLYMYSKNGTMHGFVNHTLSSFNVSDFQNGTAPNDPLDLGY EVQICRYKDYREPPWSENKYDISKDFWAVLAARLAFVIVFQNLVMFMSDFVDWVIPDIPK DISQQIHKEKVLMVELFMREEQDKQQLLETWMEKERQKDEPPCNHHNTKACPDSLGSPAP SHAYHGGVL >gi568815587f:70103460_70306470|GENSCAN_predicted_CDS_1|3450_bp atgtaccacattaatgagacccgtggcctcctgaaaaaaatcaactctgtgctccagaaa atcacagatcccatccagcccaaagtggctgagcacaggccccagaccatgaagagactc tcctatcccttctcccgggagaagcagcatctggccttgaaacccagagaactgccagct ctggtgaactgtgtcttgtttccttcccgatatttccacagatttgacttgtctgataag gattcctttttcgacagcaaaacccggagcacgattgtctatgagatcttgaagagaacg acgtgtacaaaggccaagtacagcatgggctggaaacagaccagtgttagggatgaccgg gggattcctggccctctcttgaccctcacggtccagtgggaagtaattggggttccagcc ggggcggcatctccatgtggctgttccggacccttaaaagcacagctgggcttgctgaaa gcagcgccccctgaccaggcaccaccagggcaaggcgagggaagaaagaaggactccgcc cttctaagtaaaaggcggaaatgtgggaagtatggcatcacgagcctgctggccaatggt gtgtacgcggctgcatacccactgcacgatggagactacaacggtgaaaacgtcgagttc aacgacagaaaactcctgtacgaagagtgggcacgctatggagttttctataagtaccag cccatcgacctggtcaggaagtattttggggagaagatcggcctgtacttcgcctggctg ggcgtgtacacccagatgctcatccctgcctccatcgtgggaatcattgtcttcctgtac ggatgcgccaccatggatgaaaacatccccagtagttgggatctggccccaggctcacct gtgctcagcaaagcagaagcctgcactgagcgggacccatcatccctggcaggccccccc ggcttcccgaggctggtgcaacagcatgggtcacacgcctgtgaagctgccctggatggc aaggagggacagggcatggctaagcacccggcttatgtccactatggcatggacctgagg gaccctcgccaacccccagcttgggcccagcgctttctcccaggccagctcggcacccag gaggtgctccatcatggcccaacaagcatggagatgtgtgaccagagacacaatatcacc atgtgcccgctttgcgacaagacctgcagctactggaagatgagctcagcctgcgccacg gcccgcgccagccacctcttcgacaaccccgccacggtcttcttctctgtcttcatggcc ctctggggtggcaaagggagagacaggaagcccgttctctgttgtctcctcttccaaggg tacgaatcccatcacgaggaccccaccctcatgacctcagctaaacccgatcaaccccca aaggccccatcttcaaataccatcacctgggggactctgacttcccctgggaagcccttc ctccgtgatcccagcttcttctgggcagctccaatttctggctccagcattggcgctgag cacctcctactgtcctgccatcctgaaacggatgccagctcctgctctgccaccttcatg gagcactggaagcggaaacagatgcgactcaactaccgctgggacctcacgggctttgaa gaggaagaggatcatcctagagctgaatacgaagccagagtcttggagaagtctctgaag aaagagtccagaaacaaagagactgacaaagtgaagctgacatggagagatcggttccca gcctacctcactaacttggtctccatcatcttcatgattgcagtgacgtttgccatcgtc ctcggcgtcatcatctacagaatctccatggccgccgccttggccatgaactcctccccc tccgtgcggtccaacatccgggtcacagtcacagccaccgcagtcatcatcaacctagtg gtcatcatcctcctggacgaggtgtatggctgcatagcccgatggctcaccaagatcgag gtcccaaagacggagaaaagctttgaggagaggctgatcttcaaggctttcctgctgaag tttgtgaattcctacacccccatcttttacgtggcgttcttcaaaggccggtttgttgga cgcccgggcgactacgtgtacattttccgttccttccgaatggaagagtgtgcgccaggg ggctgcctgatggagctatgcatccagctcagcatcatcatgctggggaaacagctgatc cagaacaacctgttcgagatcggcatcccccagcaaagccatgacacttgccccctgcct tttggggaaagggcagaagagatcatcgacaaagaaatgatggcaaacaagaagatgaag aagctcatccgctacctgaagctgaagcagcagagcccccctgaccacgaggagtgtgtg aagaggaaacagcggtacgaggtggattacaacctggagcccttcgcgggcctcacccca gagtacatggaaatgatcatccagtttggcttcgtcaccctgtttgtcgcctccttcccc ctggccccactgtttgcgctgctgaacaacatcatcgagatccgcctggacgccaaaaag tttgtcactgagctccgaaggccggtagctgtcagagccaaagacatcggaatctggtac aatatcctcagaggcattgggaagcttgctgtcatcatcaatgtaagtgacatcagggac cttggcagaatggaagtcccggctgaactgccaggtggccggggccctgtggagaaggag ctgggcatgggtcttctgcctctagccatgacagcccttgaaggcgaggactgccatggc cccatcttacaggccttcgtgatctccttcacgtctgacttcatcccgcgcctggtgtac ctctacatgtacagtaagaacgggaccatgcacggcttcgtcaaccacaccctctcctcc ttcaacgtcagtgacttccagaacggcacggcccccaatgaccccctggacctgggctac gaggtgcagatctgcaggtataaagactaccgagagccgccgtggtcggaaaacaagtac gacatctccaaggacttctgggccgtcctggcagcccggctggcgtttgtcatcgtcttc cagaacctggtcatgttcatgagcgactttgtggactgggtcatcccggacatccccaag gacatcagccagcagatccacaaggagaaggtgctcatggtggagctgttcatgcgggag gagcaagacaagcagcagctgctggaaacctggatggagaaggagcggcagaaggacgag ccgccgtgcaaccaccacaacaccaaagcctgcccagacagcctcggcagcccagccccc agccatgcctaccacgggggcgtcctgtag >gi568815587f:70103460_70306470|GENSCAN_predicted_peptide_2|208_aa MDPFLVLLHSVSSSLSSSELTELKFLCLGRVGKRKLERVQSGLDLFSMLLEQNDLEPGHT ELLRELLASLRRHDLLRRVDDFEAGAAAGAAPGEEDLCAAFNVICDNVGKDWRRLARQLK VSDTKIDSIEDRYPRNLTERVRESLRIWKNTEKENATVAHLVGALRSCQMNLVADLVQEV QQARDLQNRSGAMSPMSWNSDASTSEAS >gi568815587f:70103460_70306470|GENSCAN_predicted_CDS_2|627_bp atggacccgttcctggtgctgctgcactcggtgtcgtccagcctgtcgagcagcgagctg accgagctcaagttcctatgcctcgggcgcgtgggcaagcgcaagctggagcgcgtgcag agcggcctagacctcttctccatgctgctggagcagaacgacctggagcccgggcacacc gagctcctgcgcgagctgctcgcctccctgcggcgccacgacctgctgcggcgcgtcgac gacttcgaggcgggggcggcggccggggccgcgcctggggaagaagacctgtgtgcagca tttaacgtcatatgtgataatgtggggaaagattggagaaggctggctcgtcagctcaaa gtctcagacaccaagatcgacagcatcgaggacagatacccccgcaacctgacagagcgt gtgcgggagtcactgagaatctggaagaacacagagaaggagaacgcaacagtggcccac ctggtgggggctctcaggtcctgccagatgaacctggtggctgacctggtacaagaggtt cagcaggcccgtgacctccagaacaggagtggggccatgtccccgatgtcatggaactca gacgcatctacctccgaagcgtcctga >gi568815587f:70103460_70306470|GENSCAN_predicted_peptide_3|122_aa MGAVCEALQQYSPGNLLSLMGVRASKAIGSASLQLLSQEDLGRSQTALGQSSRGSGSGCQ TWIPEGTQCVSRGSGWSAGELNAQKRTFIPLQYTDATENLKFDLAFSFDLPTQVLSPIKS CF >gi568815587f:70103460_70306470|GENSCAN_predicted_CDS_3|369_bp atgggggctgtctgtgaagccttgcagcagtacagcccaggtaatttgctgagcctaatg ggtgtcagggcttccaaggcgatcggcagtgccagtcttcagctgctaagccaagaagat ctgggaaggagtcagacggccttgggccagagttccaggggctctggaagtggctgccag acttggatccccgagggcacacagtgtgtttcccggggaagtgggtggtctgctggtgaa ctgaatgcccagaagaggacatttattccattacagtataccgatgccacagagaacctc aaatttgacctcgctttctcttttgatctgccaacacaggtcctaagtccaatcaagagc tgcttttga >gi568815587f:70103460_70306470|GENSCAN_predicted_peptide_4|59_aa MSDGWVGCDLHRGGIGREGTDVDSQPAMTAVVTTEIPYGQGATPMLITKPVKWDKQNLP >gi568815587f:70103460_70306470|GENSCAN_predicted_CDS_4|180_bp atgagtgatggctgggtggggtgcgacttgcaccgaggagggataggaagagaagggaca gatgtggacagccaacctgcaatgacagctgtggtcaccacagagatcccttatgggcag ggggcaacccccatgctaatcaccaaacctgtcaaatgggacaagcagaaccttccttaa >gi568815587f:70103460_70306470|GENSCAN_predicted_peptide_5|114_aa MTAHNVARMDGGHHIVPLAQYSENPLAVIQNKGEMRPVSWAVEPQEANTALTVSRYHLLS FPPDSQTIPHTRPFQNPLSPEICSLSRIPHAKDCTTGCPNQNPVIARILPALHP >gi568815587f:70103460_70306470|GENSCAN_predicted_CDS_5|345_bp atgactgctcacaatgtggccaggatggatggaggacatcatattgttcctctagcccag tattctgagaacccgctggctgtgattcagaataaaggagaaatgaggcctgtttcttgg gcagtggagccccaggaagctaacacagccctcactgtatctcgctaccatctcctgagc ttcccacctgacagccagacaattcctcacactcggcccttccaaaacccactctcccct gagatctgttccctctccagaattcctcatgccaaggattgcaccactggctgcccaaac cagaacccggtcatcgcccggatcctccctgccctccacccctga >gi568815587f:70103460_70306470|GENSCAN_predicted_peptide_6|347_aa MVVCAANQCPGAASRPLLRRQRLRTRAKVLLGPRPRGGADARADALDVGHVDAGAAQPGP LLLRSASVRPRAGLSDWGGGPRGRGVGRAGGRRPRAAFVGSQALPERAAAREPPRSLLAR SRRRASPAPRDPGAGARGGDLWPLKALARAGSVVNGPHPPGFGASAAPDTGGEECGSSRL EVTIPISAHQIQPKNSGSRQFFPGADCTHSCKMPDGFCTCQFCWLYAPSVLFLKNDWMMC EVMPTISEAEGPPGGGGGHGSGSPSQPDADSHFEQLMVSMLEERDRLLDTLRETQETLAL TQGKLHEVGHERDSLQRQLNTALPQCSSLGMPSSVDNMMMPTVHVTT >gi568815587f:70103460_70306470|GENSCAN_predicted_CDS_6|1044_bp atggtggtctgcgctgccaatcagtgccccggggccgcctcccgccccctcctgaggaga cagcgcttgcgtactcgggccaaggtgctcctcgggccccgcccccggggcggtgctgac gctcgcgccgacgcgctcgacgtcgggcacgtagacgccggcgccgcgcagccgggcccg ctcctcctccgctccgccagtgtccggccgcgggccggccttagtgactggggcggcggg ccccggggccgcggcgtggggcgggcaggcggacgccggccgcgggctgctttcgtcggc tcccaagctctcccggagcgagcagccgcccgcgagccgccgcggagcctcctcgcccgc tcccgccggcgagcaagccccgcgccccgtgacccgggcgctggggcgaggggcggggac ctgtggccgctgaaagccctggcccgagcgggctccgttgtcaacggtcctcaccccccg gggttcggggcttccgccgcacctgatacaggaggggaggagtgcggctcgagcaggctg gaggtcaccatccccatcagtgctcaccaaatccagcccaagaattcagggagccgtcag ttcttcccgggagctgactgcacacattcctgcaagatgcctgatggattttgtacttgt cagttttgctggttgtatgctccttctgtgcttttcctgaaaaatgattggatgatgtgc gaggtgatgccgaccatcagcgaagcagaaggcccccctggaggaggtggaggccatggt tccggctccccttcacagccagatgcagattcacattttgaacagttgatggtctccatg ctagaagaaagggaccgccttcttgatacactgagagagactcaagaaacgctggcctta acccaggggaagttacacgaggttggtcatgaaagagattccttgcagagacagctcaac acggcacttccacagtgtagcagcctgggaatgcctagttcagtggataacatgatgatg ccaacggttcacgttaccacctga >gi568815587f:70103460_70306470|GENSCAN_predicted_peptide_7|92_aa MAVRTISPGRVGATAAVNSTAILEYLTAEVLELARNASKDFTVKHTTPHYLQLAIRGDKE LGSLFKSTIAGKGVIPHIHKSLIGKKGQQQTV >gi568815587f:70103460_70306470|GENSCAN_predicted_CDS_7|279_bp atggcagtcaggacaatcagtcctggacgtgtgggcgcgactgccgctgtgaatagcaca gccatcctggagtacctcaccgcagaggtacttgaactggcaagaaatgcgtcaaaagac ttcacagtaaagcatactacccctcattacttgcaacttgccattcgtggagataaagaa ttgggctctctcttcaagtctacaattgctggtaaaggtgtcattccacacatccacaaa tctctaattgggaagaaaggacaacagcagactgtctaa