GENSCAN 1.0 Date run: 3-Oct-119 Time: 17:24:40 Sequence gi568815582r:74614193_74816623 : 202431 bp : 45.90% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.12 PlyA - 320 315 6 1.05 1.11 Term - 9879 9736 144 0 0 129 42 152 0.993 12.61 1.10 Intr - 12362 12151 212 0 2 93 93 138 0.979 13.43 1.09 Intr - 14474 14260 215 1 2 72 119 3 0.571 0.16 1.08 Intr - 16765 16589 177 0 0 62 91 100 0.980 6.73 1.07 Intr - 18481 18331 151 2 1 104 109 95 0.998 12.42 1.06 Intr - 22385 22154 232 2 1 67 103 53 0.532 2.05 1.05 Intr - 23778 23664 115 2 1 98 98 39 0.877 6.35 1.04 Intr - 30261 30170 92 0 2 65 100 113 0.999 8.99 1.03 Intr - 30543 30349 195 0 0 72 110 130 0.854 13.21 1.02 Intr - 37930 37711 220 0 1 86 36 95 0.236 2.40 1.01 Init - 47257 46740 518 1 2 68 99 192 0.427 12.66 1.00 Prom - 48090 48051 40 -3.06 2.00 Prom + 49372 49411 40 -7.66 2.01 Init + 51417 51471 55 2 1 59 59 70 0.307 0.65 2.02 Term + 53312 53577 266 0 2 17 33 245 0.969 7.57 2.03 PlyA + 54627 54632 6 1.05 3.12 PlyA - 54682 54677 6 1.05 3.11 Term - 58764 58733 32 1 2 79 48 38 0.397 -3.08 3.10 Intr - 60908 60768 141 0 0 62 54 253 0.574 19.62 3.09 Intr - 62478 62343 136 0 1 92 58 -16 0.278 -4.06 3.08 Intr - 64788 64707 82 2 1 61 113 44 0.142 3.84 3.07 Intr - 68594 68459 136 0 1 69 111 52 0.368 5.23 3.06 Intr - 71391 71294 98 2 2 92 110 87 0.948 10.95 3.05 Intr - 77276 77085 192 1 0 -22 85 145 0.308 1.81 3.04 Intr - 81529 81090 440 1 2 27 36 439 0.097 24.91 3.03 Intr - 81940 81860 81 1 0 75 42 108 0.203 4.73 3.02 Intr - 86490 86261 230 1 2 53 105 73 0.230 3.09 3.01 Init - 94347 94287 61 0 1 76 69 75 0.486 5.81 3.00 Prom - 96908 96869 40 -6.76 4.11 PlyA - 96974 96969 6 1.05 4.10 Term - 99813 99683 131 1 2 58 49 128 0.932 4.24 4.09 Intr - 102407 102155 253 2 1 104 94 430 0.743 42.21 4.08 Intr - 104968 104796 173 2 2 96 105 316 0.991 33.76 4.07 Intr - 112139 112033 107 1 2 94 95 146 0.861 15.76 4.06 Intr - 113194 113052 143 1 2 97 93 175 0.977 18.05 4.05 Intr - 115705 115519 187 0 1 81 -10 147 0.083 3.89 4.04 Intr - 123105 123095 11 0 2 113 94 6 0.036 -3.24 4.03 Intr - 125923 125831 93 1 0 120 77 83 0.087 10.56 4.02 Intr - 153861 153766 96 0 0 44 49 91 0.254 1.01 4.01 Init - 160563 160294 270 0 0 103 92 490 0.993 45.87 4.00 Prom - 164944 164905 40 -3.16 5.00 Prom + 192440 192479 40 -4.56 5.01 Sngl + 192926 193285 360 1 0 49 42 172 0.672 4.98 5.02 PlyA + 194221 194226 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 114482 114417 66 2 0 66 37 95 0.829 3.27 S.002 Intr + 135190 135309 120 2 0 103 89 92 0.979 11.27 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:74614193_74816623|GENSCAN_predicted_peptide_1|756_aa MAHEAMEYDVQVQLNHAEQQPAPAGMASSQGGPALLQPVPADVVSSQGVPSILQPAPAEV ISSQATPPLLQPAPQLSVDLTEVEVLGEDTVENINPRTSEQHRQGSDGNHTIPASSLHSM TNFISGLQRLHGMLEFLRPSSSNHSVGPMRTRRRVSASRRARAGGSQRTDSARLRAPLDA YFQVSRTQPDLPATTYDSETRNPVSEELQVSSSSDSDSDSSAEYGGVVDQAEESGAVILE GQYFTQPSPQKSEPLLPSASMDEEEGDTCTICLEQWTNAGDHRLSALRCGHLFGYRCIST WLKGQVRKCPQCNKKARHSDIVVLYARTLRALDTSEQERMKSSLLKEQMLRKQAELESAQ CRLQLQVLTDKCTRLQRRVQDLQKLTSHQSQNLQQPRGSQAWVLSCSPSSQGQHKHKYHF QKTFTVSQAGNCRIMAYCDALSCLVISQPSPQASFLPGFGVKMLSTANMKSSQYIPMHGK QIRGLAFSSYLRGLLLSASLDNTIKLTSLETNTVVQTYNAGRPVWSCCWCLDEANYIYAG LANGSILVYDVRNTSSHVQELVAQKARCPLVSLSYMPRAASAAFPYGGVLAGTLEDASFW EQKMDFSHWPHVLPLEPGGCIDFQTENSSRHCLVTYRPDKNHTTIRSVLMEMSYRLDDTG NPICSCQPVHTFFGGPTCKLLTKNAIFQSPENDGNILVCTGDEAANSALLWDAASGSLLQ DLQTDQPVLDICPFEVNRNSYLATLTEKMVHIYKWE >gi568815582r:74614193_74816623|GENSCAN_predicted_CDS_1|2271_bp atggctcatgaagcaatggaatatgatgttcaggtgcagttaaatcatgccgaacaacag ccagctcctgctggcatggccagcagccaagggggaccagccctcctccagcctgttcct gctgatgtggtcagcagccagggggtaccatccatcctccagccagctcctgctgaggtg atcagcagccaagcgacaccacccctgctccagcctgctccgcaactgtctgttgacctg acagaagtggaggtcttgggagaagacactgtggagaacatcaatccaagaacttcagaa caacataggcagggatctgatggtaatcacaccatcccagcatcttcgttgcattcaatg accaacttcatcagcggactgcagagacttcatggcatgctggaattcctgagaccttca tcttcaaaccacagtgtagggccaatgagaacaagaaggagggtatctgcttcacggagg gcaagagccggagggtctcagaggacagacagtgccaggttgagagcaccattggatgct tactttcaggtgagcaggacccagcctgacttgccagctaccacttatgattcagagact aggaatcctgtatctgaagagttgcaggtgtctagtagttctgattctgacagtgacagc tctgcagagtatggaggggttgttgaccaggcagaggaatctggagctgtcattttagaa ggtcagtattttacccagccatctccccagaagtctgagcctctgctaccttctgcttct atggatgaggaagaaggggacacttgtacaatatgtctggaacagtggaccaatgctggg gaccaccggctctcagcattacgctgtgggcatctctttgggtataggtgcatttccacg tggcttaaaggacaagtacgaaaatgtccccagtgcaacaagaaagccaggcacagtgac attgtcgtcctttatgcccgaaccctgagagctttggacactagtgaacaggagcgcatg aaaagttccctactgaaggaacagatgctaaggaaacaggccgagttagaatcagcacag tgccgactccaactgcaggtcctcactgataagtgcactaggcttcaaaggcgtgttcag gacttgcaaaaacttacgtcacatcaaagtcagaatttacagcaacccaggggctcccaa gcatgggtcctgagctgctcaccctccagccagggccagcacaagcacaagtaccacttc caaaagaccttcacagtatctcaggcaggaaactgccggatcatggcatactgtgatgct ctgagctgcctggtgatatcacagccttctcctcaggcctcttttcttccaggctttggt gttaagatgttgagtactgccaacatgaagagcagtcagtacattccgatgcatggcaaa cagatccgtggactggcgtttagcagttacctcagaggcttgctactctctgcttcccta gacaacactattaaactgaccagcctggagacaaataccgtggtccagacttataatgct ggacgtcctgtctggagctgttgctggtgtcttgatgaggctaactacatctatgctgga ctggccaatggttcaattctggtatatgacgtgcgaaacacgagcagtcatgtgcaggag ttagtagctcagaaagccagatgcccactggtctccctgtcatacatgcccagagctgcc tcagctgcatttccatatggtggggtgctggctggaaccttggaggatgcttcattctgg gaacagaaaatggacttttctcattggcctcatgtgctgcccttggagccagggggctgc atagactttcagacagagaacagctcccggcactgtcttgtgacctacaggcctgataaa aatcacaccaccatacgaagtgtgctgatggaaatgtcctaccgactggatgacactgga aatccaatctgctcctgccagcctgtacatacattttttggaggacctacttgcaaacta ttgaccaaaaatgccattttccaaagcccagagaatgatggcaacatcctggtgtgtact ggggatgaagcagcaaattctgccctgctgtgggatgctgccagtggctcgttgctccag gacctacagaccgatcagcctgtgttggacatctgcccatttgaggtgaaccgtaacagc tacttggctaccttaacagagaagatggtccacatctataagtgggagtga >gi568815582r:74614193_74816623|GENSCAN_predicted_peptide_2|106_aa MSRAWWRAPVVPAIAGSRGPPLAASAKRKGPPEFSSYVRGARAHSRLGSEPAAGRKATKK TDKPRQDDKDDLDVTELTNEDPLDQLVKYGVNCGPIVGTTRKLYEK >gi568815582r:74614193_74816623|GENSCAN_predicted_CDS_2|321_bp atgagccgggcgtggtggcgcgcacctgtggtcccagctattgcgggaagccgagggccg ccgctcgccgccagcgccaaaagaaaggggcccccggaattctccagctacgtacgagga gcgcgagcccacagccgtctcggctccgagcccgccgccggcaggaaagccacaaagaaa actgataaacccagacaagatgataaagacgatctagatgtaacagaactcactaatgaa gatcctttggatcagcttgtgaaatacggagtgaattgtggtcctattgtgggaacaacc aggaagctgtatgagaaatag >gi568815582r:74614193_74816623|GENSCAN_predicted_peptide_3|542_aa MQAKQMLALAKVLGQKQEHRGRDRGALSPPATCGVRYSQQSAGCGHQERGRRGTPAGLAF ANFSPGQEAVVGKKVEERAFWNCARDRLDAHPSGGAKGRFTPRIQPTKGECEEAAQLPQS EVEQVIHKRCEEMKYCKKQCRRLGHRVLGLIKPLEMLQDQGKRSVPSEKLTTAMNRFKAA LEEANGEIEKFSNRSNICRFLTASQDKILFKDVNRKLSDVWKELSLLLQVEQRMPVSPIS QGASWAQEDQQDADEDRRAFQMLRRGKLGLWSDLPPKCMQEIPQEQIKEIKKEQLSGSPW ILLRENEVSTLYKGEYHRAPVAIKVFKKLQAGSIAIVRQTFNKEIKTMKKFESPNILRIF GICIDETVTPPQFSIVMEYCELGTLRELLDREKDLTLGKRMVLVLGAARGLYRLHHSEAP ELHGKIRSSNFLVTQGYQVKMPFPLCDGDISTPKTLFLLLSRSSCSLTIRCAIPPFIPGI PYGARCCNSEKIRKLVAVKRQQEPLGEDCPSELREIIDECRAHDPSVRPSVDEQKRRLND VF >gi568815582r:74614193_74816623|GENSCAN_predicted_CDS_3|1629_bp atgcaagccaagcagatgctggctttggcaaaagtccttggacaaaagcaggaacataga ggtagggatcggggcgccttgtcgccgccagccacgtgtggcgtccggtacagtcagcag agtgcagggtgcgggcaccaggaaagggggcgcaggggaactcccgcgggcctcgcgttt gcaaacttctcgcctgggcaggaggcggtcgtgggaaagaaggtggaagagcgagctttt tggaactgtgcacgggacagattggacgcacacccctcgggaggcgcgaagggccgcttc accccacgcatccagccaaccaagggagagtgtgaggaggcggcacagctgccccagtcc gaagtagagcaggtcatccacaaacggtgtgaagagatgaaatactgcaagaaacagtgc cggcgcctgggccaccgcgtcctcggcctgatcaagcctctggagatgctccaggaccaa ggaaagaggagcgtgccctctgagaagttaaccacagccatgaaccgcttcaaggctgcc ctggaggaggctaatggggagatagaaaagttcagcaatagatccaatatctgcaggttt ctaacagcaagccaggacaaaatactcttcaaggacgtgaacaggaagctgagtgatgtc tggaaggagctctcgctgttacttcaggttgagcaacgcatgcctgtttcacccataagc caaggagcgtcctgggcacaggaagatcagcaggatgcagacgaagacaggcgagctttc cagatgctaagaagaggcaagctgggtctttggtcagatttaccaccaaaatgcatgcag gagatcccgcaagagcaaatcaaggagatcaagaaggagcagctttcaggatccccgtgg attctgctaagggaaaatgaagtcagcacactttataaaggagaataccacagagctcca gtggccataaaagtattcaaaaaactccaggctggcagcattgcaatagtgaggcagact ttcaataaggagatcaaaaccatgaagaaattcgaatctcccaacatcctgcgtatattt gggatttgcattgatgaaacagtgactccgcctcaattctccattgtcatggagtactgt gaactcgggaccctgagggagctgttggatagggaaaaagacctcacacttggcaagcgc atggtcctagtcctgggggcagcccgaggcctataccggctacaccattcagaagcacct gaactccacggaaaaatcagaagctcaaacttcctggtaactcaaggctaccaagtgaag atgcctttccccctctgcgatggtgatataagtactcccaaaacactgtttctactactc tcacgctcttcatgcagcctgactataagatgcgctattccacctttcatccctggtatc ccatacggcgctagatgctgtaattctgagaagatccgcaagctggtggctgtgaagcgg cagcaggagccactgggtgaagactgcccttcagagctgcgggagatcattgatgagtgc cgggcccatgatccctctgtgcggccctctgtggatgagcagaagcgcagacttaatgat gtgttctga >gi568815582r:74614193_74816623|GENSCAN_predicted_peptide_4|487_aa MAPAPPPAASFSPSEVQRRLAAGACWVRRGARLYDLSSFVRHHPGGEQLLRARAGQDISA DLDGPPHRHSANARRWLEQYYVGELRGEQQTGDKHPMRSETHHITETALAGVTRTFAFLH PVGSMENEPVALEETQKTDPAMEPRFKVVDWDKNTASGCLTPRAEDWLSIHRDTLILTPI VCGRLGPTSITDEGTEGLREEKMHVQDLTAGQRHSQALMDLVDWRKPLLWQVGHLGEKYD EWVHQPVTRPIRLFHSDLIEGLSKTVWYSVPIIWVPLVLYLSWSYYRTFAQGNVRLFTSF TTEYTVAVPKSMFPGLFMLGTFLWSLIEYLIHRFLFHMKPPSDSYYLIMLHFVMHGQHHK APFDGSRLVFPPVPASLVIGVFYLCMQLILPEAVGGTVFAGGLLGYVLYDMTHYYLHFGS PHKGSYLYSLKAHHVKHHFAHQKSGGSHPLGGQVALGDPLLPGASLPRAQPTGLLQAVAT GSSRKGK >gi568815582r:74614193_74816623|GENSCAN_predicted_CDS_4|1464_bp atggcccccgctccgccccccgccgcctccttctcgccctccgaggtccagcggcgcctg gcggccggcgcgtgctgggtccgccgcggggcccgcctctacgacctctccagcttcgtg cggcaccacccggggggcgagcagctgctgcgggccagggcgggccaggacatcagcgcc gacctggacgggccgccgcacaggcactcggccaacgcgcgccgctggctggagcagtac tacgtgggagagctccgcggggagcagcagacaggtgataagcatcccatgcgctctgaa acccaccacatcacagaaacagcccttgctggcgttaccaggaccttcgccttcctccac ccggtgggctccatggagaacgagcctgtagcccttgaggaaactcagaagacagatcct gctatggaaccacggttcaaagtggtggattgggacaagaacacagccagtggctgcctg actccccgtgctgaagattggctgagcatccaccgagacaccctcatcctcactccgatt gtgtgtggcagattggggcccaccagcatcacagatgagggcactgaggggctccgggag gaaaagatgcacgtccaggatctcacagctggtcaaaggcacagccaggctctaatggac ctggtggactggcgaaagcctctcctgtggcaggtgggccacttgggagagaagtacgat gagtgggttcaccagccggtgaccaggcccatccgcctcttccactcagacctcattgag ggcctctctaagactgtctggtacagtgtccccatcatctgggtgcccctggtgctgtat ctcagctggtcctactaccgaacctttgcccagggcaacgtccgactcttcacgtcattt acaacagagtacacggtggcagtgcccaagtccatgttccccgggctcttcatgctgggg acattcctctggagcctcatcgagtacctcatccaccgcttcctgttccacatgaagccc cccagcgacagctattacctcatcatgctgcacttcgtcatgcacggccagcaccacaag gcacccttcgacggctcccgcctggtcttcccccctgtgccagcctccctggtgatcggc gtcttctacttgtgcatgcagctcatcctgcccgaggcagtagggggcactgtgtttgcg gggggcctcctgggctacgtcctctatgacatgacccattactacctgcactttggctcg ccgcacaagggctcctacctgtacagcctgaaggcccaccacgtcaagcaccactttgca catcagaagtcaggagggtcacatccacttggtggccaggtggcccttggtgacccactt cttcctggagcgtccctgcctagagctcagcccacaggactgcttcaggccgtggccaca ggtagcagccgcaaggggaaatga >gi568815582r:74614193_74816623|GENSCAN_predicted_peptide_5|119_aa MECSAVSKEEALEGVALLCPLVILLSAVLSREEALKRVAPLCRQVVQMSLQASEALSKED SSSLQVVIPFLSLSSGLCPAVAEPRAFMDLSGEEMHANWWMRSHGQAQRRRFKSLLWSV >gi568815582r:74614193_74816623|GENSCAN_predicted_CDS_5|360_bp atggagtgttcagctgtcagcaaagaggaggccctggagggggtagctcttctctgccca ctggtcatcctgctgtctgctgttctcagcagagaggaggctctgaagagggtggctcct ctttgtaggcaggttgttcagatgtctctgcaggcctctgaagctctcagcaaagaggat agctcctctctgcaggtagttatccccttcctctctctgtcctctggtctttgccctgct gtggccgagcccagggcttttatggacctgagcggggaggaaatgcatgccaattggtgg atgcgcagccatgggcaggcccagaggaggcgcttcaagtccctactctggtctgtgtga