GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:46:08 Sequence gi568815582r:74574960_74795757 : 220798 bp : 46.21% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 783 778 6 1.05 1.02 Term - 8875 8756 120 1 0 60 44 71 0.051 -1.63 1.01 Init - 32135 31698 438 2 0 95 78 451 0.803 38.92 1.00 Prom - 46690 46651 40 -4.16 2.12 PlyA - 47971 47966 6 1.05 2.11 Term - 49112 48969 144 2 0 129 42 152 0.986 12.61 2.10 Intr - 51595 51384 212 2 2 93 93 138 0.979 13.43 2.09 Intr - 53707 53493 215 0 2 72 119 3 0.571 0.16 2.08 Intr - 55998 55822 177 2 0 62 91 100 0.980 6.73 2.07 Intr - 57714 57564 151 1 1 104 109 95 0.998 12.42 2.06 Intr - 61618 61387 232 1 1 67 103 53 0.532 2.05 2.05 Intr - 63011 62897 115 1 1 98 98 39 0.877 6.35 2.04 Intr - 69494 69403 92 2 2 65 100 113 0.999 8.99 2.03 Intr - 69776 69582 195 2 0 72 110 130 0.854 13.21 2.02 Intr - 77163 76944 220 2 1 86 36 95 0.236 2.40 2.01 Init - 86490 85973 518 0 2 68 99 192 0.427 12.66 2.00 Prom - 87323 87284 40 -3.06 3.00 Prom + 88605 88644 40 -7.66 3.01 Init + 90650 90704 55 1 1 59 59 70 0.307 0.65 3.02 Term + 92545 92810 266 2 2 17 33 245 0.969 7.57 3.03 PlyA + 93860 93865 6 1.05 4.12 PlyA - 93915 93910 6 1.05 4.11 Term - 97997 97966 32 0 2 79 48 38 0.397 -3.08 4.10 Intr - 100141 100001 141 2 0 62 54 253 0.574 19.62 4.09 Intr - 101711 101576 136 2 1 92 58 -16 0.278 -4.06 4.08 Intr - 104021 103940 82 1 1 61 113 44 0.142 3.84 4.07 Intr - 107827 107692 136 2 1 69 111 52 0.368 5.23 4.06 Intr - 110624 110527 98 1 2 92 110 87 0.948 10.95 4.05 Intr - 116509 116318 192 0 0 -22 85 145 0.308 1.81 4.04 Intr - 120762 120323 440 0 2 27 36 439 0.097 24.91 4.03 Intr - 121173 121093 81 0 0 75 42 108 0.203 4.73 4.02 Intr - 125723 125494 230 0 2 53 105 73 0.230 3.09 4.01 Init - 133580 133520 61 2 1 76 69 75 0.486 5.81 4.00 Prom - 136141 136102 40 -6.76 5.11 PlyA - 136207 136202 6 1.05 5.10 Term - 139046 138916 131 0 2 58 49 128 0.932 4.24 5.09 Intr - 141640 141388 253 1 1 104 94 430 0.743 42.21 5.08 Intr - 144201 144029 173 1 2 96 105 316 0.991 33.76 5.07 Intr - 151372 151266 107 0 2 94 95 146 0.861 15.76 5.06 Intr - 152427 152285 143 0 2 97 93 175 0.977 18.05 5.05 Intr - 154938 154752 187 2 1 81 -10 147 0.083 3.89 5.04 Intr - 162338 162328 11 2 2 113 94 6 0.036 -3.24 5.03 Intr - 165156 165064 93 0 0 120 77 83 0.087 10.56 5.02 Intr - 193094 192999 96 2 0 44 49 91 0.254 1.01 5.01 Init - 199796 199527 270 2 0 103 92 490 0.993 45.87 5.00 Prom - 204177 204138 40 -3.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 153715 153650 66 1 0 66 37 95 0.829 3.27 S.002 Intr + 174423 174542 120 1 0 103 89 92 0.979 11.27 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:74574960_74795757|GENSCAN_predicted_peptide_1|185_aa MAACGRVRRMFRLSAALHLLLLFAAGAEKLPGQGVHSQGQGPGANFVSFVGQAGGGGPAG QQLPQLPQSSQLQQQQQQQQQQQQPQPPQPPFPAGGPPARRGGAGAGGGWKLAEEESCRE DVTRVCPKHTWSNNLAVLECLQDVRETWSSGGRSHGESFEREMLGHWLIGVWICLPEKMM IELKE >gi568815582r:74574960_74795757|GENSCAN_predicted_CDS_1|558_bp atggcggcgtgtggacgtgtacggaggatgttccgcttgtcggcggcgctgcatctgctg ctgctattcgcggccggggccgagaaactccccggccagggcgtccacagccagggccag ggtcccggggccaactttgtgtccttcgtagggcaggccggaggcggcggcccggcgggt cagcagctgccccagctgcctcagtcatcgcagcttcagcagcaacagcagcagcagcaa cagcaacagcagcctcagccgccgcagccgcctttcccggcgggtgggcctccggcccgg cggggaggagcgggggctggtgggggctggaagctggcggaggaagagtcctgcagggag gacgtgacccgcgtgtgccctaagcacacctggagcaacaacctggcggtgctcgagtgc ctgcaggatgtgagggagacatggtcgagtggaggacgtagccatggtgagagctttgaa agagagatgcttggacattggctaattggagtctggatatgtctccctgagaagatgatg attgagctgaaggaatga >gi568815582r:74574960_74795757|GENSCAN_predicted_peptide_2|756_aa MAHEAMEYDVQVQLNHAEQQPAPAGMASSQGGPALLQPVPADVVSSQGVPSILQPAPAEV ISSQATPPLLQPAPQLSVDLTEVEVLGEDTVENINPRTSEQHRQGSDGNHTIPASSLHSM TNFISGLQRLHGMLEFLRPSSSNHSVGPMRTRRRVSASRRARAGGSQRTDSARLRAPLDA YFQVSRTQPDLPATTYDSETRNPVSEELQVSSSSDSDSDSSAEYGGVVDQAEESGAVILE GQYFTQPSPQKSEPLLPSASMDEEEGDTCTICLEQWTNAGDHRLSALRCGHLFGYRCIST WLKGQVRKCPQCNKKARHSDIVVLYARTLRALDTSEQERMKSSLLKEQMLRKQAELESAQ CRLQLQVLTDKCTRLQRRVQDLQKLTSHQSQNLQQPRGSQAWVLSCSPSSQGQHKHKYHF QKTFTVSQAGNCRIMAYCDALSCLVISQPSPQASFLPGFGVKMLSTANMKSSQYIPMHGK QIRGLAFSSYLRGLLLSASLDNTIKLTSLETNTVVQTYNAGRPVWSCCWCLDEANYIYAG LANGSILVYDVRNTSSHVQELVAQKARCPLVSLSYMPRAASAAFPYGGVLAGTLEDASFW EQKMDFSHWPHVLPLEPGGCIDFQTENSSRHCLVTYRPDKNHTTIRSVLMEMSYRLDDTG NPICSCQPVHTFFGGPTCKLLTKNAIFQSPENDGNILVCTGDEAANSALLWDAASGSLLQ DLQTDQPVLDICPFEVNRNSYLATLTEKMVHIYKWE >gi568815582r:74574960_74795757|GENSCAN_predicted_CDS_2|2271_bp atggctcatgaagcaatggaatatgatgttcaggtgcagttaaatcatgccgaacaacag ccagctcctgctggcatggccagcagccaagggggaccagccctcctccagcctgttcct gctgatgtggtcagcagccagggggtaccatccatcctccagccagctcctgctgaggtg atcagcagccaagcgacaccacccctgctccagcctgctccgcaactgtctgttgacctg acagaagtggaggtcttgggagaagacactgtggagaacatcaatccaagaacttcagaa caacataggcagggatctgatggtaatcacaccatcccagcatcttcgttgcattcaatg accaacttcatcagcggactgcagagacttcatggcatgctggaattcctgagaccttca tcttcaaaccacagtgtagggccaatgagaacaagaaggagggtatctgcttcacggagg gcaagagccggagggtctcagaggacagacagtgccaggttgagagcaccattggatgct tactttcaggtgagcaggacccagcctgacttgccagctaccacttatgattcagagact aggaatcctgtatctgaagagttgcaggtgtctagtagttctgattctgacagtgacagc tctgcagagtatggaggggttgttgaccaggcagaggaatctggagctgtcattttagaa ggtcagtattttacccagccatctccccagaagtctgagcctctgctaccttctgcttct atggatgaggaagaaggggacacttgtacaatatgtctggaacagtggaccaatgctggg gaccaccggctctcagcattacgctgtgggcatctctttgggtataggtgcatttccacg tggcttaaaggacaagtacgaaaatgtccccagtgcaacaagaaagccaggcacagtgac attgtcgtcctttatgcccgaaccctgagagctttggacactagtgaacaggagcgcatg aaaagttccctactgaaggaacagatgctaaggaaacaggccgagttagaatcagcacag tgccgactccaactgcaggtcctcactgataagtgcactaggcttcaaaggcgtgttcag gacttgcaaaaacttacgtcacatcaaagtcagaatttacagcaacccaggggctcccaa gcatgggtcctgagctgctcaccctccagccagggccagcacaagcacaagtaccacttc caaaagaccttcacagtatctcaggcaggaaactgccggatcatggcatactgtgatgct ctgagctgcctggtgatatcacagccttctcctcaggcctcttttcttccaggctttggt gttaagatgttgagtactgccaacatgaagagcagtcagtacattccgatgcatggcaaa cagatccgtggactggcgtttagcagttacctcagaggcttgctactctctgcttcccta gacaacactattaaactgaccagcctggagacaaataccgtggtccagacttataatgct ggacgtcctgtctggagctgttgctggtgtcttgatgaggctaactacatctatgctgga ctggccaatggttcaattctggtatatgacgtgcgaaacacgagcagtcatgtgcaggag ttagtagctcagaaagccagatgcccactggtctccctgtcatacatgcccagagctgcc tcagctgcatttccatatggtggggtgctggctggaaccttggaggatgcttcattctgg gaacagaaaatggacttttctcattggcctcatgtgctgcccttggagccagggggctgc atagactttcagacagagaacagctcccggcactgtcttgtgacctacaggcctgataaa aatcacaccaccatacgaagtgtgctgatggaaatgtcctaccgactggatgacactgga aatccaatctgctcctgccagcctgtacatacattttttggaggacctacttgcaaacta ttgaccaaaaatgccattttccaaagcccagagaatgatggcaacatcctggtgtgtact ggggatgaagcagcaaattctgccctgctgtgggatgctgccagtggctcgttgctccag gacctacagaccgatcagcctgtgttggacatctgcccatttgaggtgaaccgtaacagc tacttggctaccttaacagagaagatggtccacatctataagtgggagtga >gi568815582r:74574960_74795757|GENSCAN_predicted_peptide_3|106_aa MSRAWWRAPVVPAIAGSRGPPLAASAKRKGPPEFSSYVRGARAHSRLGSEPAAGRKATKK TDKPRQDDKDDLDVTELTNEDPLDQLVKYGVNCGPIVGTTRKLYEK >gi568815582r:74574960_74795757|GENSCAN_predicted_CDS_3|321_bp atgagccgggcgtggtggcgcgcacctgtggtcccagctattgcgggaagccgagggccg ccgctcgccgccagcgccaaaagaaaggggcccccggaattctccagctacgtacgagga gcgcgagcccacagccgtctcggctccgagcccgccgccggcaggaaagccacaaagaaa actgataaacccagacaagatgataaagacgatctagatgtaacagaactcactaatgaa gatcctttggatcagcttgtgaaatacggagtgaattgtggtcctattgtgggaacaacc aggaagctgtatgagaaatag >gi568815582r:74574960_74795757|GENSCAN_predicted_peptide_4|542_aa MQAKQMLALAKVLGQKQEHRGRDRGALSPPATCGVRYSQQSAGCGHQERGRRGTPAGLAF ANFSPGQEAVVGKKVEERAFWNCARDRLDAHPSGGAKGRFTPRIQPTKGECEEAAQLPQS EVEQVIHKRCEEMKYCKKQCRRLGHRVLGLIKPLEMLQDQGKRSVPSEKLTTAMNRFKAA LEEANGEIEKFSNRSNICRFLTASQDKILFKDVNRKLSDVWKELSLLLQVEQRMPVSPIS QGASWAQEDQQDADEDRRAFQMLRRGKLGLWSDLPPKCMQEIPQEQIKEIKKEQLSGSPW ILLRENEVSTLYKGEYHRAPVAIKVFKKLQAGSIAIVRQTFNKEIKTMKKFESPNILRIF GICIDETVTPPQFSIVMEYCELGTLRELLDREKDLTLGKRMVLVLGAARGLYRLHHSEAP ELHGKIRSSNFLVTQGYQVKMPFPLCDGDISTPKTLFLLLSRSSCSLTIRCAIPPFIPGI PYGARCCNSEKIRKLVAVKRQQEPLGEDCPSELREIIDECRAHDPSVRPSVDEQKRRLND VF >gi568815582r:74574960_74795757|GENSCAN_predicted_CDS_4|1629_bp atgcaagccaagcagatgctggctttggcaaaagtccttggacaaaagcaggaacataga ggtagggatcggggcgccttgtcgccgccagccacgtgtggcgtccggtacagtcagcag agtgcagggtgcgggcaccaggaaagggggcgcaggggaactcccgcgggcctcgcgttt gcaaacttctcgcctgggcaggaggcggtcgtgggaaagaaggtggaagagcgagctttt tggaactgtgcacgggacagattggacgcacacccctcgggaggcgcgaagggccgcttc accccacgcatccagccaaccaagggagagtgtgaggaggcggcacagctgccccagtcc gaagtagagcaggtcatccacaaacggtgtgaagagatgaaatactgcaagaaacagtgc cggcgcctgggccaccgcgtcctcggcctgatcaagcctctggagatgctccaggaccaa ggaaagaggagcgtgccctctgagaagttaaccacagccatgaaccgcttcaaggctgcc ctggaggaggctaatggggagatagaaaagttcagcaatagatccaatatctgcaggttt ctaacagcaagccaggacaaaatactcttcaaggacgtgaacaggaagctgagtgatgtc tggaaggagctctcgctgttacttcaggttgagcaacgcatgcctgtttcacccataagc caaggagcgtcctgggcacaggaagatcagcaggatgcagacgaagacaggcgagctttc cagatgctaagaagaggcaagctgggtctttggtcagatttaccaccaaaatgcatgcag gagatcccgcaagagcaaatcaaggagatcaagaaggagcagctttcaggatccccgtgg attctgctaagggaaaatgaagtcagcacactttataaaggagaataccacagagctcca gtggccataaaagtattcaaaaaactccaggctggcagcattgcaatagtgaggcagact ttcaataaggagatcaaaaccatgaagaaattcgaatctcccaacatcctgcgtatattt gggatttgcattgatgaaacagtgactccgcctcaattctccattgtcatggagtactgt gaactcgggaccctgagggagctgttggatagggaaaaagacctcacacttggcaagcgc atggtcctagtcctgggggcagcccgaggcctataccggctacaccattcagaagcacct gaactccacggaaaaatcagaagctcaaacttcctggtaactcaaggctaccaagtgaag atgcctttccccctctgcgatggtgatataagtactcccaaaacactgtttctactactc tcacgctcttcatgcagcctgactataagatgcgctattccacctttcatccctggtatc ccatacggcgctagatgctgtaattctgagaagatccgcaagctggtggctgtgaagcgg cagcaggagccactgggtgaagactgcccttcagagctgcgggagatcattgatgagtgc cgggcccatgatccctctgtgcggccctctgtggatgagcagaagcgcagacttaatgat gtgttctga >gi568815582r:74574960_74795757|GENSCAN_predicted_peptide_5|487_aa MAPAPPPAASFSPSEVQRRLAAGACWVRRGARLYDLSSFVRHHPGGEQLLRARAGQDISA DLDGPPHRHSANARRWLEQYYVGELRGEQQTGDKHPMRSETHHITETALAGVTRTFAFLH PVGSMENEPVALEETQKTDPAMEPRFKVVDWDKNTASGCLTPRAEDWLSIHRDTLILTPI VCGRLGPTSITDEGTEGLREEKMHVQDLTAGQRHSQALMDLVDWRKPLLWQVGHLGEKYD EWVHQPVTRPIRLFHSDLIEGLSKTVWYSVPIIWVPLVLYLSWSYYRTFAQGNVRLFTSF TTEYTVAVPKSMFPGLFMLGTFLWSLIEYLIHRFLFHMKPPSDSYYLIMLHFVMHGQHHK APFDGSRLVFPPVPASLVIGVFYLCMQLILPEAVGGTVFAGGLLGYVLYDMTHYYLHFGS PHKGSYLYSLKAHHVKHHFAHQKSGGSHPLGGQVALGDPLLPGASLPRAQPTGLLQAVAT GSSRKGK >gi568815582r:74574960_74795757|GENSCAN_predicted_CDS_5|1464_bp atggcccccgctccgccccccgccgcctccttctcgccctccgaggtccagcggcgcctg gcggccggcgcgtgctgggtccgccgcggggcccgcctctacgacctctccagcttcgtg cggcaccacccggggggcgagcagctgctgcgggccagggcgggccaggacatcagcgcc gacctggacgggccgccgcacaggcactcggccaacgcgcgccgctggctggagcagtac tacgtgggagagctccgcggggagcagcagacaggtgataagcatcccatgcgctctgaa acccaccacatcacagaaacagcccttgctggcgttaccaggaccttcgccttcctccac ccggtgggctccatggagaacgagcctgtagcccttgaggaaactcagaagacagatcct gctatggaaccacggttcaaagtggtggattgggacaagaacacagccagtggctgcctg actccccgtgctgaagattggctgagcatccaccgagacaccctcatcctcactccgatt gtgtgtggcagattggggcccaccagcatcacagatgagggcactgaggggctccgggag gaaaagatgcacgtccaggatctcacagctggtcaaaggcacagccaggctctaatggac ctggtggactggcgaaagcctctcctgtggcaggtgggccacttgggagagaagtacgat gagtgggttcaccagccggtgaccaggcccatccgcctcttccactcagacctcattgag ggcctctctaagactgtctggtacagtgtccccatcatctgggtgcccctggtgctgtat ctcagctggtcctactaccgaacctttgcccagggcaacgtccgactcttcacgtcattt acaacagagtacacggtggcagtgcccaagtccatgttccccgggctcttcatgctgggg acattcctctggagcctcatcgagtacctcatccaccgcttcctgttccacatgaagccc cccagcgacagctattacctcatcatgctgcacttcgtcatgcacggccagcaccacaag gcacccttcgacggctcccgcctggtcttcccccctgtgccagcctccctggtgatcggc gtcttctacttgtgcatgcagctcatcctgcccgaggcagtagggggcactgtgtttgcg gggggcctcctgggctacgtcctctatgacatgacccattactacctgcactttggctcg ccgcacaagggctcctacctgtacagcctgaaggcccaccacgtcaagcaccactttgca catcagaagtcaggagggtcacatccacttggtggccaggtggcccttggtgacccactt cttcctggagcgtccctgcctagagctcagcccacaggactgcttcaggccgtggccaca ggtagcagccgcaaggggaaatga