GENSCAN 1.0 Date run: 7-Nov-116 Time: 18:06:04

Sequence gi568815582r:74523931_74761449 : 237519 bp : 45.16% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +  28282  28488  207  0  0   39   48   272 0.460  15.64
 1.02 PlyA +  28842  28847    6                               1.05

 2.03 PlyA -  30262  30257    6                               1.05
 2.02 Term -  59904  59785  120  0  0   60   44    71 0.057  -1.63
 2.01 Init -  83164  82727  438  1  0   95   78   451 0.802  38.92
 2.00 Prom -  97719  97680   40                              -4.16

 3.12 PlyA -  99000  98995    6                               1.05
 3.11 Term - 100141  99998  144  1  0  129   42   152 0.986  12.61
 3.10 Intr - 102624 102413  212  1  2   93   93   138 0.979  13.43
 3.09 Intr - 104736 104522  215  2  2   72  119     3 0.571   0.16
 3.08 Intr - 107027 106851  177  1  0   62   91   100 0.980   6.73
 3.07 Intr - 108743 108593  151  0  1  104  109    95 0.998  12.42
 3.06 Intr - 112647 112416  232  0  1   67  103    53 0.532   2.05
 3.05 Intr - 114040 113926  115  0  1   98   98    39 0.877   6.35
 3.04 Intr - 120523 120432   92  1  2   65  100   113 0.999   8.99
 3.03 Intr - 120805 120611  195  1  0   72  110   130 0.854  13.21
 3.02 Intr - 128192 127973  220  1  1   86   36    95 0.236   2.40
 3.01 Init - 137519 137002  518  2  2   68   99   192 0.427  12.66
 3.00 Prom - 138352 138313   40                              -3.06

 4.00 Prom + 139634 139673   40                              -7.66
 4.01 Init + 141679 141733   55  0  1   59   59    70 0.307   0.65
 4.02 Term + 143574 143839  266  1  2   17   33   245 0.969   7.57
 4.03 PlyA + 144889 144894    6                               1.05

 5.12 PlyA - 144944 144939    6                               1.05
 5.11 Term - 149026 148995   32  2  2   79   48    38 0.397  -3.08
 5.10 Intr - 151170 151030  141  1  0   62   54   253 0.574  19.62
 5.09 Intr - 152740 152605  136  1  1   92   58   -16 0.278  -4.06
 5.08 Intr - 155050 154969   82  0  1   61  113    44 0.142   3.84
 5.07 Intr - 158856 158721  136  1  1   69  111    52 0.368   5.23
 5.06 Intr - 161653 161556   98  0  2   92  110    87 0.948  10.95
 5.05 Intr - 167538 167347  192  2  0  -22   85   145 0.308   1.81
 5.04 Intr - 171791 171352  440  2  2   27   36   439 0.097  24.91
 5.03 Intr - 172202 172122   81  2  0   75   42   108 0.203   4.73
 5.02 Intr - 176752 176523  230  2  2   53  105    73 0.230   3.09
 5.01 Init - 184609 184549   61  1  1   76   69    75 0.486   5.81
 5.00 Prom - 187170 187131   40                              -6.76

 6.10 PlyA - 187236 187231    6                               1.05
 6.09 Term - 190075 189945  131  2  2   58   49   128 0.932   4.24
 6.08 Intr - 192669 192417  253  0  1  104   94   430 0.743  42.21
 6.07 Intr - 195230 195058  173  0  2   96  105   316 0.991  33.76
 6.06 Intr - 202401 202295  107  2  2   94   95   146 0.861  15.76
 6.05 Intr - 203456 203314  143  2  2   97   93   175 0.977  18.05
 6.04 Intr - 205967 205781  187  1  1   81  -10   147 0.083   3.89
 6.03 Intr - 213367 213357   11  1  2  113   94     6 0.036  -3.24
 6.02 Intr - 216185 216093   93  2  0  120   77    83 0.087  10.56
 6.01 Init - 228174 228100   75  0  0   82    6   122 0.002   4.49

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  37163  37264  102  1  0   81   76    96 0.945   8.11
S.002 Init - 204744 204679   66  0  0   66   37    95 0.829   3.27
S.003 Intr + 225452 225571  120  0  0  103   89    92 0.976  11.27

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582r:74523931_74761449|GENSCAN_predicted_peptide_1|68_aa
VSPAPDLGPAATKMLMPKKNGIAIYELLFKEGVMVAKKDVHLPKYPDLAEDKNVPNLHIM
KAMQSLKS

>gi568815582r:74523931_74761449|GENSCAN_predicted_CDS_1|207_bp
gtctctccagccccggacctgggccctgcagccaccaagatgctgatgcctaagaagaac
gggattgccatctatgaactcctttttaaggagggagtcatggtggccaagaaggatgtc
caccttcctaagtacccagacctggcagaagacaagaatgtgcccaaccttcacatcatg
aaggccatgcagtctctcaagtcttaa

>gi568815582r:74523931_74761449|GENSCAN_predicted_peptide_2|185_aa
MAACGRVRRMFRLSAALHLLLLFAAGAEKLPGQGVHSQGQGPGANFVSFVGQAGGGGPAG
QQLPQLPQSSQLQQQQQQQQQQQQPQPPQPPFPAGGPPARRGGAGAGGGWKLAEEESCRE
DVTRVCPKHTWSNNLAVLECLQDVRETWSSGGRSHGESFEREMLGHWLIGVWICLPEKMM
IELKE

>gi568815582r:74523931_74761449|GENSCAN_predicted_CDS_2|558_bp
atggcggcgtgtggacgtgtacggaggatgttccgcttgtcggcggcgctgcatctgctg
ctgctattcgcggccggggccgagaaactccccggccagggcgtccacagccagggccag
ggtcccggggccaactttgtgtccttcgtagggcaggccggaggcggcggcccggcgggt
cagcagctgccccagctgcctcagtcatcgcagcttcagcagcaacagcagcagcagcaa
cagcaacagcagcctcagccgccgcagccgcctttcccggcgggtgggcctccggcccgg
cggggaggagcgggggctggtgggggctggaagctggcggaggaagagtcctgcagggag
gacgtgacccgcgtgtgccctaagcacacctggagcaacaacctggcggtgctcgagtgc
ctgcaggatgtgagggagacatggtcgagtggaggacgtagccatggtgagagctttgaa
agagagatgcttggacattggctaattggagtctggatatgtctccctgagaagatgatg
attgagctgaaggaatga

>gi568815582r:74523931_74761449|GENSCAN_predicted_peptide_3|756_aa
MAHEAMEYDVQVQLNHAEQQPAPAGMASSQGGPALLQPVPADVVSSQGVPSILQPAPAEV
ISSQATPPLLQPAPQLSVDLTEVEVLGEDTVENINPRTSEQHRQGSDGNHTIPASSLHSM
TNFISGLQRLHGMLEFLRPSSSNHSVGPMRTRRRVSASRRARAGGSQRTDSARLRAPLDA
YFQVSRTQPDLPATTYDSETRNPVSEELQVSSSSDSDSDSSAEYGGVVDQAEESGAVILE
GQYFTQPSPQKSEPLLPSASMDEEEGDTCTICLEQWTNAGDHRLSALRCGHLFGYRCIST
WLKGQVRKCPQCNKKARHSDIVVLYARTLRALDTSEQERMKSSLLKEQMLRKQAELESAQ
CRLQLQVLTDKCTRLQRRVQDLQKLTSHQSQNLQQPRGSQAWVLSCSPSSQGQHKHKYHF
QKTFTVSQAGNCRIMAYCDALSCLVISQPSPQASFLPGFGVKMLSTANMKSSQYIPMHGK
QIRGLAFSSYLRGLLLSASLDNTIKLTSLETNTVVQTYNAGRPVWSCCWCLDEANYIYAG
LANGSILVYDVRNTSSHVQELVAQKARCPLVSLSYMPRAASAAFPYGGVLAGTLEDASFW
EQKMDFSHWPHVLPLEPGGCIDFQTENSSRHCLVTYRPDKNHTTIRSVLMEMSYRLDDTG
NPICSCQPVHTFFGGPTCKLLTKNAIFQSPENDGNILVCTGDEAANSALLWDAASGSLLQ
DLQTDQPVLDICPFEVNRNSYLATLTEKMVHIYKWE

>gi568815582r:74523931_74761449|GENSCAN_predicted_CDS_3|2271_bp
atggctcatgaagcaatggaatatgatgttcaggtgcagttaaatcatgccgaacaacag
ccagctcctgctggcatggccagcagccaagggggaccagccctcctccagcctgttcct
gctgatgtggtcagcagccagggggtaccatccatcctccagccagctcctgctgaggtg
atcagcagccaagcgacaccacccctgctccagcctgctccgcaactgtctgttgacctg
acagaagtggaggtcttgggagaagacactgtggagaacatcaatccaagaacttcagaa
caacataggcagggatctgatggtaatcacaccatcccagcatcttcgttgcattcaatg
accaacttcatcagcggactgcagagacttcatggcatgctggaattcctgagaccttca
tcttcaaaccacagtgtagggccaatgagaacaagaaggagggtatctgcttcacggagg
gcaagagccggagggtctcagaggacagacagtgccaggttgagagcaccattggatgct
tactttcaggtgagcaggacccagcctgacttgccagctaccacttatgattcagagact
aggaatcctgtatctgaagagttgcaggtgtctagtagttctgattctgacagtgacagc
tctgcagagtatggaggggttgttgaccaggcagaggaatctggagctgtcattttagaa
ggtcagtattttacccagccatctccccagaagtctgagcctctgctaccttctgcttct
atggatgaggaagaaggggacacttgtacaatatgtctggaacagtggaccaatgctggg
gaccaccggctctcagcattacgctgtgggcatctctttgggtataggtgcatttccacg
tggcttaaaggacaagtacgaaaatgtccccagtgcaacaagaaagccaggcacagtgac
attgtcgtcctttatgcccgaaccctgagagctttggacactagtgaacaggagcgcatg
aaaagttccctactgaaggaacagatgctaaggaaacaggccgagttagaatcagcacag
tgccgactccaactgcaggtcctcactgataagtgcactaggcttcaaaggcgtgttcag
gacttgcaaaaacttacgtcacatcaaagtcagaatttacagcaacccaggggctcccaa
gcatgggtcctgagctgctcaccctccagccagggccagcacaagcacaagtaccacttc
caaaagaccttcacagtatctcaggcaggaaactgccggatcatggcatactgtgatgct
ctgagctgcctggtgatatcacagccttctcctcaggcctcttttcttccaggctttggt
gttaagatgttgagtactgccaacatgaagagcagtcagtacattccgatgcatggcaaa
cagatccgtggactggcgtttagcagttacctcagaggcttgctactctctgcttcccta
gacaacactattaaactgaccagcctggagacaaataccgtggtccagacttataatgct
ggacgtcctgtctggagctgttgctggtgtcttgatgaggctaactacatctatgctgga
ctggccaatggttcaattctggtatatgacgtgcgaaacacgagcagtcatgtgcaggag
ttagtagctcagaaagccagatgcccactggtctccctgtcatacatgcccagagctgcc
tcagctgcatttccatatggtggggtgctggctggaaccttggaggatgcttcattctgg
gaacagaaaatggacttttctcattggcctcatgtgctgcccttggagccagggggctgc
atagactttcagacagagaacagctcccggcactgtcttgtgacctacaggcctgataaa
aatcacaccaccatacgaagtgtgctgatggaaatgtcctaccgactggatgacactgga
aatccaatctgctcctgccagcctgtacatacattttttggaggacctacttgcaaacta
ttgaccaaaaatgccattttccaaagcccagagaatgatggcaacatcctggtgtgtact
ggggatgaagcagcaaattctgccctgctgtgggatgctgccagtggctcgttgctccag
gacctacagaccgatcagcctgtgttggacatctgcccatttgaggtgaaccgtaacagc
tacttggctaccttaacagagaagatggtccacatctataagtgggagtga

>gi568815582r:74523931_74761449|GENSCAN_predicted_peptide_4|106_aa
MSRAWWRAPVVPAIAGSRGPPLAASAKRKGPPEFSSYVRGARAHSRLGSEPAAGRKATKK
TDKPRQDDKDDLDVTELTNEDPLDQLVKYGVNCGPIVGTTRKLYEK

>gi568815582r:74523931_74761449|GENSCAN_predicted_CDS_4|321_bp
atgagccgggcgtggtggcgcgcacctgtggtcccagctattgcgggaagccgagggccg
ccgctcgccgccagcgccaaaagaaaggggcccccggaattctccagctacgtacgagga
gcgcgagcccacagccgtctcggctccgagcccgccgccggcaggaaagccacaaagaaa
actgataaacccagacaagatgataaagacgatctagatgtaacagaactcactaatgaa
gatcctttggatcagcttgtgaaatacggagtgaattgtggtcctattgtgggaacaacc
aggaagctgtatgagaaatag

>gi568815582r:74523931_74761449|GENSCAN_predicted_peptide_5|542_aa
MQAKQMLALAKVLGQKQEHRGRDRGALSPPATCGVRYSQQSAGCGHQERGRRGTPAGLAF
ANFSPGQEAVVGKKVEERAFWNCARDRLDAHPSGGAKGRFTPRIQPTKGECEEAAQLPQS
EVEQVIHKRCEEMKYCKKQCRRLGHRVLGLIKPLEMLQDQGKRSVPSEKLTTAMNRFKAA
LEEANGEIEKFSNRSNICRFLTASQDKILFKDVNRKLSDVWKELSLLLQVEQRMPVSPIS
QGASWAQEDQQDADEDRRAFQMLRRGKLGLWSDLPPKCMQEIPQEQIKEIKKEQLSGSPW
ILLRENEVSTLYKGEYHRAPVAIKVFKKLQAGSIAIVRQTFNKEIKTMKKFESPNILRIF
GICIDETVTPPQFSIVMEYCELGTLRELLDREKDLTLGKRMVLVLGAARGLYRLHHSEAP
ELHGKIRSSNFLVTQGYQVKMPFPLCDGDISTPKTLFLLLSRSSCSLTIRCAIPPFIPGI
PYGARCCNSEKIRKLVAVKRQQEPLGEDCPSELREIIDECRAHDPSVRPSVDEQKRRLND
VF

>gi568815582r:74523931_74761449|GENSCAN_predicted_CDS_5|1629_bp
atgcaagccaagcagatgctggctttggcaaaagtccttggacaaaagcaggaacataga
ggtagggatcggggcgccttgtcgccgccagccacgtgtggcgtccggtacagtcagcag
agtgcagggtgcgggcaccaggaaagggggcgcaggggaactcccgcgggcctcgcgttt
gcaaacttctcgcctgggcaggaggcggtcgtgggaaagaaggtggaagagcgagctttt
tggaactgtgcacgggacagattggacgcacacccctcgggaggcgcgaagggccgcttc
accccacgcatccagccaaccaagggagagtgtgaggaggcggcacagctgccccagtcc
gaagtagagcaggtcatccacaaacggtgtgaagagatgaaatactgcaagaaacagtgc
cggcgcctgggccaccgcgtcctcggcctgatcaagcctctggagatgctccaggaccaa
ggaaagaggagcgtgccctctgagaagttaaccacagccatgaaccgcttcaaggctgcc
ctggaggaggctaatggggagatagaaaagttcagcaatagatccaatatctgcaggttt
ctaacagcaagccaggacaaaatactcttcaaggacgtgaacaggaagctgagtgatgtc
tggaaggagctctcgctgttacttcaggttgagcaacgcatgcctgtttcacccataagc
caaggagcgtcctgggcacaggaagatcagcaggatgcagacgaagacaggcgagctttc
cagatgctaagaagaggcaagctgggtctttggtcagatttaccaccaaaatgcatgcag
gagatcccgcaagagcaaatcaaggagatcaagaaggagcagctttcaggatccccgtgg
attctgctaagggaaaatgaagtcagcacactttataaaggagaataccacagagctcca
gtggccataaaagtattcaaaaaactccaggctggcagcattgcaatagtgaggcagact
ttcaataaggagatcaaaaccatgaagaaattcgaatctcccaacatcctgcgtatattt
gggatttgcattgatgaaacagtgactccgcctcaattctccattgtcatggagtactgt
gaactcgggaccctgagggagctgttggatagggaaaaagacctcacacttggcaagcgc
atggtcctagtcctgggggcagcccgaggcctataccggctacaccattcagaagcacct
gaactccacggaaaaatcagaagctcaaacttcctggtaactcaaggctaccaagtgaag
atgcctttccccctctgcgatggtgatataagtactcccaaaacactgtttctactactc
tcacgctcttcatgcagcctgactataagatgcgctattccacctttcatccctggtatc
ccatacggcgctagatgctgtaattctgagaagatccgcaagctggtggctgtgaagcgg
cagcaggagccactgggtgaagactgcccttcagagctgcgggagatcattgatgagtgc
cgggcccatgatccctctgtgcggccctctgtggatgagcagaagcgcagacttaatgat
gtgttctga

>gi568815582r:74523931_74761449|GENSCAN_predicted_peptide_6|390_aa
MEPLELADALDVSSEGKEEVEDDTKGSMENEPVALEETQKTDPAMEPRFKVVDWDKNTAS
GCLTPRAEDWLSIHRDTLILTPIVCGRLGPTSITDEGTEGLREEKMHVQDLTAGQRHSQA
LMDLVDWRKPLLWQVGHLGEKYDEWVHQPVTRPIRLFHSDLIEGLSKTVWYSVPIIWVPL
VLYLSWSYYRTFAQGNVRLFTSFTTEYTVAVPKSMFPGLFMLGTFLWSLIEYLIHRFLFH
MKPPSDSYYLIMLHFVMHGQHHKAPFDGSRLVFPPVPASLVIGVFYLCMQLILPEAVGGT
VFAGGLLGYVLYDMTHYYLHFGSPHKGSYLYSLKAHHVKHHFAHQKSGGSHPLGGQVALG
DPLLPGASLPRAQPTGLLQAVATGSSRKGK

>gi568815582r:74523931_74761449|GENSCAN_predicted_CDS_6|1173_bp
atggaaccattagaacttgctgatgcattggatgtcagcagcgagggaaaggaagaagtt
gaggatgacaccaagggctccatggagaacgagcctgtagcccttgaggaaactcagaag
acagatcctgctatggaaccacggttcaaagtggtggattgggacaagaacacagccagt
ggctgcctgactccccgtgctgaagattggctgagcatccaccgagacaccctcatcctc
actccgattgtgtgtggcagattggggcccaccagcatcacagatgagggcactgagggg
ctccgggaggaaaagatgcacgtccaggatctcacagctggtcaaaggcacagccaggct
ctaatggacctggtggactggcgaaagcctctcctgtggcaggtgggccacttgggagag
aagtacgatgagtgggttcaccagccggtgaccaggcccatccgcctcttccactcagac
ctcattgagggcctctctaagactgtctggtacagtgtccccatcatctgggtgcccctg
gtgctgtatctcagctggtcctactaccgaacctttgcccagggcaacgtccgactcttc
acgtcatttacaacagagtacacggtggcagtgcccaagtccatgttccccgggctcttc
atgctggggacattcctctggagcctcatcgagtacctcatccaccgcttcctgttccac
atgaagccccccagcgacagctattacctcatcatgctgcacttcgtcatgcacggccag
caccacaaggcacccttcgacggctcccgcctggtcttcccccctgtgccagcctccctg
gtgatcggcgtcttctacttgtgcatgcagctcatcctgcccgaggcagtagggggcact
gtgtttgcggggggcctcctgggctacgtcctctatgacatgacccattactacctgcac
tttggctcgccgcacaagggctcctacctgtacagcctgaaggcccaccacgtcaagcac
cactttgcacatcagaagtcaggagggtcacatccacttggtggccaggtggcccttggt
gacccacttcttcctggagcgtccctgcctagagctcagcccacaggactgcttcaggcc
gtggccacaggtagcagccgcaaggggaaatga