GENSCAN 1.0 Date run: 6-Nov-116 Time: 03:58:03 Sequence gi568815597f:235280050_235548759 : 268710 bp : 43.02% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 436 538 103 0 1 60 80 67 0.619 3.63 1.02 Term + 16092 16288 197 1 2 -43 39 284 0.694 7.97 1.03 PlyA + 16336 16341 6 1.05 2.05 PlyA - 16425 16420 6 1.05 2.04 Term - 46919 46867 53 0 2 142 49 28 0.663 1.89 2.03 Intr - 47912 47720 193 2 1 90 97 149 0.887 15.07 2.02 Intr - 48749 48695 55 1 1 93 26 74 0.607 0.68 2.01 Init - 52223 52186 38 2 2 86 76 19 0.350 -0.02 2.00 Prom - 52394 52355 40 -5.66 3.00 Prom + 54016 54055 40 -4.16 3.01 Init + 61983 62694 712 2 1 52 60 271 0.009 15.77 3.02 Intr + 82371 82473 103 1 1 83 36 61 0.007 -0.37 3.03 Intr + 87330 87455 126 0 0 84 99 45 0.068 4.99 3.04 Intr + 99970 100100 131 1 2 83 115 115 0.425 14.04 3.05 Term + 100175 100188 14 0 2 77 43 17 0.599 -5.54 3.06 PlyA + 102217 102222 6 1.05 4.04 PlyA - 103014 103009 6 1.05 4.03 Term - 104593 103968 626 2 2 46 42 287 0.716 14.45 4.02 Intr - 105223 104930 294 2 0 39 20 166 0.053 1.88 4.01 Init - 106870 106570 301 1 1 88 -8 241 0.133 11.91 4.00 Prom - 107597 107558 40 -7.76 5.00 Prom + 116446 116485 40 -3.76 5.01 Init + 129155 129195 41 1 2 85 62 39 0.704 0.66 5.02 Intr + 134384 134569 186 2 0 93 115 160 0.978 18.00 5.03 Term + 139424 139583 160 2 1 48 32 97 0.235 -2.49 5.04 PlyA + 145182 145187 6 1.05 6.00 Prom + 150996 151035 40 -4.86 6.01 Init + 152936 153122 187 1 1 85 92 284 0.967 27.72 6.02 Intr + 154010 154162 153 0 0 68 66 56 0.509 1.44 6.03 Intr + 156495 156559 65 1 2 101 71 51 0.702 3.14 6.04 Intr + 157273 157425 153 0 0 75 83 181 0.775 16.57 6.05 Term + 161765 161818 54 1 0 88 48 39 0.406 -2.54 6.06 PlyA + 163795 163800 6 1.05 7.15 PlyA - 164186 164181 6 1.05 7.14 Term - 166035 165969 67 2 1 46 49 84 0.229 -2.29 7.13 Intr - 170291 170191 101 2 2 80 62 74 0.290 2.91 7.12 Intr - 173097 173041 57 0 0 24 111 70 0.774 1.98 7.11 Intr - 174260 174107 154 1 1 36 91 112 0.703 6.37 7.10 Intr - 175635 175510 126 2 0 104 106 44 0.939 7.59 7.09 Intr - 178737 178554 184 1 1 72 78 130 0.790 9.25 7.08 Intr - 185665 185587 79 1 1 97 110 -16 0.590 0.62 7.07 Intr - 190911 190801 111 0 0 48 98 53 0.650 2.88 7.06 Intr - 200100 200005 96 0 0 64 77 131 0.895 9.81 7.05 Intr - 204466 204273 194 2 2 47 69 251 0.991 18.31 7.04 Intr - 209219 209119 101 1 2 40 111 55 0.861 2.75 7.03 Intr - 214779 214632 148 1 1 112 90 57 0.897 7.59 7.02 Intr - 223812 223639 174 1 0 73 18 87 0.459 0.11 7.01 Init - 224203 224092 112 1 1 26 94 265 0.986 19.28 7.00 Prom - 238078 238039 40 -5.46 8.00 Prom + 244477 244516 40 -3.96 8.01 Init + 251373 251393 21 2 0 99 50 17 0.526 -2.27 8.02 Term + 253887 253955 69 2 0 119 39 111 0.921 7.24 8.03 PlyA + 254329 254334 6 1.05 9.03 PlyA - 254341 254336 6 1.05 9.02 Term - 258768 258578 191 1 2 41 52 137 0.269 3.01 9.01 Intr - 264401 264370 32 1 2 116 74 16 0.175 0.77 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 61983 62723 741 2 0 52 35 309 0.897 18.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:235280050_235548759|GENSCAN_predicted_peptide_1|99_aa MISHNSGPEQLTGFLPLRSSPRKRLLPEPKMAASEQKVNNILVFTVDVKANKYQIKQAKK KLCGTDMANINTLIRPDGKKAHVLPVPDYDDVANKIGII >gi568815597f:235280050_235548759|GENSCAN_predicted_CDS_1|300_bp atgatctcccacaactcaggtcctgagcagctcactgggttcctgcctctccgctcctcg ccacgcaaacggctcctgccagaacccaagatggcagcttcagaacagaaagtcaacaac attcttgtgttcactgtggatgtcaaggccaacaaataccagatcaaacaggctaagaaa aagctctgtggcactgacatggccaatatcaacaccctgatcaggcctgatggaaagaag gcccatgttctaccggttcctgactatgatgacgttgccaacaaaattgggatcatctaa >gi568815597f:235280050_235548759|GENSCAN_predicted_peptide_2|112_aa MECGMIDNGDSKGRRHCRSQFNDDITTFTPPGARAAAKQRYQSPPREEEEPEPLPQQPLD PPPFFPISPPGLLVLGGRRREGTLDVPGSDLASEEGAAEPGVLEDTLVPESS >gi568815597f:235280050_235548759|GENSCAN_predicted_CDS_2|339_bp atggagtgtggaatgatagacaatggagattccaaaggccgccgccactgccgctcacag ttcaacgatgatatcacgactttcacgccgccgggagcccgagcggccgccaaacaaagg taccagtcgccgccgcgggaggaggaggagccggagcctctgcctcagcagccgctggac ccgccgcccttcttccccatctctcccccgggcctgctggttttgggggggagaaggaga gaggggactctggacgtgccagggtcagatctcgcctccgaggaaggtgcagctgaacct ggtgttttagaggataccttggtcccagagtcatcatga >gi568815597f:235280050_235548759|GENSCAN_predicted_peptide_3|361_aa MLHNASLLIDDIEDNSKLRRGFPVAHSIYGIPSVINSANYVYFLGLEKVLTLDHPDAVKL FTRQLLELHQGQGLDIYWRDNYTCPTEEEYKAMVLQKTGGLFGLAVGLMQLFSDYKEDLK PLLNTLGLFFQIRDDYANLHSKEYSENKSFCEDLTEGKFSFPTIHAIWSRPESTQVQNIL RQRTENIDIKKYCVHYLEDVGSFEYTRNTLKELEAKAYKQIDARGGNPELVALVKHLRLP LSTRQIKQELAEEYETTKSPVPPAYSLQLLLSSRTSLRWRRDFLQHFRPEPQASLLGSWL EGLLLGTPGVSAGRSHILDSGYIIMSDTLTADVIGRRVEVNGEHATVRFAGVVPPVAAVS V >gi568815597f:235280050_235548759|GENSCAN_predicted_CDS_3|1086_bp atgttgcataatgccagtttactcatcgatgatattgaagacaactcaaaactccgacgt ggctttccagtggcccacagcatctatggaatcccatctgtcatcaattctgccaattac gtgtatttccttggcttggagaaagtcttaacccttgatcacccagatgcagtgaagctt tttacccgccagcttttggaactccatcagggacaaggcctagatatttactggagggat aattacacttgtcccactgaagaagaatataaagctatggtgctgcagaaaacaggtgga ctgtttggattagcagtaggtctcatgcagttgttctctgattacaaagaagatttaaaa ccgctacttaatacacttgggctctttttccaaattagggatgattatgctaatctacac tccaaagaatatagtgaaaacaaaagtttttgtgaagatctgacagagggaaagttctca tttcctactattcatgctatttggtcaaggcctgaaagcacccaggtgcagaatatcttg cgccagagaacagaaaacatagatataaaaaaatactgtgtacattatcttgaggatgta ggttcttttgaatacactcgtaatacccttaaagagcttgaagctaaagcctataaacag attgatgcacgtggtgggaaccctgagctagtagccttagtaaaacacttaaggttaccc ctaagcaccagacagatcaaacaggagcttgctgaggagtatgagaccactaagagtcca gtgcccccagcctacagcctccaactcctcctaagcagccgtacctcactccggtggagg cgggacttcctacagcacttccggccagagcctcaagcttcgctgctgggcagttggctg gaggggctgctgctgggaacacctggagtctccgcgggcagatctcatattttggattct ggatatattataatgagtgacactttgacagcggatgtcattggtcgaagagttgaagtt aatggagaacatgcaacagtacgttttgctggtgttgtccctcccgtggcagctgtttct gtgtaa >gi568815597f:235280050_235548759|GENSCAN_predicted_peptide_4|406_aa MGKKQSRKTGNSKKQSASPPPKEHSSSPATEQSWTENDFDKLREEGFRRSNYSELQEEIQ TKGKEVKNFEKNLDECITRITNTEKCLKELMELKAKAQELPPHHTYSKIDHIVGSKALLS KCKRSDIITNCLSDHSAIKLEFRIKKLTQNRSTTWKLNNLLLNDYWVHNEMKTEIKMFFE TNENKDTTYQNLWDNIQSKIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVE SLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEQVPFLLKLFQSIEKEGILPN SFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFI PGMQGWFNICKSINVIQHINRTKDKNHMIISIDAEKAFDKIQQPSC >gi568815597f:235280050_235548759|GENSCAN_predicted_CDS_4|1221_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcgcctctcctcct ccaaaggaacacagctcctcaccagcaacggaacaaagctggacagagaatgactttgac aaattgagagaagaaggcttcagacgatcaaactactccgagctacaggaggaaattcaa accaaaggcaaagaagttaaaaactttgaaaaaaatttagacgaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaagccaaggctcaagaacta ccaccacaccacacctattccaaaattgaccacatagttggaagtaaagctctcctcagc aaatgtaaaagatcagacattataacaaactgtctctcagaccacagtgcaatcaaacta gaattcaggattaagaaactcactcaaaaccgctcaactacatggaaactgaacaacctg ctcctgaatgactactgggtacataatgaaatgaagacagaaataaagatgttctttgaa accaacgagaacaaagacacaacataccagaatctctgggacaacattcaaagcaaaata caaactaccatcagagaatactacaaacacctctacgcaaataaactagaaaatctagaa gaaatggataaattcctcgacacatacaccctcccaagactaaaccaggaagaagttgaa tctctgaatagaccaataacaggctctgaaattgtggcaataatcaatagcttaccaacc aaaaagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaa caggtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaac tcattttatgaggccagcatcatcctgataccaaagccaggcagagacacaaccaaaaaa gagaattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactg gcaaaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatc cctgggatgcaaggctggttcaatatatgcaaatcaataaatgtaatccagcatataaac agaaccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaa attcaacaaccttcatgctaa >gi568815597f:235280050_235548759|GENSCAN_predicted_peptide_5|128_aa MGQDDDFTSSMCNWHPTGGSFIRPNKVNFGTDFLTAIKNRYVLEDGPEEDRKEQIVTIGN KPVETIGFDSIMKQQSQLSKLQEVSLRNCAVSCAGEKGGVAEACPSILFTESLLLESDYG FYTCARDC >gi568815597f:235280050_235548759|GENSCAN_predicted_CDS_5|387_bp atgggtcaggatgatgattttaccagcagcatgtgtaattggcacccgacaggaggatcc tttattcgtccgaacaaggtaaattttggaacagactttcttactgcaattaagaaccgc tatgtgttagaagatggaccagaggaagatagaaaagagcaaattgttacaattggaaat aaacctgtggagactatcggttttgactctattatgaaacagcaaagtcagctgagcaag ttgcaagaagtttctctgaggaactgtgcagtaagttgtgctggtgaaaaaggaggagtt gctgaagcatgtcctagtatccttttcaccgagagcttgttattggaatctgactatgga ttttatacctgtgccagagactgctaa >gi568815597f:235280050_235548759|GENSCAN_predicted_peptide_6|203_aa MQKDASKFVDLCVLQKCSTSNCIISAKDHTSMRMNVAKASEVTGRFNSQFKTCAICRTVC RMGLKQSYELSVRHLFAETVTELNSCGGSYEHPGRNSQQRPPEPEPSFSSRCCGCKTSMF PSLKYLVVNDNQISQWSFFNELEKLPSLRALSCLRNPLTKEDKEAETARLLIIASIGQLK TLNKCENMVHLKIGNSKHSNHLC >gi568815597f:235280050_235548759|GENSCAN_predicted_CDS_6|612_bp atgcagaaagacgccagcaagttcgtggatctgtgcgtgctgcagaaatgctccaccagc aactgcatcatcagtgccaaggaccacacatccatgcggatgaacgtggccaaggccagt gaggtcacgggcaggtttaacagccagtttaaaacctgtgctatctgcaggactgtttgc aggatgggcctgaaacagtcttatgaactgtccgtgaggcacttgtttgctgaaacagta actgagctgaattcttgtggtggttcctatgaacacccgggaagaaacagccagcagagg ccgcctgagcctgaaccgagtttctcttccaggtgctgcgggtgcaaaacgtccatgttc ccatccttgaagtacctggtagtaaacgacaatcagatatcacaatggtcgtttttcaat gagctagagaagttaccaagtctacgggctttgtcctgcctaagaaaccccctgaccaaa gaggacaaagaagcagagacggcgcgactactcattatcgccagcattggccagctgaag acgctgaacaaatgtgagaatatggtgcacctgaagattgggaactcaaaacacagcaac cacttatgctga >gi568815597f:235280050_235548759|GENSCAN_predicted_peptide_7|567_aa MRNWLVLLCPCVLGAALHLWLRLRSPPPACASGAGPAGLVSDVRRRPFARERNLVCPSAA SAFCFGSSGSEVVFFPPKLLVFLGQDWGFSAVDNVYQLALFPQWKSTHYDVVVGVLSARN NHELRNVIRSTWMRHLLQHPTLSQRVLVKFIIGAHGCEVPVEDREDPYSCKLLNITNPVL NQEIEAFSLSEDTSSGLPEDRVVSVSFRVLYPIVITSLGVFYDANDVGFQRNITVKLYQA EQEEALFIARFSPPSCGVQVNKLWYKPVEQFILPESFEGTIVWESQDLHGLVSRNLHKVT VNDGGGVLRVITAGEGALPHEFLEGVEGVAGGFIYTIQEGDALLHNLHSRPQRLIDHIRN LHEEDALLKEESSIYDDIVFVDVVDTYRNVPAKLLNFYRWTVETTSFNLLLKTDDDCYID LEAVFNRIVQKNLDGPNFWWGKLNWAVDRTGKWQELEYPSPAYPAFACGSGYVISKDIVK WLASNSGRLKTYQGEDVSMGIWMAAIGPKRYQDSLWLCEKTCETGMLSSPQYSPWELTEL WKLKERHTLPSMAATCGTEHLESGEYD >gi568815597f:235280050_235548759|GENSCAN_predicted_CDS_7|1704_bp atgcgaaactggctggtgctgctgtgcccgtgtgtgctcggggccgcgctgcacctctgg ctgcggctgcgctccccgccgcccgcctgcgcctccggggccggccctgcaggcctggtt agtgatgtccgacgccgcccgttcgcccgggagcggaacctcgtgtgcccttcagccgcc agcgctttctgctttgggagctctggctctgaagttgttttcttcccccccaagttgttg gtctttctcggtcaagactggggttttagtgccgtggataatgtgtatcagttggcctta tttcctcagtggaaatctactcactatgatgtggtagttggcgtgttgtcagctcgcaat aaccatgaacttcgaaacgtgataagaagcacctggatgagacatttgctacagcatccc acattaagtcaacgtgtgcttgtgaagttcataataggtgctcatggctgtgaagtgcct gtggaagacagggaggatccttattcctgtaaactactcaacatcacaaatccagttttg aatcaggaaattgaagcgttcagtctgtccgaagacacttcatcggggctgcctgaggat cgagttgtcagcgtgagtttccgagttctctaccccatcgttattaccagtcttggagtg ttctacgatgccaatgatgtgggtttccagaggaacatcactgtcaaactttatcaggca gaacaagaggaggccctcttcattgctcgcttcagtcctccaagctgtggtgtgcaggtg aacaagctgtggtacaagcccgtggaacaattcatcttaccagagagctttgaaggtaca atcgtgtgggagagccaagacctccacggccttgtgtcaagaaatctccacaaagtgaca gtgaatgatggagggggagttctcagagtcattacagctggggagggtgcattgcctcat gaattcttggaaggtgtggagggagttgcaggtggttttatatatactattcaggaaggt gatgctctcttacacaaccttcattctcgccctcaaagacttattgatcatataaggaat ctccatgaggaagatgccttactgaaggaggaaagcagcatctatgatgatattgttttt gtggatgttgtcgacacttatcgtaatgttcctgcaaaattattgaacttctatagatgg actgtggaaacaacgagcttcaatttgttgctgaagacagatgatgactgttacatagac ctcgaagctgtatttaataggattgtccaaaagaatctggatgggcctaatttttggtgg ggaaaactgaattgggcagttgaccgaaccggaaagtggcaggagttggagtacccgagc cccgcttaccctgcctttgcatgtgggtcaggatatgtgatctccaaggacatcgtcaag tggctggcaagcaactcggggaggttaaagacctatcagggtgaagatgtaagcatgggc atctggatggctgccataggacctaaaagataccaggacagtctgtggctgtgtgagaag acctgtgagacaggaatgctgtcttctcctcagtattctccgtgggaactgacggaactg tggaaactgaaggaacgccatacactgcccagtatggcagccacatgtggtactgagcac ctggaaagcggcgagtatgactga >gi568815597f:235280050_235548759|GENSCAN_predicted_peptide_8|29_aa MGLGPGPPTQCEDEDEDEDFYDDPLSLNE >gi568815597f:235280050_235548759|GENSCAN_predicted_CDS_8|90_bp atggggctggggcctggccctcctactcaatgtgaagatgaagatgaggatgaagacttt tatgatgatccactttcacttaatgaatag >gi568815597f:235280050_235548759|GENSCAN_predicted_peptide_9|74_aa XPQTHTGPRPGASLIHNLSEEQDIRKIGGLFKTLPLTSSSHFISSLTTYRYAFLTGFYSK DLIIEPANTSYTNA >gi568815597f:235280050_235548759|GENSCAN_predicted_CDS_9|225_bp nngccacagacccatactggtccgcggcctggggcgtccctcatccataacctcagtgaa gaacaagacatccgaaaaataggagggctattcaagactctacccctcacttcctcctcc cattttatcagcagcctcacaacttacaggtatgccttcctcacaggcttttactccaaa gacctcattattgaacctgcaaacacatcatacaccaatgcctga