GENSCAN 1.0 Date run: 4-Nov-116 Time: 13:13:40

Sequence gi568815596f:129880116_130080877 : 200762 bp : 44.39% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.12 Intr -   2076   1994   83  0  2   46  106    84 0.004   4.44
 1.11 Intr -   3121   3096   26  2  2   71   94    27 0.002  -0.66
 1.10 Intr -   7975   7868  108  2  0   92   53    69 0.048   4.06
 1.09 Intr -  10154   9884  271  2  1   57   45    71 0.044  -3.19
 1.08 Intr -  12595  12430  166  0  1   26   78   105 0.135   3.26
 1.07 Intr -  15765  15696   70  1  1  114   44    43 0.024   0.74
 1.06 Intr -  25152  25111   42  1  0  126   97    49 0.747   7.81
 1.05 Intr -  25971  25888   84  1  0  128   78   -17 0.648   1.09
 1.04 Intr -  28679  28542  138  0  0  106   37    52 0.614   2.34
 1.03 Intr -  43566  43443  124  0  1   94  105   117 0.700  14.16
 1.02 Intr -  47142  47051   92  1  2  124   61   -57 0.073  -5.09
 1.01 Init -  54127  54064   64  1  1  105  117   214 0.995  25.31
 1.00 Prom -  63201  63162   40                              -3.36

 2.00 Prom +  69615  69654   40                              -1.66
 2.01 Init + 100001 100685  685  1  1   86  -17   479 0.578  32.47
 2.02 Term + 101617 101747  131  2  2   25   51   121 0.772   0.44
 2.03 PlyA + 101859 101864    6                               1.05

 3.04 PlyA - 102583 102578    6                               1.05
 3.03 Term - 110074 110066    9  1  0  150   37     0 0.903  -0.71
 3.02 Intr - 110423 110246  178  1  1   57   74   204 0.978  15.82
 3.01 Init - 110634 110582   53  0  2   16  101    30 0.393  -2.27
 3.00 Prom - 111886 111847   40                              -8.86

 4.00 Prom + 112292 112331   40                              -2.46
 4.01 Init + 112774 112863   90  0  0   54   91   106 0.575   8.00
 4.02 Term + 113165 113812  648  1  0   44   36   284 0.518  12.88
 4.03 PlyA + 114097 114102    6                               1.05

 5.00 Prom + 116466 116505   40                              -5.16
 5.01 Init + 118371 118409   39  2  0  111   49    50 0.971   3.62
 5.02 Intr + 119482 119697  216  0  0   46  -13   199 0.015   4.30
 5.03 Term + 126007 126573  567  0  0   69   48   460 0.306  34.52
 5.04 PlyA + 126980 126985    6                               1.05

 6.04 PlyA - 127104 127099    6                               1.05
 6.03 Term - 130344 130164  181  2  1   44   49    84 0.003  -2.82
 6.02 Intr - 148368 148194  175  1  1  118   94   166 0.815  19.20
 6.01 Init - 151484 151436   49  2  1   86   58    23 0.196  -1.77
 6.00 Prom - 152637 152598   40                              -5.06

 7.00 Prom + 153233 153272   40                              -5.86
 7.01 Init + 156768 156914  147  2  0   90   85   135 0.585  11.64
 7.02 Term + 157917 158138  222  2  0   30   48   154 0.660   2.52
 7.03 PlyA + 158844 158849    6                               1.05

 8.05 PlyA - 160732 160727    6                               1.05
 8.04 Term - 165465 165442   24  0  0  121   48    26 0.325   0.12
 8.03 Intr - 169217 169017  201  2  0  -12   34   208 0.501   4.88
 8.02 Intr - 169756 169701   56  2  2   98  100    36 0.622   4.50
 8.01 Init - 172472 172244  229  2  1   85    8   158 0.568   4.65
 8.00 Prom - 174280 174241   40                              -9.16

 9.04 PlyA - 174334 174329    6                               1.05
 9.03 Term - 177457 177278  180  1  0   57   42   109 0.474   0.81
 9.02 Intr - 177879 177750  130  2  1   76   92    96 0.959   9.50
 9.01 Init - 180302 180226   77  2  2   81   65    55 0.367   3.06
 9.00 Prom - 192797 192758   40                              -3.46

10.03 PlyA - 193478 193473    6                               1.05
10.02 Term - 195457 194129 1329  1  0   51   43  2067 0.962 190.06
10.01 Intr - 197127 196966  162  0  0   78   53   102 0.955   5.87

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 119482 119763  282  0  0   46   43   265 0.956  13.23

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:129880116_130080877|GENSCAN_predicted_peptide_1|423_aa
MRPLLCALAGLALLCAVGALADVFPPALDPGPPINSGSAFLGFSCLPPVPQAMVEKTVDH
LGTQVKGLLGLLEDLAWNLPGGPFSPVPDLLGKGLLESSRRRFQPLDTEGVGAAHLNTCS
CCCSWVVLLRPSGLLPGLPGNSHLLLETWERYHLLQEAIPDSSLSPGDGESADDLAQRRL
EDRPPCHGRSLDAVVGTAVHEVIVFHMNAWYLPSSCDKGAKEASGPVPPGHSISAPELES
TNHTTKETQVGAGGPRTVELKTELVSDGVGYLAKEISKHRAEGAAWFLLTAYSKMQKKRE
ELKKKLLNKKEPELEYLRSSQSLCISKSEKSCSEGNTTGVAVQPFDKEITDLLVPRGHSE
GLQMPMSVLLAMHGPRHLPASTADDDRCYEVHLLQEEQQRAGAEMSEKHQGGSVQYGTKH
CHG

>gi568815596f:129880116_130080877|GENSCAN_predicted_CDS_1|1269_bp
atgcggcccctgctttgcgcgctggccgggctggccctgctctgtgccgtgggcgctttg
gccgatgtgttcccacccgctctggatccagggcctcccatcaattcaggaagtgcattc
cttgggttctcttgtcttcctcctgtccctcaggctatggtagagaagaccgtggatcac
ctggggacacaggtgaaaggcctgctgggcctgctagaggacctggcctggaacctgccc
gggggacccttcagccccgtccccgacctcctcggaaaagggttgctggagagctcaagg
aggcggttccagccacttgacaccgagggagtaggagccgctcacctgaacacctgcagc
tgctgctgcagctgggtggtcctgctgcgcccctctggcctcctccctggcctcccaggt
aactcccacttgctacttgagacttgggaaaggtaccacctcctccaggaagccatccct
gattccagcctgagcccaggagatggtgagagcgctgatgacctagcccaaaggcggctg
gaagacaggcccccatgccatggccgatcactggatgctgtggtgggcacagcagtgcat
gaggtcattgtctttcacatgaatgcctggtacctgccaagctcctgtgacaaaggggcc
aaggaggcctcaggccccgtccctcctgggcacagcatttctgcccccgagctagagtct
accaaccataccaccaaggagacccaggtcggggcaggtggccctcgcactgtggagctg
aagacggagcttgtgagtgatggagttggatatctagccaaggagatttctaagcataga
gctgaaggagcagcttggttcctcctgactgcttatagtaaaatgcaaaagaagagagag
gaattgaagaagaaactcttaaacaaaaaagaaccagaacttgaatatctgagaagttct
cagtctctctgtatttcaaaaagtgagaaatcttgttctgaaggaaacactacaggtgtg
gctgtacagccatttgataaagaaatcacggatctcctggtgccaagggggcattcagag
ggcctgcagatgcccatgtctgtcctactggctatgcatggtcccagacacctgcctgct
tctactgctgatgacgacaggtgctatgaggtgcatcttctgcaggaggaacagcaacga
gctggagcagaaatgagtgagaagcatcaagggggatcagtgcagtatggcacaaagcac
tgccatggn

>gi568815596f:129880116_130080877|GENSCAN_predicted_peptide_2|271_aa
MSAGGDFGNPLRKFKLVFLGEQSVAKTSLITRFRYDSFDNTYQAIIGIDFLSKTMYLEDG
TIGLRLWDTAGQERLRSLIPRYIRDSAAAVVVYDITNVNSFQQTTKWIDDVRTERGSDVI
ITLVGNRTDLADKRQVSVEEGERKAKGLNVTFIETRAKAGYNVKQLFRRVAAALPGMEST
QDGSREDMSDIKLEKPQEQTVSEGGCSCYSPMSSSTLPQKPPYSFIDCKEPVEEFKFGTT
IKTTELAEITISKAYQQKNPQQNSKNFAQDI

>gi568815596f:129880116_130080877|GENSCAN_predicted_CDS_2|816_bp
atgtccgcgggcggagacttcgggaatccgctgaggaaattcaagctggtgttcctgggg
gagcaaagcgttgcaaagacatctttgatcaccagattcaggtatgacagttttgacaac
acctatcaggcaataattggcattgactttttatcaaaaactatgtacttggaggatgga
acaatcgggcttcggctgtgggatacggcgggtcaggaacgtctccgtagcctcattccc
aggtacatccgtgattctgctgcagctgtagtagtttacgatatcacaaatgttaactca
ttccagcaaactacaaagtggattgatgatgtcagaacagaaagaggaagtgatgttatc
atcacgctagtaggaaatagaacagatcttgctgacaagaggcaagtgtcagttgaggag
ggagagaggaaagccaaagggctgaatgttacgtttattgaaactagggcaaaagctgga
tacaatgtaaagcagctctttcgacgtgtagcagcagctttgccgggaatggaaagcaca
caggacggaagcagagaagacatgagtgacataaaactggaaaagcctcaggagcaaaca
gtcagcgaagggggttgttcctgctactctcccatgtcatcttcaacccttcctcagaag
cccccttactctttcattgactgcaaagaaccagtggaagaatttaaatttggcactacg
atcaaaactactgaattagcagaaataacgatatctaaagcttaccagcaaaagaaccct
cagcagaatagcaaaaactttgctcaggacatttga

>gi568815596f:129880116_130080877|GENSCAN_predicted_peptide_3|79_aa
MRLGLAVESKELMHALNRWLDNAVIDEITPKLIRDLPNSCTYRKALGEMVVQQESRNVTI
AIIRPSTVGATWHDPFPDV

>gi568815596f:129880116_130080877|GENSCAN_predicted_CDS_3|240_bp
atgagattgggcttagcagtggagagcaaagagctcatgcatgctttgaacaggtggtta
gataacgctgttattgacgagatcacacccaagctgatcagagatctgcccaattcttgc
acctaccgcaaggccttgggagaaatggtggtgcagcaggagagcagaaatgtaaccatc
gccatcataaggccctccactgtgggagcgacgtggcacgaccctttcccagatgtatag

>gi568815596f:129880116_130080877|GENSCAN_predicted_peptide_4|245_aa
MFAKGKGSLVPSDGQAGEKLALYVYEYLLQPPPHNPSSMMGPHSQPFMSPRYAGGPRPPD
QNGKPASGRSSWDTAIAAQFHGSRTTTRPSPHGRINAENEPSPRHGAHGSQPTELRQRHE
TTTQFPRPPMPGINMGPGAGRPWPNPNSANSIPYSSSSPGTYVGPPGGGGPPGTPTMPSP
SDSTNSSDIYTMINPVPPGGSRSNFPMGPGSDGRMGSMGGMEPHHMNGLLGSGDIDGFQK
ILLTT

>gi568815596f:129880116_130080877|GENSCAN_predicted_CDS_4|738_bp
atgtttgccaaaggcaaaggctccttggtgccctcggacgggcaggctggggaaaagtta
gctttatacgtctacgaatatttactgcagcctccacctcacaatcctagcagcatgatg
ggaccccacagtcagcctttcatgtcaccgcgatacgcaggcggccccaggccccccgat
cagaatgggaaaccagcctccgggaggagttcctgggacacagccattgctgcccaattc
catggatcccgcacgacaacaaggccatccccacatgggaggatcaatgcagagaatgaa
ccctccccgaggcatggggcccacgggtcccagcccacagaattacggcagcggcatgag
accaccacgcaattccctcggccccccatgcccgggattaacatgggcccgggagccggc
agaccctggcccaatcctaacagtgctaactcaattccatactcctcctcatcacctggt
acctatgtgggaccccctggtggtggcggccctccaggaacacccactatgcccagtccc
tcagattcaacaaattccagcgacatctacacaatgattaatccggtgccgcctggaggc
agccggtccaacttcccgatgggtcccggctcggacggtcggatgggcagcatgggcggc
atggagccacaccacatgaacggattgttagggtcaggcgacatagatggcttccaaaaa
attctcctaacaacgtaa

>gi568815596f:129880116_130080877|GENSCAN_predicted_peptide_5|273_aa
MAEYEELRSTAPSECSSSPATEQSWMENDFDKLREEGFRQSNYSKLQEEIQTNGKEVKNC
EKKLDKSITRITNAEKSLKELMELKRTSRLPAAALTAGPDVPADAADSACAMGLPTLEFS
DSYLDSPDFRERLQCQEIELERTNKFIKELIKEGSPLTGALRTGNVDCLPSSLTLSPFPK
EHTSTQVGDLSMAVQKFSQSLQDFQFECIDNAQTDDEISITQSLKEFARLLIAVEEERRR
LIQNANDVLIAPLEKFRKEQIGAAKGGKKFDKE

>gi568815596f:129880116_130080877|GENSCAN_predicted_CDS_5|822_bp
atggccgaatatgaagagctccggtctacagctcccagcgaatgcagctcctcaccagca
acggaacaaagctggatggagaatgactttgacaagctgagagaagaaggcttcagacaa
tcaaactactccaagctacaggaggaaattcaaaccaatggcaaagaagttaaaaactgt
gaaaaaaaattagacaaatcgataactagaataaccaatgcagagaagtccttaaaggag
ctgatggagctgaaacgcacgtcgcggctgccagctgctgccctgaccgccggcccagac
gtgcccgcggacgccgctgacagcgcctgtgccatggggctgcctactctggagttcagc
gattcctacttggacagcccggatttcagggagcgcttgcagtgtcaggagattgaactg
gagcgaaccaacaagttcatcaaggagctcattaaggagggctctccgctcactggggcg
ttgaggacaggtaatgttgattgcctacccagttcccttaccctttcaccctttccaaag
gaacacacctctacccaggttggggatctgtctatggcagtgcagaaattttcccagtca
ttacaagatttccaatttgaatgtattgataatgctcaaacagatgatgaaattagtatt
actcagtcactaaaagaatttgcaaggctactcattgcagtagaagaagaaaggcgaaga
ctgatccaaaacgctaatgatgtattaattgcaccacttgagaaatttcgaaaagaacag
attggtgcagcaaaaggtggaaagaagtttgacaaagagtga

>gi568815596f:129880116_130080877|GENSCAN_predicted_peptide_6|134_aa
MGFLHVGQAGLELPTSGFQVLATFEIPIPFERALTRPYADFTTSNFRTQYWNAISQQAPA
IIYDFYLWLTGRKPSKGTIASTVATTYALPSDTRVRLHIHVLCNKKALNLNGTSIELAHR
QLRSRNTLVCVQVK

>gi568815596f:129880116_130080877|GENSCAN_predicted_CDS_6|405_bp
atggggtttctccacgttggtcaggctggtcttgaactcccgacctcaggtttccaggtc
ttggcaacctttgaaattccaattccatttgagagagctttgacgaggccatatgctgat
ttcaccaccagcaacttcagaacccagtactggaatgccatcagccagcaggcccctgcc
atcatctatgacttctatctgtggctcactggaaggaaacccagcaaagggaccatcgcc
agcactgtggctacaacatatgccttgccatcagatacaagagttaggcttcatatccac
gtcctgtgcaataaaaaagctttaaatctgaatggaacatccatagaactagctcacaga
caactcagaagcaggaacactttggtctgtgttcaagtaaaatga

>gi568815596f:129880116_130080877|GENSCAN_predicted_peptide_7|122_aa
MPACPRPDQPSRAATHRCPSSQLRAMWPSESILPSISTFTSPYRKRRLQAVAAAATALGL
APPVARGLLVVAPASPLELLEATPKRGSCSWPGNCNTRHTYVATARPTARILPCRHICAA
HK

>gi568815596f:129880116_130080877|GENSCAN_predicted_CDS_7|369_bp
atgcccgcctgtccgcgccccgaccagccctcccgggcagccactcaccggtgtccgtct
tcccagcttcgcgccatgtggccaagtgaatccatcctgccgtccatctccactttcacc
agcccgtaccgcaagcgccgcctgcaggcggtggccgccgccgccacagccctgggcctg
gcacccccagtcgcccgcggcctccttgtggtggcaccggcgtccccgctggagctgctg
gaggccacgcccaagcgtggcagctgctcctggcctgggaactgcaacaccaggcatacc
tatgtggctacggccagacctacagcaagaattctcccctgcaggcacatctgcgcagca
cacaagtga

>gi568815596f:129880116_130080877|GENSCAN_predicted_peptide_8|169_aa
MVIVGLAAGVLLVGPGDGGLISEGVVREDLMCGVWSAGTWSVGTAERCLEKPGALHVIEG
PLDSWDGPVMPNGPVKNHKGEQQEVPSKHPQMALEICLCLDFLYYPFLRGDASAGPVTWC
TTSDTIILQQHRTLTSQGVDDFLKAKATFKASDFIDALVLSKFLEALIE

>gi568815596f:129880116_130080877|GENSCAN_predicted_CDS_8|510_bp
atggtcattgttggcctggcagcgggggtcttgttagtgggtcctggtgatgggggtcta
atcagtgagggtgtggtcagggaagacctgatgtgtggggtctggtcagcaggaacctgg
tcagtggggactgctgagcgctgcttggagaagccaggtgcattgcacgttatcgagggc
cctctggacagctgggatggcccagtgatgcccaatggcccggtcaaaaatcataaagga
gaacagcaagaggtccccagcaaacatccacagatggccttggaaatctgcctgtgcttg
gacttcctgtactacccattcctgaggggcgatgcttctgcagggcctgtgacttggtgc
acaacttcagacaccatcatcttgcagcagcaccgcaccctcactagccagggtgttgat
gacttcctcaaggccaaggccacattcaaggcttcggactttattgatgcgcttgtgctg
agcaagttcctagaagctttaattgaatga

>gi568815596f:129880116_130080877|GENSCAN_predicted_peptide_9|128_aa
MHKPQCNNCATNGATERKRAVGSGAGPWGLAEKNGLLGQLHGPAAVCILRTLLPASLQPQ
LQPWLNDAQCGKATGIQNSPVHERQLWVLNAAKPQVQICPRPWEPSPHSPVPWMWDKDSK
RMILELSD

>gi568815596f:129880116_130080877|GENSCAN_predicted_CDS_9|387_bp
atgcacaagcctcagtgcaacaactgtgctacaaatggagccacagagaggaaacgagca
gtaggctcaggagccgggccctggggcctagcagagaagaatggtttactgggccagctc
catggccctgctgctgtgtgcatcctcaggacactgctgcctgcatccctgcagccccag
ctccagccatggctgaatgatgcacagtgtggaaaagctacaggcattcaaaacagccct
gtccatgagaggcagctgtgggtgctgaacgctgcaaagccacaggtgcagatctgccca
aggccttgggagcccagccctcacagccctgtgccatggatgtgggacaaggattcaaaa
aggatgattttggagctgtcggattga

>gi568815596f:129880116_130080877|GENSCAN_predicted_peptide_10|496_aa
GQLQYFMKIKISSSDEQNDTQKQFCEEQNTGILHDEILIHEEKQIEVVEKMNSELSLSCK
KEKDILHENSTLREEIAMLRLELDTMKHQSQLREKKYLEDIESVKKRNDNLLKALQLNEL
TMDDDTAVLVIDNGSGMCKAGFAGDDAPRAVFPSIVGRPRQQGMMGGMHQKESYVGKEAQ
SKRGILTLKYPMEHGIITNWDDMEKIWHHTFYNELRVAPEEHPVLLTEATLNPKANREKM
TQIMFETFNTPAMYVAIQAVLSLYTSGRTTGIVMDSGDGVTHTVPIYEGNALPHATLRLD
LAGRELPDYLMKILTEHGYRFTTMAEREIVRDIKEKLCYVALDFEQEMATVASSSSLEKS
YELPDGQVITIGNERFRCPEALFQPCFLGMESCGIHETTFNSIMKSDVDIRKDLYTNTVL
SGGTTMYPGMAHRMQKEIAALAPSMMKIRIIAPPKRKYSVWVGGSILASLSTFQQMWISK
QEYDESGPSIVHRKCL

>gi568815596f:129880116_130080877|GENSCAN_predicted_CDS_10|1491_bp
ggacagctccagtatttcatgaaaattaaaatttcttctagtgacgaacaaaatgatact
cagaagcaattttgtgaagaacagaacactggaatattacacgatgagattctgattcat
gaagaaaagcagatagaagtggttgaaaaaatgaattctgagctttctcttagttgtaag
aaagaaaaagacatcttgcatgaaaatagtacgttgcgggaagaaattgccatgctaaga
ctggagctagacacaatgaaacatcagagccagctaagagaaaagaaatatttggaggat
attgaaagtgtgaaaaaaaggaatgataatcttttaaaggctctacaattgaatgagctc
accatggatgatgataccgctgtgctcgtcattgacaacggctctggcatgtgcaaggcc
ggctttgcgggcgacgatgccccccgggctgtcttcccttccatcgtggggcgccccagg
cagcagggcatgatggggggcatgcatcagaaagagtcctatgtgggcaaggaggcccag
agcaaaagaggcatcctgaccctgaagtaccccatggaacacggcatcatcaccaactgg
gatgacatggagaagatctggcaccacaccttctacaacgagctgcgtgtggctcccgag
gagcaccccgtcctgctgaccgaggccaccctgaaccctaaggccaaccgcgagaagatg
acccagatcatgtttgagaccttcaacaccccagccatgtacgtggccatccaggctgtg
ctgtccctgtacacctctggccgtactactggcatcgtgatggactctggtgacggggtc
acccacactgtgcccatctatgaggggaatgccctcccccatgccaccctgcgcctagac
ctggctgggcgggaactgcctgactacctcatgaagatcctcaccgagcatggctatagg
ttcaccaccatggccgagcgggaaatcgtgcgtgacatcaaagagaagctgtgctatgtt
gccctggacttcgagcaggagatggccacggtggcctccagctcctccctagagaagagc
tacgagctgcccgatggccaggtcatcaccatcggcaacgagcggttccgctgccccgag
gcgctcttccagccttgcttcctgggcatggaatcctgtggcatccatgaaactaccttc
aactccatcatgaagtctgatgtggacatccgcaaagacctgtacaccaacacagtgctg
tctggcggcaccaccatgtaccctggcatggcccacagaatgcagaaggagatcgctgcc
ctggcgcctagcatgatgaagatcaggatcattgctcctcccaagcgcaagtactccgtg
tgggtcggtggctccatcctggcctcgctgtccaccttccagcagatgtggatcagcaag
caggagtatgatgagtcaggcccctccattgtccaccgcaaatgcttgtag