GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:22:44 Sequence gi568815591r:55695916_55934511 : 238596 bp : 44.16% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 Intr - 3233 3088 146 1 2 150 83 181 0.972 23.73 1.06 Intr - 11440 11251 190 2 1 114 110 181 0.994 21.54 1.05 Intr - 12081 12026 56 2 2 86 44 25 0.279 -3.48 1.04 Intr - 16770 16581 190 1 1 84 107 182 0.798 18.34 1.03 Intr - 17335 17190 146 0 2 64 52 136 0.731 7.53 1.02 Intr - 30163 30106 58 2 1 108 101 21 0.560 3.34 1.01 Init - 33322 33313 10 1 1 110 97 8 0.597 4.43 1.00 Prom - 35417 35378 40 -8.36 2.00 Prom + 35693 35732 40 -8.56 2.01 Sngl + 36229 36516 288 0 0 102 48 242 0.982 17.19 2.02 PlyA + 37079 37084 6 1.05 3.00 Prom + 38454 38493 40 -4.76 3.01 Init + 40864 41103 240 0 0 63 75 86 0.539 2.33 3.02 Intr + 41360 41613 254 1 2 -9 72 365 0.493 21.43 3.03 Intr + 41620 41947 328 1 1 -50 41 205 0.677 -2.30 3.04 Intr + 42086 42148 63 1 0 65 16 127 0.633 2.11 3.05 Term + 42902 43036 135 1 0 93 47 124 0.868 6.82 3.06 PlyA + 44603 44608 6 1.05 4.08 PlyA - 44671 44666 6 -3.94 4.07 Term - 47057 46974 84 2 0 105 48 34 0.449 -1.25 4.06 Intr - 48183 48089 95 1 2 63 36 90 0.625 0.98 4.05 Intr - 48351 48237 115 0 1 91 82 35 0.757 3.22 4.04 Intr - 48514 48410 105 1 0 105 71 37 0.791 4.11 4.03 Intr - 50744 50633 112 1 1 76 43 50 0.540 -0.32 4.02 Intr - 52297 52240 58 2 1 88 101 49 0.745 4.14 4.01 Init - 65899 65851 49 1 1 86 27 89 0.134 1.71 4.00 Prom - 83484 83445 40 -5.76 5.00 Prom + 88597 88636 40 -3.46 5.01 Init + 89539 89618 80 0 2 77 75 45 0.326 2.63 5.02 Intr + 102049 102186 138 1 0 120 80 -14 0.135 0.58 5.03 Intr + 102262 102358 97 1 1 86 75 80 0.690 6.51 5.04 Intr + 102609 102869 261 2 0 11 72 346 0.477 22.98 5.05 Term + 107287 107364 78 0 0 70 50 -3 0.047 -8.14 5.06 PlyA + 107663 107668 6 1.05 6.09 PlyA - 107708 107703 6 1.05 6.08 Term - 109301 109290 12 2 0 89 28 12 0.266 -6.50 6.07 Intr - 109475 109343 133 1 1 24 88 135 0.981 7.75 6.06 Intr - 111343 111175 169 2 1 74 95 115 0.996 9.80 6.05 Intr - 115511 114908 604 2 1 79 30 517 0.199 37.06 6.04 Intr - 138671 138510 162 2 0 120 84 55 0.918 8.47 6.03 Intr - 147213 147027 187 2 1 55 95 103 0.962 7.39 6.02 Intr - 148803 148608 196 1 1 96 109 77 0.972 9.07 6.01 Init - 149323 149239 85 1 1 78 89 -6 0.392 -0.45 6.00 Prom - 150927 150888 40 -4.76 7.00 Prom + 155540 155579 40 -4.46 7.01 Init + 156661 156825 165 0 0 61 59 128 0.910 6.83 7.02 Term + 162166 162192 27 0 0 82 49 45 0.756 -1.93 7.03 PlyA + 162387 162392 6 1.05 8.00 Prom + 174715 174754 40 -1.86 8.01 Init + 175944 176051 108 2 0 77 39 96 0.236 3.82 8.02 Intr + 179243 179334 92 1 2 77 93 -12 0.152 -3.01 8.03 Intr + 181827 181871 45 0 0 70 83 84 0.887 3.62 8.04 Intr + 182036 182236 201 2 0 -12 89 204 0.931 8.90 8.05 Intr + 183174 183205 32 0 2 91 37 33 0.334 -3.73 8.06 Intr + 191369 191583 215 0 2 108 41 172 0.290 12.93 8.07 Intr + 191646 191765 120 2 0 34 105 113 0.728 8.29 8.08 Term + 192067 192084 18 0 0 128 49 -3 0.727 -1.88 8.09 PlyA + 193134 193139 6 1.05 9.00 Prom + 195809 195848 40 -0.36 9.01 Init + 225683 225919 237 1 0 92 43 163 0.827 10.21 9.02 Intr + 227247 227373 127 2 1 43 68 162 0.490 9.95 9.03 Intr + 227692 227784 93 2 0 38 111 52 0.606 2.44 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 77266 77373 108 0 0 98 38 127 0.960 7.21 S.002 Init - 173772 173670 103 0 1 60 68 115 0.974 5.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:55695916_55934511|GENSCAN_predicted_peptide_1|266_aa MPGHGTTWLQENKLRAPTDSTLCYDRDSTFNVFVGKGQLITGMDQALVGMCVNERHFVKI PPKLAYGSEGVSGVIPPNSVLHFDVLLMDIWNSEDRVQIHTYFKPLSCPQTIQVSDFVRY HYNGTFLDGTLFDSRYSKYSHWQEHPREAVLPRRKDIPGQASLVFDVALLDLHNPKDSIS IENKAVPENCERLSQSGDFLRYHYNGTLLDGTLFDSSYSQNRTFDTYIGQGYVIPGMDEG LLGVCIGEKRRIVVPPHLGYGEEGRX >gi568815591r:55695916_55934511|GENSCAN_predicted_CDS_1|798_bp atgcctggccatgggaccacctggttgcaggaaaacaagctcagggctcccactgattct acgttatgctatgacagagactccactttcaacgtgtttgtgggaaaaggacagctgatc acagggatggaccaggctcttgttgggatgtgcgtaaacgagagacatttcgtgaagatt cccccaaagcttgcctacggaagtgaaggagtttctggtgtgatcccccccaattcagtg cttcattttgatgtacttctgatggatatttggaattctgaagaccgggttcagattcac acctatttcaagcccctgagttgccctcagaccatccaggtgtctgattttgtgaggtac cactacaacgggacgttcctggacggaactctgtttgattcgagatattcaaaatactcc cactggcaggaacatccaagagaagcagtactacccagaaggaaagacattcccggtcag gcatctctggtgtttgatgttgcattattggacctccataaccccaaggacagcatttcc attgagaacaaggcagtacctgaaaactgtgagcggttaagtcaaagtggggactttctc aggtatcattacaatggcacgcttctggatggcaccctctttgattccagctactctcag aaccgcacgtttgacacgtacattgggcagggctacgtgattcctgggatggatgaaggt ctacttggtgtttgcattggagagaagcgaaggattgtggtcccccctcacctggggtat ggagaggaaggaagagnn >gi568815591r:55695916_55934511|GENSCAN_predicted_peptide_2|95_aa MADEKPNQGVKSENNDHTNLKVAGQDGSVVQFKIKRHTPLSKLMKAYCKQQGLSMRQIRF RFDGQPIKETDTPAQLEMENEDTIDVFQQQTGGVY >gi568815591r:55695916_55934511|GENSCAN_predicted_CDS_2|288_bp atggccgacgaaaagcccaaccaaggagtcaagtcggagaacaacgatcatactaatttg aaggtggcggggcaggatggttctgtggtgcagtttaagattaagaggcatacaccactt agtaaactaatgaaagcctattgtaagcaacagggattgtcaatgaggcagatcagattc cgattcgacgggcaaccaatcaaggaaacagacacacctgcacagttggaaatggagaat gaagatacaattgatgtgttccaacagcagacgggaggtgtctactga >gi568815591r:55695916_55934511|GENSCAN_predicted_peptide_3|339_aa MYLAHRPLMSASSEASGGVSMFVWRNVEPCSVAVFSWYSVPFLTPPCSRVRPSNLPVTQW PPTRAKNLPSWQLLLTSVHQSLSALRKEQDSSSEKDGRSPNKWDKDHIRWPMSGVHDLQQ AALGPGRAHQGHPNQDNRTVSQILSERWYTLGPNEMQKYQTWPSRWPTCNKDRKKSSSEA KPTSQGLAGVYKGSWEQSISETGTATAPGVSSERLSVVAQTFQSSDTKEQLLWGRTAAHS QGTWLSLAQAFSHSGVLSLDGREIDRQALQELTQHMASEDTASDEEPMVIHEEEGVGEAE DGLREPETEKAVSSSLHVPWTSAGPDHAALPGPLLLPVH >gi568815591r:55695916_55934511|GENSCAN_predicted_CDS_3|1020_bp atgtacttggcccacaggcccctgatgtctgcgtccagcgaggcgtccggtggcgtcagc atgtttgtgtggaggaatgtggaaccttgttctgtggctgtgttctcctggtactctgtc cccttcctgacccctccctgcagccgcgtgaggcccagcaacctgccagtcactcagtgg cctccaaccagagcaaagaacctgccaagttggcagctgttgctcacgagtgtccaccag tccctcagtgcactgcgcaaggaacaggactcatcttctgagaaggatggacgcagcccc aacaaatgggacaaggaccatatccggtggcccatgagtggcgttcatgatcttcagcaa gcggcactaggccctggcagggcgcaccagggtcaccccaaccaggataaccggaccgtc agccagatcctgagcgagcggtggtacaccctggggcccaatgagatgcagaagtaccag acctggccttccaggtggcccacttgcaacaaggaccgaaagaagtccagctcagaggcc aagcccacaagccaggggctagcaggagtgtacaagggctcatgggagcagagcatatca gagacgggcactgccactgcccctggggtgtcctctgaacgcctgtcagttgtggcccag acattccagagctcggataccaaggagcagcttctgtggggcaggacggctgcacacagt cagggaacctggctcagcctggcccaagccttctcccacagcggggtactcagcctggac ggcagggaaatagaccgtcaggcactacaggaactgacacagcacatggccagtgaggac acagcgagtgacgaggagcccatggtcatccatgaggaggagggggtgggagaagccgag gacgggctcagggaaccggagaccgagaaggcggtgtcctcttcactgcacgtgccctgg accagtgccggccctgatcatgcagctcttccaggcccactgcttcttcctgtccactag >gi568815591r:55695916_55934511|GENSCAN_predicted_peptide_4|205_aa MGFRHIGQAGLELLTSGGTIQLQENKLNTPTDSTLWPSSCPTATFPGTAFAFCSLSRPRM SLTRHLRDELILPARLLPPNNLFGLSACPSPGGLGRPTASSSQAPQAQAPNGLRSVGSST PSLGLPVTSAGPSRPEVGVSRPCLPASPELSPAKLFGPTYCLSVACTGPALAVEQPLQAR LLPPNNLFGLSACPSPGGLGRPTAS >gi568815591r:55695916_55934511|GENSCAN_predicted_CDS_4|618_bp atggggtttcgccacattggccaggctggtctggaactcctgacctcgggtgggaccatc cagttgcaggaaaacaagcttaacacgcccactgattctacattatggccaagctcatgc cccacggcgacttttccaggcacagcttttgccttttgcagcctgtccaggcccagaatg tccttaactcggcatctccgggacgagctcatcctcccagcccgactgctgcctcccaac aacctctttggactcagcgcctgcccatctcctggcggccttggtcggcccacagcttcc tcaagccaagctccccaggcccaggccccaaatggtctccggtcggtgggctcctccacg cccagcttgggcctcccggtgacctctgcaggcccaagtcgtcctgaagtcggcgtctcc cggccctgcctcccagcaagccccgaactttctccagccaagctcttcgggcccacctac tgcctctcggtggcctgtacaggcccagctttggctgtagaacagcctctgcaggcccga ctgctgcctcccaacaacctttttggactcagcgcctgcccatctcctggcggccttggt cggcccacagcttcctga >gi568815591r:55695916_55934511|GENSCAN_predicted_peptide_5|217_aa MPVNPGEKERGMNIRNQTSPWRRRGHRPKPPWLRRGLRKCEPLPGTPAQTMYSAHRPLML AFSKASRGLGMFSHAKPSNLPDTQWPPTRQKNLLSWQLLLMSVHQAQSLSALPKEQNSSS EKDGRSPNKWDKDHIRQPVSGTHDLQQAAPGSGRAHQCHPNQDNWTVSQILGEWWYTLGP NERQKYHDLTSQCWDYRREPLHPAEPEHFLKEDIKGK >gi568815591r:55695916_55934511|GENSCAN_predicted_CDS_5|654_bp atgcctgtgaaccctggtgagaaggagcgtggcatgaacatcaggaaccagacttcaccc tggagaaggagaggccacaggcccaagccaccttggctgaggagggggctgaggaagtgt gagcccctgccaggaacccctgcccagaccatgtactcggcccacaggcccctgatgctt gcattcagcaaggcctcccgtggcctcggcatgttcagccacgcaaagcccagcaacctg ccagacactcagtggcctccaactagacaaaagaacctgctgagttggcagctgttgctc atgagcgtccaccaggcccagtctctcagtgccctgcccaaggaacagaactcatcttct gagaaggatggacgcagccccaacaaatgggacaaggaccacatccggcagcccgtgagt ggcactcatgatcttcagcaagcggcaccaggctctggcagggcccaccagtgtcacccc aaccaggacaactggaccgtcagccagatcctgggcgagtggtggtacactctggggccc aatgagaggcagaagtaccatgacctgacctcccagtgctgggattataggcgagagcca ctgcacccggctgaacctgaacattttttaaaagaagacataaaaggcaaatag >gi568815591r:55695916_55934511|GENSCAN_predicted_peptide_6|515_aa MASSSIHVPAKDMISFFFMAENMCHVLTWETGIGKSTLIDTLFNTNLKDNKSSHFYSNVG LQIQTYELQESNVQLKLTVVETVGYGDQIDKEASYQPIVDYIDAQFEAYLQEELKIKRSL FEYHDSRVHVCLYFISPTGHSLKSLDLLTMKNLDSKVNIIPLIAKADTISKNDLQTFKNK IMSELISNGIQIYQLPTDEETAAQANSSVSPPPNTMAPRKGSSWVAKTNSLGRWKLASFL KDFDCKVEIRIKPIESDRQNLFKEVDKLYNIQILRLPKALRKDEWHEWLNYFSLGGNKQG LEEAATADLDITAGAIQTPLTSAETRKVIQVDEMIVEEEEEENKHKNLQIATVKRCPASK NRTQSVQGKGRRERSSCAITVTLVLGLLDVSIVKPTPGLTPRFDSRVFMIPVENENHCDF VKLRDMLLCTNMENLKEKTHTQHYECYRYQKLQKMGFTDVGPNNQPVSFQEIFEAKRQEF YDQCQREEEELKQRFMQRVKEKEATFKEAEKEVPY >gi568815591r:55695916_55934511|GENSCAN_predicted_CDS_6|1548_bp atggcctccagctccatccatgttcctgcaaaggacatgatctccttcttttttatggct gagaacatgtgccatgttctcacttgggagactggaattggaaaatcgacactgatagac acattgtttaatactaacttgaaagataacaaatcctcacatttttactcaaatgttgga cttcaaattcagacatatgaacttcaggaaagcaatgttcagttgaaattgactgttgtg gagacagtagggtatggtgatcaaatagacaaagaagccagctaccaaccaatagttgac tacatagatgcccaatttgaggcctatcttcaagaagaactgaagattaaacgttccttg tttgagtaccatgattctcgcgtccacgtgtgtctttacttcatttcacctacaggacat tccctgaagtctcttgatctattaacaatgaagaaccttgacagtaaggtgaatattata ccactgattgccaaagcagacactatttctaaaaatgatttacagacgtttaagaataag ataatgagtgaattgattagcaatggcatccagatatatcagctcccaacagatgaagaa actgctgctcaagcgaactcctcagttagtcctccacccaacaccatggctcctagaaag ggcagtagttgggtggccaagaccaactccttagggaggtggaagctcgcctcctttctt aaagacttcgactgcaaagtggaaatacgaatcaagccaattgagtctgacaggcagaac ctcttcaaggaggtggataaactctacaacatccagatcctgcggctccccaaggcactg cgcaaagatgaatggcatgaatggctcaactacttctcccttggaggaaacaaacagggc ttggaagaggcagcaacagctgacctggatatcaccgcgggagctattcaaacacccctg acatctgctgaaacacgaaaggtgatacaagtagatgaaatgatagtggaagaggaagaa gaagaaaataaacataagaatcttcaaattgcaacagtcaaaagatgtcctgcatccaag aacagaactcagtctgtacaaggaaaaggcagaagggaaaggtcaagctgtgctatcact gttaccctagttttgggcttattggatgtgtccatagtgaagccaactccaggcctgaca cccaggtttgactcgagggtcttcatgatccctgtggaaaatgaaaatcactgtgacttc gttaagctccgagatatgcttctttgtaccaatatggaaaatctaaaagaaaaaacccac actcagcactatgaatgttataggtaccaaaaactgcagaaaatgggctttacagatgtg ggtccaaacaaccagccagttagttttcaagaaatctttgaagccaaaagacaagagttc tatgatcaatgtcagagggaagaagaagagttgaaacagagatttatgcagcgagtcaag gagaaagaagcaacatttaaagaagctgaaaaagaggtgccctattaa >gi568815591r:55695916_55934511|GENSCAN_predicted_peptide_7|63_aa MNLLKEKIGETFQDIDLGKYFLDNAPQAQETKAKMDKWDHMKLKSFYTAKKVINKNKKAF IYG >gi568815591r:55695916_55934511|GENSCAN_predicted_CDS_7|192_bp atgaacctactaaaagaaaagattggggaaactttccaggacattgacctgggcaaatat ttcttggataatgccccacaagcacaggaaaccaaagcaaaaatggacaaatgggatcat atgaaattaaaaagcttctacacagcaaagaaagtgatcaacaaaaataaaaaagccttc atctatgggtga >gi568815591r:55695916_55934511|GENSCAN_predicted_peptide_8|276_aa MDNYHTERIAGEIDIPEMSGDGKEQRRAEAISISHRAVCCMLYAVCSSSVAGGRGAGLLS VAAVLGSPAAEQFFPEYENPERDDPSTIEKLSKNKQKPITPETAEKLAHDLKIVKYVECS ALTQKGLKNVFDEATLAALEPPESKKSHRVDPVFDETHERPENSGKARGRGPRGPHSPDE KQTRESWWESGAPAMRIAMKEAGSERRQRMRIGDLRPASCAAIAGRLSSEQAAGPSAPSP SRVHRGGGGGGGGGGGGVRGRSLPKRPLSAEIFGLL >gi568815591r:55695916_55934511|GENSCAN_predicted_CDS_8|831_bp atggataattaccacacagaaagaattgctggagagattgacatacctgagatgtctggg gatggcaaggaacaaagaagggcagaagctatcagtatcagccacagagctgtatgctgt atgctctatgcagtatgcagcagttctgttgcagggggtcggggagcagggttgctgtca gtggcggcagtcctaggcagtcctgcagcagaacagttcttccctgaatacgagaaccca gaaagagatgatccctctactattgagaaactttccaagaacaaacagaagcctatcact ccagagactgctgaaaagctggcccatgacctgaagattgtcaagtatgtggagtgttct gcactcacacagaaaggcctaaagaatgtatttgacgaagcaacattggctgcgctggag cctccagaatcgaagaagagccacagagtagatccagtatttgatgaaactcatgaaaga ccggaaaattctgggaaggctcgaggccgcggtccccggggtccgcatagtcccgatgaa aaacagacccgggaaagctggtgggagtcaggagccccggcgatgaggattgcgatgaag gaagcaggctcggagcgccgccagcgcatgcgtattggggatctgaggccagcgtcttgc gccgccattgcggggaggctgtcctcagagcaggctgcggggccctcggcaccttctccc tcccgggtccaccgcggcggcggcggcggcggcggcggcggcggcggcggcgtcaggggg cggagcctgccgaagcgccctttgtctgcggagatttttggtcttttatag >gi568815591r:55695916_55934511|GENSCAN_predicted_peptide_9|153_aa MVEIARELELEVQPEDVVELLLKDEEKKWFLEMESVPGEDALNNVEMTIVDLEYSINLLD KAAGGFEKTDSNFERSSTVESLTFQDVAVDFTREEWDQLYPAQKNLYRDVMLENYRNLVA LGYQLCKPEVIAQLELEEEWVIERDSLLDTHPX >gi568815591r:55695916_55934511|GENSCAN_predicted_CDS_9|459_bp atggtggaaatagcaagagaactagaattagaagtgcagcctgaagatgtggtggaattg ctgcttaaggatgaggaaaaaaagtggtttcttgagatggaatcagttcctggtgaagat gctttgaataatgtggaaatgacaatagtggatttagaatattccataaacttattagat aaagcagcaggaggatttgaaaagactgactccaattttgaaagaagttctactgtggaa tcactgacgtttcaggatgtggccgtggacttcaccagagaggagtgggaccagctgtac cctgcccaaaagaacctctatcgagacgtgatgctggagaactacaggaatctagttgca ctggggtatcagctttgtaagccagaggtaatcgcgcagttggagctagaggaagaatgg gtgatagaaagagacagcctgctggatactcatccagnn