GENSCAN 1.0 Date run: 3-Nov-116 Time: 02:46:39

Sequence gi568815590f:22904435_23117466 : 213032 bp : 46.74% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.12 PlyA -    246    241    6                               1.05
 1.11 Term -    686    577  110  0  2   72   28    72 0.083  -1.73
 1.10 Intr -   1446   1286  161  2  2   82   71    46 0.077   1.93
 1.09 Intr -   4802   4642  161  2  2   54   41    96 0.269   0.29
 1.08 Intr -  15876  15750  127  2  1  102   86   112 0.833  13.08
 1.07 Intr -  23286  23150  137  0  2   92  100   113 0.098  12.27
 1.06 Intr -  23941  23905   37  0  1   75   10    31 0.030  -7.74
 1.05 Intr -  25148  24875  274  0  1   68   50   170 0.063   7.80
 1.04 Intr -  47844  47775   70  0  1  107   75    41 0.333   3.45
 1.03 Intr -  53191  53111   81  1  0   55   75    55 0.217   0.73
 1.02 Intr -  57197  57144   54  2  0   78   75    71 0.267   3.98
 1.01 Init -  68666  68409  258  2  0   97   42   123 0.021   5.74
 1.00 Prom -  68751  68712   40                              -5.06

 2.00 Prom +  74515  74554   40                              -4.16
 2.01 Init +  82364  82487  124  1  1   81   99    31 0.703   2.83
 2.02 Intr +  83058  83129   72  1  0  134  111    36 0.830   9.88
 2.03 Intr +  93266  93336   71  0  2  108   90    45 0.002   5.60
 2.04 Intr +  93429  93454   26  2  2   50   98    29 0.001  -3.08
 2.05 Intr +  99991 100192  202  1  1  148   94   244 0.985  30.29
 2.06 Intr + 100938 101041  104  2  2  117   75   142 0.999  14.77
 2.07 Intr + 101526 101711  186  0  0  120   63   188 0.999  18.30
 2.08 Intr + 102294 103312 1019  0  2   26   80  1577 0.999 141.00
 2.09 Intr + 103559 103677  119  0  2   87   94   182 0.999  18.88
 2.10 Intr + 106104 106254  151  2  1  129   93   233 0.999  27.64
 2.11 Intr + 110256 110344   89  1  2   96   69   165 0.999  15.09
 2.12 Intr + 111204 111309  106  2  1   94   99   204 0.992  21.89
 2.13 Term + 112818 113035  218  1  2   88   50   241 0.993  17.61
 2.14 PlyA + 113911 113916    6                              -0.45

 3.16 PlyA - 115725 115720    6                              -1.75
 3.15 Term - 118220 118075  146  0  2   76   32    56 0.598  -3.13
 3.14 Intr - 118550 118263  288  0  0   76   90   286 0.688  24.82
 3.13 Intr - 119826 119754   73  0  1   91   89    21 0.842   1.48
 3.12 Intr - 122854 122699  156  1  0  100  105   134 0.989  16.51
 3.11 Intr - 123319 123288   32  2  2  101   70    45 0.989   1.85
 3.10 Intr - 124007 123897  111  0  0  118   96    56 0.972   9.75
 3.09 Intr - 124168 124095   74  0  2   84   87    45 0.950   3.05
 3.08 Intr - 125287 125176  112  2  1   53   76   148 0.990   9.54
 3.07 Intr - 126438 126325  114  1  0  115   92   -10 0.662   2.52
 3.06 Intr - 133419 133314  106  0  1   85   79    85 0.953   7.09
 3.05 Intr - 145613 145405  209  0  2   80   68    82 0.074   4.20
 3.04 Intr - 164508 164317  192  1  0   53   97    92 0.336   6.06
 3.03 Intr - 166434 166300  135  1  0   74   66    60 0.456   2.94
 3.02 Intr - 167991 167685  307  0  1   82   97   158 0.933  12.12
 3.01 Init - 168101 168081   21  2  0   82  100     5 0.903   0.81
 3.00 Prom - 173912 173873   40                              -3.86

 4.00 Prom + 177191 177230   40                              -7.46
 4.01 Init + 177731 177841  111  1  0   59   42    89 0.224  -0.35
 4.02 Intr + 180002 180111  110  1  2   60  109    92 0.356   7.58
 4.03 Intr + 180280 180382  103  1  1   50   72    69 0.758   1.68
 4.04 Term + 186854 186922   69  1  0  138   47    55 0.933   4.34
 4.05 PlyA + 187664 187669    6                               1.05

 5.02 PlyA - 189616 189611    6                               1.05
 5.01 Sngl - 191530 191381  150  1  0   98   42   148 0.855   5.37
 5.00 Prom - 192337 192298   40                              -6.16

 6.00 Prom + 198451 198490   40                              -3.46
 6.01 Init + 198568 198747  180  0  0   62   81   190 0.610  14.91
 6.02 Intr + 207286 207391  106  0  1  134   55    55 0.980   6.59
 6.03 Intr + 210223 210336  114  2  0  123   92    23 0.731   6.62
 6.04 Intr + 211074 211182  109  1  1   37   94   103 0.854   5.14
 6.05 Term + 212207 212597  391  2  1  115   48   322 0.965  25.36

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 95388 95245 144 0 0 31 86 234 0.995 15.62 S.002 Init + 96710 96756 47 1 2 62 64 49 0.819 0.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:22904435_23117466|GENSCAN_predicted_peptide_1|489_aa MTFLNRRLLRTGSKVEKHQPGQGKGVKAGEVPGKGGYACAKAQGPGGEHAFPGNCTHRRM AARQIQKGRRRDEAEKEKEGHIWKGLLFVNIVSVMSGINDVPYEFICISLLGHKNKQPNP QFGPGTHMLPQVVQLVPPQAELGCTMAIVSRYSPDGVIIVPISKDWHEDSEIKVPGTWSK EEPPLEVGSTAATCTVGSPFFQAAITVLKELTPRKVETSLVDAIWKMSFLVLGSPSGSIS LAVAVLWPFGAGPSELASVPMGWTMRLVTAALLLGLMMVVTGDEDENSPCAHEALLDEDT LFCQGLEVFYPELGNIGCKVVPDCNNYRQKITSWMEPIVKFPGAVDASLAGRRRSPSISR GCNGKSRHQGEIGDRHPTPGTCELAEWLGPPMQEGIAEGRWETGPREVYLFQYVIVEQLF PPPRSSFPFTSNIPGPRGEEPARLRPVRWEQQAEYTPQAKVQIASTFLDESDDSIMPIKV ISMSIVFEQ >gi568815590f:22904435_23117466|GENSCAN_predicted_CDS_1|1470_bp atgactttcctgaacaggagacttctgagaacaggcagcaaggttgagaagcaccagcca ggtcaggggaagggagtgaaggctggagaagttcccggcaagggaggctatgcctgtgca aaggcgcaggggcctggtggagagcatgccttcccgggaaactgcacccatcgcaggatg gctgcacggcagattcagaaaggcaggaggagggatgaagccgagaaggaaaaagaaggc catatctggaagggccttctctttgtgaatatcgtttcagtgatgtctggcatcaatgat gtaccttacgagttcatctgcatctcgttattgggccacaagaacaagcagcccaaccct cagtttggtccaggaacacacatgctgccccaggtggttcagctggtgccaccacaagca gaactaggttgtacaatggccattgtttcaagatactcgccagatggggtgataatagtg cctatttccaaggattggcatgaggattcggagataaaagtgcctggcacgtggagtaag gaagaaccgccactggaagtgggctcaacagcagcaacctgcactgtgggctctcccttc ttccaggccgccatcactgtgctaaaggagttgacacccagaaaagttgagacttctcta gtggatgccatttggaagatgagtttcttggtcttgggcagcccttctggcagcatctcc ttggctgtggctgtcctctggcccttcggagctggcccttcggagctggcctcggtgccc atgggttggacaatgaggctggtcacagcagcactgttactgggtctcatgatggtggtc actggagacgaggatgagaacagcccgtgtgcccatgaggccctcttggacgaggacacc ctcttttgccagggccttgaagttttctacccagagttggggaacattggctgcaaggtt gttcctgattgtaacaactacagacagaagatcacctcctggatggagccgatagtcaag ttcccgggggccgtggacgcctcgctggcgggtaggcggagatcgccctctatctcccgg 
ggctgcaatgggaagagccgacaccagggggagataggtgaccgtcatcctaccccagga acctgtgagctggctgagtggctggggccaccaatgcaggaaggcattgcagagggcaga tgggaaacagggccacgtgaggtgtacctttttcagtatgttatcgtggaacagctattt cctcctcctcgttcatcttttccattcaccagcaacatcccagggccccgtggagaggaa cctgccaggctgagacctgtgaggtgggagcagcaggcagagtatacaccgcaggcaaaa gtacaaattgcttcaacttttctagatgaatctgatgattcaataatgcctatcaaagtc attagcatgtccattgtgtttgagcagtaa >gi568815590f:22904435_23117466|GENSCAN_predicted_peptide_2|828_aa MASLDFTLDQSCFTCLPAGQPRMSGHCEPIAFLLFVFYLCADLVENTTSGQGIVTGQQSP RGEVAGAMVLPGQVKGDVNWHLAFSWSLEGLFGTQKMGSRLMDSDMDYERPNVETIKCVV VGDNAVGKTRLICARACNATLTQYQLLATHVPTVWAIDQYRVCQEVLERSRDVVDDVSVS LRLWDTFGDHHKDRRFAYGRSDVVVLCFSIANPNSLHHVKTMWYPEIKHFCPRAPVILVG CQLDLRYADLEAVNRARRPLARPIKPNEILPPEKGREVAKELGIPYYETSVVAQFGIKDV FDNAIRAALISRRHLQFWKSHLRNVQRPLLQAPFLPPKPPPPIIVVPDPPSSSEECPAHL LEDPLCADVILVLQERVRIFAHKIYLSTSSSKFYDLFLMDLSEGELGGPSEPGGTHPEDH QGHSDQHHHHHHHHHGRDFLLRAASFDVCESVDEAGGSGPAGLRASTSDGILRGNGTGYL PGRGRVLSSWSRAFVSIQEEMAEDPLTYKSRLMVVVKMDSSIQPGPFRAVLKYLYTGELD ENERDLMHIAHIAELLEVFDLRMMVANILNNEAFMNQEITKAFHVRRTNRVKECLAKGTF SDVTFILDDGTISAHKPLLISSCDWMAAMFGGPFVESSTREVVFPYTSKSCMRAVLEYLY TGMFTSSPDLDDMKLIILANRLCLPHLVALTEQYTVTGLMEATQMMVDIDGDVLVFLELA QFHCAYQLADWCLHHICTNYNNVCRKFPRDMKAMSPENQEYFEKHRWPPVWYLKEEDHYQ RARKEREKEDYLHLKRQPKRRWLFWNSPSSPSSSAASSSSPSSSSAVV >gi568815590f:22904435_23117466|GENSCAN_predicted_CDS_2|2487_bp atggccagtctagattttacactagatcagtcctgcttcacatgtctcccggctggccag ccccggatgtccggccactgtgagccgatagcttttctgctgtttgttttctacctgtgt gcagacctggtcgagaacactacttcaggacagggaatagtgacaggccagcagagcccc agaggcgaggtggcaggggctatggtcctgccaggacaggtgaaaggtgatgtcaactgg catctggcattcagctggtctctggaggggctctttggcacacagaagatggggtcccgt ttaatggattctgacatggattatgaaaggccaaacgtagagaccatcaagtgcgttgtg gtgggggacaacgccgtgggtaagaccaggctcatctgtgcccgcgcttgcaatgccacc ctcacccagtaccagctgctggccacgcatgtgcccacagtatgggccatcgaccaatat cgtgtgtgccaggaggtgctggaacgctcccgagacgtggtagatgatgtcagcgtctct ctgcgcctctgggacacctttggagaccaccacaaagaccgtcgctttgcttatgggaga 
tctgatgtggtggttctgtgcttctccattgccaaccccaattccctccaccatgtcaag accatgtggtacccagaaatcaagcacttctgcccccgagcacctgtcatcttggtgggc tgccagttggacctgcgctacgctgacctggaggctgtcaacagggctaggcgacccttg gctaggcccatcaaacctaatgaaatcctgcccccagagaagggtcgggaggtggccaag gagctgggcatcccctactatgagaccagcgtggtggcccagttcggcatcaaggacgtc tttgacaacgccatccgagctgcactcatctcccgccgccacctgcagttctggaagtcc cacctccgcaatgtgcagcggcctctgctgcaggcacccttcctaccccccaagccaccg cccccgatcatcgtggtgcccgaccctccctccagcagcgaggagtgccccgcccacctc ctggaggacccgctctgcgcggacgtcatcctggtgctgcaggagcgggtgcgcatcttt gcccacaagatctacctctccacctcttcctccaagttctatgacctgttcctcatggac ctgagtgagggggagctggggggcccctcggagccagggggcacccacccagaggaccac cagggccactctgatcaacaccaccaccatcaccaccaccaccatgggcgagacttcctg ctccgagcagccagctttgacgtgtgcgagagcgtggatgaggctgggggctccggtcct gctggcctccgtgcttccaccagcgacgggatcttacggggcaacggaacagggtaccta ccgggcaggggtcgtgtgctgtcttcctggagccgagcttttgtgagcatccaggaagag atggcagaagatcctctcacctacaaatcccggctgatggtggtggtgaagatggacagt tccatccagccggggcccttccgggctgtcctcaagtacctgtacacgggggagctagat gagaacgagcgtgacctcatgcacattgcccacattgctgagctgctcgaggtctttgat ctgcgcatgatggtggccaacattctcaacaatgaggccttcatgaaccaggagatcacc aaggccttccacgtccgccggaccaaccgggttaaggagtgcttggcaaaaggcaccttc tcagatgtgaccttcatcctggatgatggcaccatcagcgcccacaagcccctgttgatt tccagctgtgactggatggctgccatgtttggggggccatttgtggagagctccacccgg gaggtggtgtttccctacacaagcaagagctgcatgcgggccgtgctggaatacctctac accggcatgttcacctccagccccgacctggatgacatgaagctcatcattctagccaac cgcctctgcctgccacacctggttgccctcacagagcagtacacagtgaccgggctgatg gaagcgacccagatgatggtggacatcgatggggacgtccttgtgttcctggaactggct cagttccactgtgcgtaccagctggccgactggtgtctccaccacatctgcaccaactac aacaacgtgtgccgcaagttcccccgagacatgaaggccatgtccccagaaaaccaggag tatttcgagaagcatcggtggccacctgtgtggtacctgaaggaggaagatcattaccag cgggcacggaaggagcgtgagaaggaggactacctccacctcaagcggcagcccaaacgg cgttggctcttctggaacagtccatcctccccgtcttcctcggcagcctcctcctcatcc ccatcttcctcctcggctgtggtctga >gi568815590f:22904435_23117466|GENSCAN_predicted_peptide_3|691_aa 
MAAIIQKVVRKIHYMKMCSLWLGAGHQLPNSTGSGHGTVSTVPIPNLEARASLPCHFWMQ DREPELGLWSLSRGCEQPSVATRIETFPAHSSNWGTGWVPNRKTIGRVAAGASPTSTSPC SRAPSPIDHPRAEECGRTAREHGAGLAGSSTCNPGLKPTGLRDYKSVPYRHGTTGTERPG RFGGPEKARPRTQGGAGSQAWAPGPQDPCARCRRGPAVVEKGTCRFAHKQPSGGSPGETS PDSCWDPPEDCLIASYQPLNAALSLSSGWRLFCSQTKLRVGLPNNEMQIPGHMNPSNGRS NRMYWRIIQSIDSLLENLVAALQGHHISEDGRDCISCKYGQDYSTHWNDLLFCLRCTRCD SGEVELSPCTTTRNTVCQCEEGTFREEDSPEMCRKCRTGCPRGMVKVGDCTPWSDIECVH KESGIIIGVTVAAVVLIVAVFVCKSLLWKKVLPYLKGICSGGGGDPERVDRSSQRPGAED NVLNEIVSILQPTQVPEQEMEVQEPAEPTGVNMLSPGESEHLLEPAEAERSQRRRLLVPA NEGDPTETLRQCFDDFADLVPFDSWEPLMRKLGLMDNEIKVAKAEAAGHRDTLYTMLIKW VNKTGRDASVHTLLDALETLGERLAKQKIEDHLLSSGKFMYLEGSQTFPGLPFFWKKPNW TPVSRKVPQLSHDRYWKKLSHPTSPSGWNIL >gi568815590f:22904435_23117466|GENSCAN_predicted_CDS_3|2076_bp atggctgctattatacagaaggttgttaggaaaatccactacatgaagatgtgcagtctc tggctgggagcaggacaccagctgcccaacagcacaggctcagggcatggcactgtctcc actgtccccatccccaacctggaagctagagcctcgctgccctgccacttctggatgcag gacagagaaccggaactgggactgtggtctctcagtagaggctgtgaacagccatctgtg gccacacggattgagacctttcctgctcactcttcaaactggggcaccggctgggtgccc aacagaaagacaattggaagagtggcagccggagcctccccgacgagcacctccccctgc tccagggcgcccagtcccatcgaccacccaagggctgaggagtgcgggcgcacggcgcgg gagcacggcgcgggactggcaggcagctccacctgcaaccccgggctgaaacccacgggc ctgagagactataagagcgttccctaccgccatggaacaacggggacagaacgccccggc cgcttcgggggcccggaaaaggcacggcccaggacccagggaggcgcggggagccaggcc tgggccccgggtccccaagacccttgtgctcgttgtcgccgcggtcctgctgttgtagag aaagggacgtgtagatttgcacataaacagccctctggagggtcccctggggaaacctca cctgacagctgctgggatcctcctgaagactgtctcatcgcaagctaccagccgctgaac gcggcactgtccttatcttctggctggcgacttttttgttcccagaccaaactgagggtg gggctgcccaataatgagatgcagattcctggacacatgaatcctagcaatggaagaagc aaccgcatgtattggaggataattcagagcatagacagtctcctggagaaccttgttgca gcccttcaaggacaccatatctcagaagacggtagagattgcatctcctgcaaatatgga caggactatagcactcactggaatgacctccttttctgcttgcgctgcaccaggtgtgat tcaggtgaagtggagctaagtccctgcaccacgaccagaaacacagtgtgtcagtgcgaa gaaggcaccttccgggaagaagattctcctgagatgtgccggaagtgccgcacagggtgt 
cccagagggatggtcaaggtcggtgattgtacaccctggagtgacatcgaatgtgtccac aaagaatcaggcatcatcataggagtcacagttgcagccgtagtcttgattgtggctgtg tttgtttgcaagtctttactgtggaagaaagtccttccttacctgaaaggcatctgctca ggtggtggtggggaccctgagcgtgtggacagaagctcacaacgacctggggctgaggac aatgtcctcaatgagatcgtgagtatcttgcagcccacccaggtccctgagcaggaaatg gaagtccaggagccagcagagccaacaggtgtcaacatgttgtcccccggggagtcagag catctgctggaaccggcagaagctgaaaggtctcagaggaggaggctgctggttccagca aatgaaggtgatcccactgagactctgagacagtgcttcgatgactttgcagacttggtg ccctttgactcctgggagccgctcatgaggaagttgggcctcatggacaatgagataaag gtggctaaagctgaggcagcgggccacagggacaccttgtacacgatgctgataaagtgg gtcaacaaaaccgggcgagatgcctctgtccacaccctgctggatgccttggagacgctg ggagagagacttgccaagcagaagattgaggaccacttgttgagctctggaaagttcatg tatctagaaggaagtcagaccttccctggtttaccttttttctggaaaaagcccaactgg actccagtcagtaggaaagtgccacaattgtcacatgaccggtactggaagaaactctcc catccaacatcacccagtggatggaacatcctgtaa >gi568815590f:22904435_23117466|GENSCAN_predicted_peptide_4|130_aa MGGEILPLWEALTTFQFSLLRPETGPGFDDWQLLKGPRETRLSSHRLLSPDDQVPGVIKE QPLSCVEKAMETARRCQKTRSPDNLSSERAENPPGEYGGTLQREKAKWVSCHSVTNLILL STAINSTFNT >gi568815590f:22904435_23117466|GENSCAN_predicted_CDS_4|393_bp atgggaggagaaatcctccctctgtgggaggcactgactacattccagttctccctccta cgcccagagactggaccagggtttgatgactggcaacttctcaaggggccgcgtgagact cgcttgtcctcccaccgcctgctctctcctgatgaccaggttccaggagttatcaaagaa cagcctctgagctgcgtggagaaagccatggagacagccaggcgctgccagaagacaagg tcaccagataatctgtcctcagagagggctgagaacccacccggggagtatggaggaacc ctacagagggagaaagcaaaatgggtctcttgccactcagttactaacctgattcttctc tccacggccatcaactcgacttttaatacatga >gi568815590f:22904435_23117466|GENSCAN_predicted_peptide_5|49_aa MAEGDGEAGTFFTRRQEKEKRVKEELPNPYKTIRSRENYHENSMRKPPP >gi568815590f:22904435_23117466|GENSCAN_predicted_CDS_5|150_bp atggcggaaggagatggggaagctggcaccttcttcacaaggcggcaggagaaagagaag agagtgaaggaggaactgccaaacccttataaaaccatcagaagtcgcgagaactatcat gagaacagcatgaggaaaccacccccatga >gi568815590f:22904435_23117466|GENSCAN_predicted_peptide_6|299_aa 
MQGVKERFLPLGNSGDRAPRPPDGRGRVRPRTQDGVGNHTMARIPKTLKFVVVIVAVLLP VLAYSATTARQEEVPQQTVAPQQQRHSFKGEECPAGSHRSEHTGACNPCTEGVDYTNASN NEPSCFPCTVCKSDQKHKSSCTMTRDTVCQCKEGTFRNENSPEMCRKCSRCPSGEVQVSN CTSWDDIQCVEEFGANATVETPAAEETMNTSPGTPAPAAEETMNTSPGTPAPAAEETMTT SPGTPAPAAEETMTTSPGTPAPAAEETMITSPGTPASSHYLSCTIVGIIVLIVLLIVFV >gi568815590f:22904435_23117466|GENSCAN_predicted_CDS_6|900_bp atgcaaggggtgaaggagcgcttcctaccgttagggaactctggggacagagcgccccgg ccgcctgatggccgaggcagggtgcgacccaggacccaggacggcgtcgggaaccatacc atggcccggatccccaagaccctaaagttcgtcgtcgtcatcgtcgcggtcctgctgcca gtcctagcttactctgccaccactgcccggcaggaggaagttccccagcagacagtggcc ccacagcaacagaggcacagcttcaagggggaggagtgtccagcaggatctcatagatca gaacatactggagcctgtaacccgtgcacagagggtgtggattacaccaacgcttccaac aatgaaccttcttgcttcccatgtacagtttgtaaatcagatcaaaaacataaaagttcc tgcaccatgaccagagacacagtgtgtcagtgtaaagaaggcaccttccggaatgaaaac tccccagagatgtgccggaagtgtagcaggtgccctagtggggaagtccaagtcagtaat tgtacgtcctgggatgatatccagtgtgttgaagaatttggtgccaatgccactgtggaa accccagctgctgaagagacaatgaacaccagcccggggactcctgccccagctgctgaa gagacaatgaacaccagcccggggactcctgccccagctgctgaagagacaatgaccacc agcccggggactcctgccccagctgctgaagagacaatgaccaccagcccggggactcct gccccagctgctgaagagacaatgatcaccagcccggggactcctgcctcttctcattac ctctcatgcaccatcgtagggatcatagttctaattgtgcttctgattgtgtttgtttga