GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:30:20 Sequence gi568815597r:201384716_201596303 : 211588 bp : 49.43% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 843 838 6 -3.64 1.06 Term - 1090 911 180 1 0 116 49 184 0.751 14.91 1.05 Intr - 2463 1620 844 2 1 83 52 551 0.537 42.38 1.04 Intr - 4588 4445 144 0 0 126 92 207 0.996 24.30 1.03 Intr - 12641 12513 129 1 0 61 68 84 0.796 3.51 1.02 Intr - 13429 13310 120 0 0 54 121 22 0.279 1.71 1.01 Init - 14591 14554 38 2 2 86 99 46 0.446 5.30 1.00 Prom - 17915 17876 40 -4.16 2.13 PlyA - 19802 19797 6 1.05 2.12 Term - 25720 25613 108 1 0 81 44 150 0.996 8.41 2.11 Intr - 26818 26642 177 1 0 80 100 269 0.999 27.42 2.10 Intr - 28406 28317 90 2 0 91 105 179 0.999 20.09 2.09 Intr - 29934 29803 132 0 0 41 71 264 0.994 20.84 2.08 Intr - 30539 30498 42 2 0 128 100 22 0.980 5.84 2.07 Intr - 37011 36958 54 0 0 82 121 46 0.941 6.48 2.06 Intr - 46385 46236 150 2 0 78 86 76 0.697 6.76 2.05 Intr - 57567 57542 26 1 2 73 77 5 0.006 -4.36 2.04 Intr - 63532 63381 152 0 2 61 71 86 0.465 4.01 2.03 Intr - 64166 63982 185 2 2 72 25 98 0.430 0.49 2.02 Intr - 70709 70638 72 2 0 86 85 40 0.671 3.10 2.01 Init - 74727 74674 54 0 0 89 100 42 0.884 6.78 2.00 Prom - 79695 79656 40 -7.86 3.11 PlyA - 79812 79807 6 1.05 3.10 Term - 84102 83688 415 2 1 16 40 869 0.683 69.63 3.09 Intr - 85281 85109 173 0 2 78 47 109 0.090 4.54 3.08 Intr - 95655 95607 49 2 1 78 81 53 0.155 2.28 3.07 Intr - 100074 100002 73 1 1 117 33 64 0.324 2.26 3.06 Intr - 100661 100568 94 2 1 75 98 107 0.998 9.94 3.05 Intr - 104269 104140 130 0 1 106 116 154 0.993 20.70 3.04 Intr - 105629 105461 169 0 1 103 64 364 0.667 34.50 3.03 Intr - 111589 111477 113 0 2 117 73 82 0.615 9.62 3.02 Intr - 122186 121909 278 2 2 -12 64 233 0.424 7.21 3.01 Init - 122516 122355 162 2 0 87 109 124 0.636 14.07 3.00 Prom - 124855 124816 40 -4.46 4.00 Prom + 126446 126485 40 -5.46 4.01 Init + 135401 135544 144 1 0 89 42 229 0.157 18.52 4.02 Term + 148029 148196 168 2 0 117 47 60 0.227 2.68 4.03 PlyA + 148861 148866 6 1.05 5.08 PlyA - 152277 152272 6 1.05 5.07 Term - 153708 153653 56 1 2 112 55 26 0.132 -0.58 5.06 Intr - 154711 154553 159 2 0 89 49 138 0.150 9.96 5.05 Intr - 174868 174636 233 0 2 68 49 108 0.379 2.32 5.04 Intr - 175294 175001 294 0 0 80 -25 206 0.593 4.62 5.03 Intr - 178240 178156 85 2 1 76 57 74 0.808 1.98 5.02 Intr - 182684 182528 157 2 1 14 35 140 0.353 0.88 5.01 Init - 193563 193171 393 0 0 14 44 205 0.687 5.44 5.00 Prom - 193779 193740 40 -3.06 6.04 PlyA - 194656 194651 6 1.05 6.03 Term - 196249 196152 98 2 2 102 48 28 0.275 -1.67 6.02 Intr - 196445 196420 26 1 2 101 98 33 0.146 3.27 6.01 Init - 211141 211092 50 1 2 80 113 18 0.676 3.97 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 154570 154627 58 0 1 44 117 114 0.824 9.37 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:201384716_201596303|GENSCAN_predicted_peptide_1|484_aa MAVSRKDWSALSRLSLSIAAHRLGLGGHLPRPSRPHSALKHCGQPMGSREGKSPHRALWA DAAAANSHPDEAPGTEDRSRSAPWVNKPQRAGRKGLLARQRTLEDEEEQERERRRRHRNL SSTTDDEAPRLSQNGDRQASASERLPSVEEAEVPKPLPPASKDEDEDIQSILRTRQERRQ RRQVVEAAQAPIQERLEAEEGRNSLSPVQATQKPLVSKKELEIPPRRRLSREQRGPWALE EESLVGREPEERKKGVPEKSPVLEKSSMPKKTAPEKSLVSDKTSISEKVLASEKTSLSEK IAVSEKRNSSEKKSVLEKTSVSEKSLAPGMALGSGRRLVSEKASIFEKALASEKSPTADA KPAPKRATASEQPLAQEPPASGGSPATTKEQRGRALPGKNLPSLAKQGASDPPTVASRLP PVTLQVKIPSKEEEADMSSPTQRTYSSSLKRSSPRTISFRVRALGTLVCEPLEGKWRKPA SITT >gi568815597r:201384716_201596303|GENSCAN_predicted_CDS_1|1455_bp atggctgtcagcaggaaggactggtccgcgctgtccaggcttagcctctccatagcagct catcgcctgggactggggggccacctccccagaccctcaagacctcacagtgctctgaag cactgtgggcagcccatgggctctagagaggggaaaagccctcatagagccctgtgggca gacgcagctgctgccaacagtcaccctgatgaggcacctggaacagaggaccgctcccgc tccgctccctgggtgaacaagcctcagcgggcaggcaggaaagggctccttgcccggcag aggactctggaggatgaggaggaacaggagcgcgagcgcaggcggcggcaccgcaacctg agctccaccacggacgatgaggctcccaggctcagccagaatggagaccggcaggcctct gcttctgagagactaccgagcgtggaagaagcagaggtgcccaagccactgcccccagcc tccaaagatgaggacgaggacatccagagcatcctcagaacacggcaggagcggaggcag aggcggcaggtggtggaggctgcacaggcccccatccaggagaggctggaggcagaggag gggaggaacagcttgagccctgtgcaggccacacagaaacccctagtctccaagaaggaa ctggaaatcccacctcgccggagactgagtcgggaacagcggggcccctgggccctggag gaggagagcttggtgggcagggagccagaagagaggaagaaaggggttccagaaaagtcc ccagtcttggagaagtcctccatgccaaagaagacggcacctgaaaagagcctggtctcc gataaaacctccatctctgagaaggtgctggcctcagagaagacatctctatcagagaag atagcagtgtcagagaaaagaaacagctcagagaagaagtctgttctagaaaaaaccagt gtctctgagaagtcgctggccccagggatggcactgggctcaggaaggaggctggtgtct gagaaagcttccatctttgagaaggcactggcctcagagaagagcccaactgcagatgct aagccggccccaaagagggccacagcctcagagcagcccctggcgcaggagccgccagcc tctgggggaagcccagccaccaccaaggagcagagaggaagggccctccctgggaagaac ctgccctctttggcaaagcagggggcttcagaccctccgactgtggcctcccgcctccca cccgtcacactccaggtgaaaatccccagcaaggaggaagaggcagatatgtcctcaccc acacagcgaacctacagcagctccctcaaacgctccagccccaggaccatctcctttcgg gtgagagccctgggcaccctggtctgcgagccactggagggcaagtggaggaagcccgca tccattacaacatag >gi568815597r:201384716_201596303|GENSCAN_predicted_peptide_2|413_aa MAFQVEEITCAKVPGPEKVDVSWAPGSSWDTSDEWSREEIRQGINAKDDYPYFTDEKTGS KRRKALPEATQGVRGSASIHSSSGSGDGVCVVNLFERPEGRLKACLPELLKEQSDSIPQD DSSDPSSPLPKLFPGSLGGGGEKDYERRDGTTQAEVGPHYDAQAGHRVEGVVIVQGDLPG PKFNNNKGPVPSCCFYVCPSTLLEGEQFITYVKSAVYGEAQASPAPRGLNKRKPKITASR KLLLKSLMLAKAKECWEQEHEEREAEKVRYLAERIPTLQTRGLSLSALQDLCRELHAKVE VVDEERYDIEAKCLHNTREIKDLKLKVMDLRGKFKRPPLRRVRVSADAMLRALLGSKHKV SMDLRANLKSVKKEDTEKERPVEVGDWRKNVEAMSGMEGRKKMFDAAKSPTSQ >gi568815597r:201384716_201596303|GENSCAN_predicted_CDS_2|1242_bp atggcgttccaggtggaagaaatcacatgtgcaaaggtgccaggaccagagaaggtggat gtgagctgggctcctgggagctcctgggacacttcagatgagtggagccgggaagaaatc cggcagggtataaatgcaaaagatgattacccctattttacagatgagaaaacaggttca aagagaaggaaggctttacctgaggccacccagggagtaagaggaagtgccagcattcac agcagctctgggagtggggatggagtatgtgtggtgaatttattcgagaggcccgaaggc cgtctgaaagcatgtctgcctgagctgcttaaggagcagtcagattccatcccacaggat gactcgtctgaccccagctccccactccctaaactgttcccagggtccctgggtgggggc ggagagaaagattatgagaggcgagatggcacaacacaggcagaggtgggacctcactat gatgcccaggctgggcacagggtggagggggtggtcatagttcagggtgacctcccaggg cctaagtttaacaacaacaaaggcccagtgcccagctgttgtttctatgtgtgtccgagc acattgttggagggcgagcagttcatcacctatgtcaagtctgcagtctacggcgaggca caggccagcccagctccacgaggactgaacaagagaaaacccaagatcactgcctcccgc aaactcttgctgaagagcctgatgctggccaaggccaaggaatgctgggagcaggagcac gaggagcgcgaggctgagaaggtgcgctacctggcagagcgcatccccacgctgcagacc cgtggcctgtccctcagtgccctgcaggacctgtgccgggagctgcacgccaaggtggag gtggtggatgaggagcgatacgacattgaggccaaatgcctccacaacaccagggagatt aaggacctgaagctgaaggtgatggacctccgtgggaagttcaagcgcccgcccctgcgt cgagtccgtgtctcggctgacgccatgctccgggccctgctgggctccaagcacaaggtg tccatggatctgcgggccaacctcaagtctgtgaagaaggaagacacagagaaggagcgg cctgtggaggtgggtgactggaggaagaacgtggaggccatgtctggcatggaaggccgg aagaagatgtttgatgccgccaagtctccgacctcacaatag >gi568815597r:201384716_201596303|GENSCAN_predicted_peptide_3|551_aa MTSALEWCSVQSPLVGVARDTPSAERNKTARRPGAPSRHFRERLPPLRRRASCQALGPSP SRAFLGFLDTVDFRVCVTQGDPAPRNRLQVGQNNRRFCAPLTLIVWQRDEGIRATPSLTL LLGRKCPRAGWGKHLGLTLDICVPERRMPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKS CFLCMVCKKNLDSTTVAVHGEEIYCKSCYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHE EAPGHRPTTNPNASKFAQKIGGSERCPRCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGL ESTTLADKDGEIYCKGCYAKNFGPKGFGFGQGAGALVHSEDPLAFQRFLMEAPGSPLCMR EKSILGEGRGRTESAERKEQGWSCAYTWDWLCHRSSVGACKCAVERTADSGIPSLELSAN LGAPMTAAATATVLKEGVLEKRSGGLLQLWKRKRCVLTERGLQLFEAKGTGGRPKELSFA RIKAVECVESTGRHIYFTLVTEGGGEIDFRCPLEDPGWNAQITLGLVKFKNQQAIQTVRA RQSLGTGTLVS >gi568815597r:201384716_201596303|GENSCAN_predicted_CDS_3|1656_bp atgacatcagccctggagtggtgcagtgtgcaaagcccactggttggcgtggcccgggac acgccttccgcggagcggaacaaaacggcgcgcaggccgggcgcacccagccgccacttc cgagagcgcctgccgcccctgcgccgccgagccagctgccaggcactaggtccgtcccca tcccgggccttcctgggcttcctggacacagttgacttccgcgtgtgcgtgacccagggc gacccggctccccgcaaccgcctgcaagtcgggcagaacaatcggcgattctgcgcgcca ctgacgctcattgtatggcagagagatgagggcatccgtgccaccccctccctcaccctc ctgctggggagaaaatgcccgcgggcggggtggggcaaacatctgggcctgaccttggac atctgtgtccctgagcgcagaatgccgaactggggaggaggcaagaaatgtggggtgtgt cagaagacggtttactttgccgaagaggttcagtgcgaaggcaacagcttccataaatcc tgcttcctgtgcatggtctgcaagaagaatctggacagtaccactgtggccgtgcatggt gaggagatttactgcaagtcctgctacggcaagaagtatgggcccaaaggctatggctac gggcagggcgcaggcaccctcagcactgacaagggggagtcgctgggtatcaagcacgag gaagcccctggccacaggcccaccaccaaccccaatgcatccaaatttgcccagaagatt ggtggctccgagcgctgcccccgatgcagccaggcagtctatgctgcggagaaggtgatt ggtgctgggaagtcctggcataaggcctgctttcgatgtgccaagtgtggcaaaggcctt gagtcaaccaccctggcagacaaggatggcgagatttactgcaaaggatgttatgctaaa aacttcgggcccaagggctttggttttgggcaaggagctggggccttggtccactctgaa gatcccctggcctttcagcgcttcctgatggaggcaccagggagcccactctgtatgcgg gagaagtccattcttggggaaggccgagggaggacagaatctgcggagcggaaggagcaa ggttggagctgcgcttacacctgggactggctgtgccatcggtccagcgtcggcgcctgc aagtgtgcggtggagaggactgcagacagtggcatccccagcctggagctttccgcgaac ctcggggcgcccatgacggcggcggcgacggctaccgtgctcaaggagggcgtgctggag aagcgcagcggcgggctgctgcagctgtggaagcggaagcgctgcgtcctcaccgaacgc gggctgcagctcttcgaggccaagggcacgggcggccggcccaaggagctcagcttcgcc cgcatcaaggccgtggagtgcgtggagagcaccgggcgccacatctacttcacgctggtg accgaagggggcggcgagatcgacttccgctgccccctggaagatcccggctggaacgcc cagatcaccctaggcctggtcaagttcaagaaccagcaggccatccagacagtgcgggcc cggcagagcctcgggaccgggaccctcgtgtcctaa >gi568815597r:201384716_201596303|GENSCAN_predicted_peptide_4|103_aa MVAKKDVHMPKHPELADKNAPNLRVMKAMQSLKSRGYVKEVCLETFLLGITCLPPPHPFL TTAGFTSTAFTSSRIWQFILFQLLSSSPTLTKVDIAASKSERS >gi568815597r:201384716_201596303|GENSCAN_predicted_CDS_4|312_bp atggtggccaagaaggatgtccacatgcctaagcacccggagctggcagacaagaatgcg cccaaccttcgtgtcatgaaggccatgcagtctctcaagtcccgaggctacgtgaaggaa gtttgcctggagacatttctactgggaatcacctgcctacctccacctcatccatttctc accacagctggattcacctctacagcctttacctccagtcgtatttggcaattcatcctt ttccagctcctatcatcctcccctaccctcaccaaagtggacatcgcagcctctaaatct gaacggagctga >gi568815597r:201384716_201596303|GENSCAN_predicted_peptide_5|458_aa MGSTVELRRTIRELEDRTIEINQYEQQKENRPKQKAEQILRGLWDYNKRSNFRVIEVLVR EKKEDGAEKVLQEKIVENFAKLARDINPQIQESEQMPNKIDLKKSQIKHLRVKDKETILK AGRETGNLNLQWPSSYSAGSNPGNLTGQGPSGPANALVPVDSPIRKNNRGNPLNQQKPVP HEEGCYHPYFSEKETDAQTDPVIQQHPGSGLSAGLPQPEPLTPTFFQSEKEGTRACGTCF LSPQDAELRAACIGEEGVGIGLGSAGQLHPLVAATGRCKAMQTKAEPDSSSFPESLYTAF IKHLLADGTGFSDQCFNGQFCCPLPTCMAPRPSSRTKLLTSIKTLLPVVTLRTLSLPQPT RDLKAATVGRALFLVVYVRVSSVRSSKGPASQASQNGKRQTLSAATARPPRAGHSPERLP ARAPGPRRSRAESRIPKGSAGTWEWEEGEEEERGNDRT >gi568815597r:201384716_201596303|GENSCAN_predicted_CDS_5|1377_bp atgggatcaacagtagaattaaggagaacaatcagagaactggaagatagaacaatagaa attaaccaatatgaacaacagaaagaaaatagaccaaaacaaaaagcagagcagattctt aggggcctgtgggactataacaaaagatctaactttcgtgttattgaagttctggtaaga gaaaagaaagaggatggtgctgaaaaagtactccaagaaaaaatagttgaaaactttgca aagttggcaagagacataaacccacagattcaagaatctgagcaaatgccaaacaagata gacctaaagaaatcacaaattaagcatctgcgagttaaagacaaagaaacaatcttgaaa gcaggaagagaaacaggaaaccttaatctacagtggcccagctcctacagcgcgggctcc aaccccggtaacttgacaggacaaggcccctctggcccagccaatgccctcgtacctgtg gacagccccatcagaaaaaacaacagaggcaaccccctgaaccaacagaagcctgttccc catgaagagggctgttatcatccctacttttcagaaaaggaaactgatgctcagacagat cccgtgatccaacagcatccaggatctggactgagtgcaggactgccccagcctgagcct ctcaccccaactttcttccagtctgagaaggaggggacacgtgcctgtgggacctgcttc cttagcccacaagatgcagagctgcgagctgcctgcataggagaggagggcgtggggatc gggctgggttctgctgggcaactgcaccctctggtggcggcaacaggaagatgcaaggcc atgcagacaaaggccgaaccagattcttcctccttcccagaatccctctacaccgccttt attaagcacctattggcggatggcacgggcttcagcgatcagtgcttcaatggacaattt tgctgccctctgcccacatgcatggcacctcgaccatcatctcggacaaagcttctcacc tccatcaaaaccctgctcccagtcgtaactctaaggaccctcagcctcccccaacctaca agagacttaaaagcagcaactgttggaagagcactgttcctggtggtttatgtgcgagtc tccagtgtcagatcctccaagggacctgccagccaagcctcgcaaaacgggaagcgccag accctcagcgccgcgaccgcgcgaccgccccgcgccggccactcaccggagcgcctgccc gcccgggctccgggtccgcgccgcagccgcgcagagtcgcgcatccccaagggcagcgcg gggacttgggagtgggaggagggggaagaagaggagagaggaaatgacaggacctga >gi568815597r:201384716_201596303|GENSCAN_predicted_peptide_6|57_aa MAWWKRQKKGTLVAGDTMTAQKAFKAAVLPGGMPFTIVSIQKSIYLSKFSSSITYLL >gi568815597r:201384716_201596303|GENSCAN_predicted_CDS_6|174_bp atggcttggtggaagaggcagaagaagggcaccttggttgcaggggacacgatgactgct cagaaagcctttaaggctgctgtgttgcctggtgggatgccctttaccatcgtgtccatc cagaaatccatttatctttcaaagtttagctcaagcataacctacctcctctaa