GENSCAN 1.0    Date run: 5-Nov-116    Time: 00:54:14

Sequence gi568815597r:201368406_201568786 : 200381 bp : 50.03% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4570   4644   75  0  0   46   44   136 0.203   4.10
 1.02 Intr +   9199   9328  130  0  1  124   93    67 0.863  11.07
 1.03 Term +   9877   9959   83  2  2   86   54    53 0.895  -0.44
 1.04 PlyA +  10981  10986    6                               -3.64

 2.12 PlyA -  11010  11005    6                                1.05
 2.11 Term -  13488  13483    6  0  0  122   48     0 0.514  -2.73
 2.10 Intr -  14334  14248   87  0  0   74   68    46 0.700   1.37
 2.09 Intr -  14839  14669  171  1  0   72  100   105 0.631  10.24
 2.08 Intr -  14984  14912   73  1  1   91   63    94 0.899   6.61
 2.07 Intr -  16430  16387   44  2  2  108   75    34 0.879   1.14
 2.06 Intr -  17400  17296  105  0  0  116   80   108 0.821  13.21
 2.05 Intr -  18773  17930  844  1  1   83   52   551 0.537  42.38
 2.04 Intr -  20898  20755  144  2  0  126   92   207 0.996  24.30
 2.03 Intr -  28951  28823  129  0  0   61   68    84 0.796   3.51
 2.02 Intr -  29739  29620  120  2  0   54  121    22 0.279   1.71
 2.01 Init -  30901  30864   38  1  2   86   99    46 0.446   5.30
 2.00 Prom -  34225  34186   40                               -4.16

 3.13 PlyA -  36112  36107    6                                1.05
 3.12 Term -  42030  41923  108  0  0   81   44   150 0.996   8.41
 3.11 Intr -  43128  42952  177  0  0   80  100   269 0.999  27.42
 3.10 Intr -  44716  44627   90  1  0   91  105   179 0.999  20.09
 3.09 Intr -  46244  46113  132  2  0   41   71   264 0.994  20.84
 3.08 Intr -  46849  46808   42  1  0  128  100    22 0.980   5.84
 3.07 Intr -  53321  53268   54  2  0   82  121    46 0.941   6.48
 3.06 Intr -  62695  62546  150  1  0   78   86    76 0.697   6.76
 3.05 Intr -  73877  73852   26  0  2   73   77     5 0.006  -4.36
 3.04 Intr -  79842  79691  152  2  2   61   71    86 0.465   4.01
 3.03 Intr -  80476  80292  185  1  2   72   25    98 0.430   0.49
 3.02 Intr -  87019  86948   72  1  0   86   85    40 0.671   3.10
 3.01 Init -  91037  90984   54  2  0   89  100    42 0.884   6.78
 3.00 Prom -  96005  95966   40                               -7.86

 4.11 PlyA -  96122  96117    6                                1.05
 4.10 Term - 100412  99998  415  1  1   16   40   869 0.683  69.63
 4.09 Intr - 101591 101419  173  2  2   78   47   109 0.090   4.54
 4.08 Intr - 111965 111917   49  1  1   78   81    53 0.155   2.28
 4.07 Intr - 116384 116312   73  0  1  117   33    64 0.324   2.26
 4.06 Intr - 116971 116878   94  1  1   75   98   107 0.998   9.94
 4.05 Intr - 120579 120450  130  2  1  106  116   154 0.993  20.70
 4.04 Intr - 121939 121771  169  2  1  103   64   364 0.667  34.50
 4.03 Intr - 127899 127787  113  2  2  117   73    82 0.615   9.62
 4.02 Intr - 138496 138219  278  1  2  -12   64   233 0.424   7.21
 4.01 Init - 138826 138665  162  1  0   87  109   124 0.636  14.07
 4.00 Prom - 141165 141126   40                               -4.46

 5.00 Prom + 142756 142795   40                               -5.46
 5.01 Init + 151711 151854  144  0  0   89   42   229 0.157  18.52
 5.02 Term + 164339 164506  168  1  0  117   47    60 0.227   2.68
 5.03 PlyA + 165171 165176    6                                1.05

 6.08 PlyA - 168587 168582    6                                1.05
 6.07 Term - 170018 169963   56  0  2  112   55    26 0.132  -0.58
 6.06 Intr - 171021 170863  159  1  0   89   49   138 0.150   9.96
 6.05 Intr - 191178 190946  233  2  2   68   49   108 0.385   2.32
 6.04 Intr - 191604 191311  294  2  0   80  -25   206 0.604   4.62
 6.03 Intr - 194550 194466   85  1  1   76   57    74 0.830   1.98
 6.02 Intr - 198994 198838  157  1  1   14   35   140 0.390   0.88
 6.01 Init - 199338 199240   99  0  0   73   74    60 0.412   3.36

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------
S.001 Init + 170880 170937   58  2  1   44  117   114 0.823   9.37

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:201368406_201568786|GENSCAN_predicted_peptide_1|95_aa
MTLELNLCASRSLSACALPGIYNPGSPARALPQNSSCRQILEASAQSQRGLGEAEDGEGF
KQACGLGPGNGLKLLQEARYKVELPVKNLEQALVV

>gi568815597r:201368406_201568786|GENSCAN_predicted_CDS_1|288_bp
atgaccctggagctgaacctctgtgcttcccgaagtctctcggcctgtgctctgcctggg
atctacaacccagggagccctgcccgagccttacctcagaacagcagctgccgacagatc
ctggaggcgtctgctcagtctcagcggggactgggtgaggcagaggatggagagggcttt
aagcaggcatgtgggctggggcctgggaatggactgaaactgctacaggaggccagatac
aaggtagaacttccagtcaagaaccttgaacaggcgctcgttgtgtga

>gi568815597r:201368406_201568786|GENSCAN_predicted_peptide_2|586_aa
MAVSRKDWSALSRLSLSIAAHRLGLGGHLPRPSRPHSALKHCGQPMGSREGKSPHRALWA
DAAAANSHPDEAPGTEDRSRSAPWVNKPQRAGRKGLLARQRTLEDEEEQERERRRRHRNL
SSTTDDEAPRLSQNGDRQASASERLPSVEEAEVPKPLPPASKDEDEDIQSILRTRQERRQ
RRQVVEAAQAPIQERLEAEEGRNSLSPVQATQKPLVSKKELEIPPRRRLSREQRGPWALE
EESLVGREPEERKKGVPEKSPVLEKSSMPKKTAPEKSLVSDKTSISEKVLASEKTSLSEK
IAVSEKRNSSEKKSVLEKTSVSEKSLAPGMALGSGRRLVSEKASIFEKALASEKSPTADA
KPAPKRATASEQPLAQEPPASGGSPATTKEQRGRALPGKNLPSLAKQGASDPPTVASRLP
PVTLQVKIPSKEEEADMSSPTQRTYSSSLKRSSPRTISFRMKPKKENSETTLTRSASMKL
PDNTVKLGEKLERYHTAIRHGSAVGVLGSQRSESVKSRGLPCTELFVAPVGVASKRHLFE
KELAGQSRAEPASSRKENLRLSGVVTSRLNLWISRTQESGDQDPQV

>gi568815597r:201368406_201568786|GENSCAN_predicted_CDS_2|1761_bp
atggctgtcagcaggaaggactggtccgcgctgtccaggcttagcctctccatagcagct
catcgcctgggactggggggccacctccccagaccctcaagacctcacagtgctctgaag
cactgtgggcagcccatgggctctagagaggggaaaagccctcatagagccctgtgggca
gacgcagctgctgccaacagtcaccctgatgaggcacctggaacagaggaccgctcccgc
tccgctccctgggtgaacaagcctcagcgggcaggcaggaaagggctccttgcccggcag
aggactctggaggatgaggaggaacaggagcgcgagcgcaggcggcggcaccgcaacctg
agctccaccacggacgatgaggctcccaggctcagccagaatggagaccggcaggcctct
gcttctgagagactaccgagcgtggaagaagcagaggtgcccaagccactgcccccagcc
tccaaagatgaggacgaggacatccagagcatcctcagaacacggcaggagcggaggcag
aggcggcaggtggtggaggctgcacaggcccccatccaggagaggctggaggcagaggag
gggaggaacagcttgagccctgtgcaggccacacagaaacccctagtctccaagaaggaa
ctggaaatcccacctcgccggagactgagtcgggaacagcggggcccctgggccctggag
gaggagagcttggtgggcagggagccagaagagaggaagaaaggggttccagaaaagtcc
ccagtcttggagaagtcctccatgccaaagaagacggcacctgaaaagagcctggtctcc
gataaaacctccatctctgagaaggtgctggcctcagagaagacatctctatcagagaag
atagcagtgtcagagaaaagaaacagctcagagaagaagtctgttctagaaaaaaccagt
gtctctgagaagtcgctggccccagggatggcactgggctcaggaaggaggctggtgtct
gagaaagcttccatctttgagaaggcactggcctcagagaagagcccaactgcagatgct
aagccggccccaaagagggccacagcctcagagcagcccctggcgcaggagccgccagcc
tctgggggaagcccagccaccaccaaggagcagagaggaagggccctccctgggaagaac
ctgccctctttggcaaagcagggggcttcagaccctccgactgtggcctcccgcctccca
cccgtcacactccaggtgaaaatccccagcaaggaggaagaggcagatatgtcctcaccc
acacagcgaacctacagcagctccctcaaacgctccagccccaggaccatctcctttcgg
atgaaacccaagaaagaaaactcggaaacaaccctaactcgcagtgccagcatgaagctc
ccagacaacacagtgaagttgggagagaagctggagagataccacacggccatacggcat
gggtcagctgttggtgttcttggttcccagagatcagaatctgtcaagtctcggggtctg
ccttgcactgagttattcgtggctcctgtgggtgtagccagcaagcgccacctctttgag
aaggaactggcgggccagagccgagcagaaccagcctccagccggaaggagaacttgagg
ctctcaggggttgtgacatcaaggctcaacctgtggatcagcaggacccaggaatctgga
gatcaggacccccaggtgtga

>gi568815597r:201368406_201568786|GENSCAN_predicted_peptide_3|413_aa
MAFQVEEITCAKVPGPEKVDVSWAPGSSWDTSDEWSREEIRQGINAKDDYPYFTDEKTGS
KRRKALPEATQGVRGSASIHSSSGSGDGVCVVNLFERPEGRLKACLPELLKEQSDSIPQD
DSSDPSSPLPKLFPGSLGGGGEKDYERRDGTTQAEVGPHYDAQAGHRVEGVVIVQGDLPG
PKFNNNKGPVPSCCFYVCPSTLLEGEQFITYVKSAVYGEAQASPAPRGLNKRKPKITASR
KLLLKSLMLAKAKECWEQEHEEREAEKVRYLAERIPTLQTRGLSLSALQDLCRELHAKVE
VVDEERYDIEAKCLHNTREIKDLKLKVMDLRGKFKRPPLRRVRVSADAMLRALLGSKHKV
SMDLRANLKSVKKEDTEKERPVEVGDWRKNVEAMSGMEGRKKMFDAAKSPTSQ

>gi568815597r:201368406_201568786|GENSCAN_predicted_CDS_3|1242_bp
atggcgttccaggtggaagaaatcacatgtgcaaaggtgccaggaccagagaaggtggat
gtgagctgggctcctgggagctcctgggacacttcagatgagtggagccgggaagaaatc
cggcagggtataaatgcaaaagatgattacccctattttacagatgagaaaacaggttca
aagagaaggaaggctttacctgaggccacccagggagtaagaggaagtgccagcattcac
agcagctctgggagtggggatggagtatgtgtggtgaatttattcgagaggcccgaaggc
cgtctgaaagcatgtctgcctgagctgcttaaggagcagtcagattccatcccacaggat
gactcgtctgaccccagctccccactccctaaactgttcccagggtccctgggtgggggc
ggagagaaagattatgagaggcgagatggcacaacacaggcagaggtgggacctcactat
gatgcccaggctgggcacagggtggagggggtggtcatagttcagggtgacctcccaggg
cctaagtttaacaacaacaaaggcccagtgcccagctgttgtttctatgtgtgtccgagc
acattgttggagggcgagcagttcatcacctatgtcaagtctgcagtctacggcgaggca
caggccagcccagctccacgaggactgaacaagagaaaacccaagatcactgcctcccgc
aaactcttgctgaagagcctgatgctggccaaggccaaggaatgctgggagcaggagcac
gaggagcgcgaggctgagaaggtgcgctacctggcagagcgcatccccacgctgcagacc
cgtggcctgtccctcagtgccctgcaggacctgtgccgggagctgcacgccaaggtggag
gtggtggatgaggagcgatacgacattgaggccaaatgcctccacaacaccagggagatt
aaggacctgaagctgaaggtgatggacctccgtgggaagttcaagcgcccgcccctgcgt
cgagtccgtgtctcggctgacgccatgctccgggccctgctgggctccaagcacaaggtg
tccatggatctgcgggccaacctcaagtctgtgaagaaggaagacacagagaaggagcgg
cctgtggaggtgggtgactggaggaagaacgtggaggccatgtctggcatggaaggccgg
aagaagatgtttgatgccgccaagtctccgacctcacaatag

>gi568815597r:201368406_201568786|GENSCAN_predicted_peptide_4|551_aa
MTSALEWCSVQSPLVGVARDTPSAERNKTARRPGAPSRHFRERLPPLRRRASCQALGPSP
SRAFLGFLDTVDFRVCVTQGDPAPRNRLQVGQNNRRFCAPLTLIVWQRDEGIRATPSLTL
LLGRKCPRAGWGKHLGLTLDICVPERRMPNWGGGKKCGVCQKTVYFAEEVQCEGNSFHKS
CFLCMVCKKNLDSTTVAVHGEEIYCKSCYGKKYGPKGYGYGQGAGTLSTDKGESLGIKHE
EAPGHRPTTNPNASKFAQKIGGSERCPRCSQAVYAAEKVIGAGKSWHKACFRCAKCGKGL
ESTTLADKDGEIYCKGCYAKNFGPKGFGFGQGAGALVHSEDPLAFQRFLMEAPGSPLCMR
EKSILGEGRGRTESAERKEQGWSCAYTWDWLCHRSSVGACKCAVERTADSGIPSLELSAN
LGAPMTAAATATVLKEGVLEKRSGGLLQLWKRKRCVLTERGLQLFEAKGTGGRPKELSFA
RIKAVECVESTGRHIYFTLVTEGGGEIDFRCPLEDPGWNAQITLGLVKFKNQQAIQTVRA
RQSLGTGTLVS

>gi568815597r:201368406_201568786|GENSCAN_predicted_CDS_4|1656_bp
atgacatcagccctggagtggtgcagtgtgcaaagcccactggttggcgtggcccgggac
acgccttccgcggagcggaacaaaacggcgcgcaggccgggcgcacccagccgccacttc
cgagagcgcctgccgcccctgcgccgccgagccagctgccaggcactaggtccgtcccca
tcccgggccttcctgggcttcctggacacagttgacttccgcgtgtgcgtgacccagggc
gacccggctccccgcaaccgcctgcaagtcgggcagaacaatcggcgattctgcgcgcca
ctgacgctcattgtatggcagagagatgagggcatccgtgccaccccctccctcaccctc
ctgctggggagaaaatgcccgcgggcggggtggggcaaacatctgggcctgaccttggac
atctgtgtccctgagcgcagaatgccgaactggggaggaggcaagaaatgtggggtgtgt
cagaagacggtttactttgccgaagaggttcagtgcgaaggcaacagcttccataaatcc
tgcttcctgtgcatggtctgcaagaagaatctggacagtaccactgtggccgtgcatggt
gaggagatttactgcaagtcctgctacggcaagaagtatgggcccaaaggctatggctac
gggcagggcgcaggcaccctcagcactgacaagggggagtcgctgggtatcaagcacgag
gaagcccctggccacaggcccaccaccaaccccaatgcatccaaatttgcccagaagatt
ggtggctccgagcgctgcccccgatgcagccaggcagtctatgctgcggagaaggtgatt
ggtgctgggaagtcctggcataaggcctgctttcgatgtgccaagtgtggcaaaggcctt
gagtcaaccaccctggcagacaaggatggcgagatttactgcaaaggatgttatgctaaa
aacttcgggcccaagggctttggttttgggcaaggagctggggccttggtccactctgaa
gatcccctggcctttcagcgcttcctgatggaggcaccagggagcccactctgtatgcgg
gagaagtccattcttggggaaggccgagggaggacagaatctgcggagcggaaggagcaa
ggttggagctgcgcttacacctgggactggctgtgccatcggtccagcgtcggcgcctgc
aagtgtgcggtggagaggactgcagacagtggcatccccagcctggagctttccgcgaac
ctcggggcgcccatgacggcggcggcgacggctaccgtgctcaaggagggcgtgctggag
aagcgcagcggcgggctgctgcagctgtggaagcggaagcgctgcgtcctcaccgaacgc
gggctgcagctcttcgaggccaagggcacgggcggccggcccaaggagctcagcttcgcc
cgcatcaaggccgtggagtgcgtggagagcaccgggcgccacatctacttcacgctggtg
accgaagggggcggcgagatcgacttccgctgccccctggaagatcccggctggaacgcc
cagatcaccctaggcctggtcaagttcaagaaccagcaggccatccagacagtgcgggcc
cggcagagcctcgggaccgggaccctcgtgtcctaa

>gi568815597r:201368406_201568786|GENSCAN_predicted_peptide_5|103_aa
MVAKKDVHMPKHPELADKNAPNLRVMKAMQSLKSRGYVKEVCLETFLLGITCLPPPHPFL
TTAGFTSTAFTSSRIWQFILFQLLSSSPTLTKVDIAASKSERS

>gi568815597r:201368406_201568786|GENSCAN_predicted_CDS_5|312_bp
atggtggccaagaaggatgtccacatgcctaagcacccggagctggcagacaagaatgcg
cccaaccttcgtgtcatgaaggccatgcagtctctcaagtcccgaggctacgtgaaggaa
gtttgcctggagacatttctactgggaatcacctgcctacctccacctcatccatttctc
accacagctggattcacctctacagcctttacctccagtcgtatttggcaattcatcctt
ttccagctcctatcatcctcccctaccctcaccaaagtggacatcgcagcctctaaatct
gaacggagctga

>gi568815597r:201368406_201568786|GENSCAN_predicted_peptide_6|360_aa
MSVLLERQSPPSSTKQRQVDPGHTGLEQRPQGTWPSSYSAGSNPGNLTGQGPSGPANALV
PVDSPIRKNNRGNPLNQQKPVPHEEGCYHPYFSEKETDAQTDPVIQQHPGSGLSAGLPQP
EPLTPTFFQSEKEGTRACGTCFLSPQDAELRAACIGEEGVGIGLGSAGQLHPLVAATGRC
KAMQTKAEPDSSSFPESLYTAFIKHLLADGTGFSDQCFNGQFCCPLPTCMAPRPSSRTKL
LTSIKTLLPVVTLRTLSLPQPTRDLKAATVGRALFLVVYVRVSSVRSSKGPASQASQNGK
RQTLSAATARPPRAGHSPERLPARAPGPRRSRAESRIPKGSAGTWEWEEGEEEERGNDRT

>gi568815597r:201368406_201568786|GENSCAN_predicted_CDS_6|1083_bp
atgtctgtgctgttggagagacagtctccaccttcctccacaaagcagcggcaggtagac
cctggccacactggattggaacagagaccccagggcacttggcccagctcctacagcgcg
ggctccaaccccggtaacttgacaggacaaggcccctctggcccagccaatgccctcgta
cctgtggacagccccatcagaaaaaacaacagaggcaaccccctgaaccaacagaagcct
gttccccatgaagagggctgttatcatccctacttttcagaaaaggaaactgatgctcag
acagatcccgtgatccaacagcatccaggatctggactgagtgcaggactgccccagcct
gagcctctcaccccaactttcttccagtctgagaaggaggggacacgtgcctgtgggacc
tgcttccttagcccacaagatgcagagctgcgagctgcctgcataggagaggagggcgtg
gggatcgggctgggttctgctgggcaactgcaccctctggtggcggcaacaggaagatgc
aaggccatgcagacaaaggccgaaccagattcttcctccttcccagaatccctctacacc
gcctttattaagcacctattggcggatggcacgggcttcagcgatcagtgcttcaatgga
caattttgctgccctctgcccacatgcatggcacctcgaccatcatctcggacaaagctt
ctcacctccatcaaaaccctgctcccagtcgtaactctaaggaccctcagcctcccccaa
cctacaagagacttaaaagcagcaactgttggaagagcactgttcctggtggtttatgtg
cgagtctccagtgtcagatcctccaagggacctgccagccaagcctcgcaaaacgggaag
cgccagaccctcagcgccgcgaccgcgcgaccgccccgcgccggccactcaccggagcgc
ctgcccgcccgggctccgggtccgcgccgcagccgcgcagagtcgcgcatccccaagggc
agcgcggggacttgggagtgggaggagggggaagaagaggagagaggaaatgacaggacc
tga