GENSCAN 1.0 Date run: 4-Nov-116 Time: 01:01:01 Sequence gi568815591f:10874835_11162085 : 287251 bp : 36.38% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 18630 18674 45 2 0 141 87 35 0.567 6.39 1.02 Term + 21406 21780 375 0 0 48 41 125 0.618 -2.45 1.03 PlyA + 21888 21893 6 1.05 2.03 PlyA - 26162 26157 6 1.05 2.02 Term - 32431 32195 237 1 0 109 44 104 0.856 3.28 2.01 Init - 38094 37981 114 0 0 66 75 135 0.512 10.26 2.00 Prom - 40571 40532 40 -4.95 3.00 Prom + 45839 45878 40 -5.85 3.01 Init + 46720 46782 63 0 0 51 26 79 0.543 -0.80 3.02 Term + 52090 52533 444 0 0 70 48 302 0.995 18.65 3.03 PlyA + 52819 52824 6 1.05 4.03 PlyA - 52871 52866 6 1.05 4.02 Term - 65655 65462 194 1 2 50 45 166 0.219 5.10 4.01 Init - 77891 77846 46 2 1 86 98 32 0.406 4.90 4.00 Prom - 85740 85701 40 -3.65 5.00 Prom + 85868 85907 40 -2.95 5.01 Init + 91490 91648 159 1 0 36 68 109 0.304 3.57 5.02 Intr + 99383 99521 139 1 1 21 23 155 0.795 1.42 5.03 Intr + 100001 100111 111 0 0 97 99 130 0.996 14.43 5.04 Intr + 101438 101509 72 0 0 111 83 7 0.626 0.96 5.05 Intr + 107538 108325 788 1 2 63 61 1086 0.256 93.66 5.06 Intr + 115869 116013 145 2 1 38 90 99 0.225 4.03 5.07 Intr + 119946 120174 229 1 1 39 22 201 0.445 4.61 5.08 Term + 120866 121097 232 2 1 41 55 184 0.615 5.36 5.09 PlyA + 121807 121812 6 1.05 6.02 PlyA - 122104 122099 6 1.05 6.01 Sngl - 131914 131654 261 1 0 27 31 348 0.948 17.91 6.00 Prom - 134319 134280 40 -4.75 7.00 Prom + 135775 135814 40 -7.05 7.01 Init + 138515 138563 49 1 1 82 58 29 0.756 -1.54 7.02 Intr + 138913 139072 160 2 1 112 33 100 0.890 5.02 7.03 Intr + 148034 148145 112 2 1 66 100 115 0.988 9.96 7.04 Intr + 153847 153984 138 0 0 54 92 73 0.448 4.04 7.05 Intr + 160806 160952 147 2 0 88 80 48 0.913 3.51 7.06 Intr + 161584 161854 271 0 1 67 57 257 0.997 16.69 7.07 Intr + 162151 162257 107 2 2 45 115 75 0.997 4.91 7.08 Intr + 163926 164021 96 2 0 51 92 112 0.984 7.19 7.09 Intr + 170693 170901 209 1 2 76 97 23 0.164 -1.05 7.10 Intr + 176778 176946 169 0 1 89 20 198 0.982 12.13 7.11 Intr + 186957 187007 51 2 0 98 31 66 0.484 0.09 7.12 Intr + 187130 187251 122 1 2 79 111 93 0.910 9.07 7.13 Intr + 220162 220291 130 1 1 78 55 71 0.158 2.58 7.14 Intr + 221613 221722 110 2 2 59 44 80 0.095 -1.14 7.15 Intr + 236516 236633 118 2 1 82 116 104 0.968 12.15 7.16 Term + 249785 249817 33 1 0 99 49 22 0.058 -3.99 7.17 PlyA + 250019 250024 6 1.05 8.03 PlyA - 250352 250347 6 1.05 8.02 Term - 255521 255390 132 2 0 88 40 83 0.621 0.61 8.01 Init - 262480 262442 39 1 0 60 106 46 0.655 3.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:10874835_11162085|GENSCAN_predicted_peptide_1|139_aa PCKMYLASPSSSAMIELTKEVKGLYNENYKRLMKEIEEDTRKWNDSPRSWLGRINTVIMS LQPKAIYRFNAIPIKIPMTFITEIEKNNPTIYVEPQKTQNSQSHPEQKEENWRNHITYLQ IILQSYSDQQHGTGIKTDT >gi568815591f:10874835_11162085|GENSCAN_predicted_CDS_1|420_bp ccatgtaagatgtaccttgcttccccttcatcttctgccatgattgagctaaccaaagaa gtgaaaggtctctacaatgaaaattataaaagactgatgaaagaaattgaagaggatacc agaaaatggaatgatagcccacgttcatggcttggaagaatcaatactgttataatgtcc ctgcaacccaaagcaatctatagattcaatgcaatccctataaaaataccaatgacattc atcacagaaatagaaaaaaacaatcctacaatttatgtggaaccacaaaagacccagaat agccaaagccaccctgagcaaaaagaagaaaactggagaaatcacattacctaccttcaa attatactacagagctatagtgaccaacagcatggtactggcataaaaacagacacatag >gi568815591f:10874835_11162085|GENSCAN_predicted_peptide_2|116_aa MSLLFGESDKQIVIGCDKYYEGDVQDDKMEYNTEGKEELSGVWAVITWLCGTEASSTASW LIVCNGQWLHADIDFVDIQMIKHTLVYSYHFKTEQETADVTHHFLWASIGLPSEEP >gi568815591f:10874835_11162085|GENSCAN_predicted_CDS_2|351_bp atgtccctgttattcggggagtcagataaacagatagttataggttgtgacaagtactat gaaggagatgtgcaagatgataagatggagtacaacacagagggcaaagaagagctttct ggagtgtgggctgttataacttggctgtgtggaacagaagcttcttcaacagcttcctgg ctcattgtgtgtaatgggcagtggctgcatgctgacattgattttgtggacatccagatg ataaaacacacccttgtctactcttatcattttaagacagaacaggagacagcagatgtt actcatcacttcttgtgggcatcaattgggctgccctctgaggagccctag >gi568815591f:10874835_11162085|GENSCAN_predicted_peptide_3|168_aa MTIRFSNKEINNDLREGSISRRIVEVKDLKQTLAIKTGYQDSNAWLEWIKYSIHTLNKSN CYACVHSRPEAQTVPFPPGWSSSRSHVDFQDSTAWSNKLCQALSLLYPKVQHPVGQPLRV IQLLSPNTQFTLCLSQQGGNLAFLGDLKGCSELKNFQELINQSALVHP >gi568815591f:10874835_11162085|GENSCAN_predicted_CDS_3|507_bp atgaccattcgatttagcaataaggagatcaataacgacctcagggagggcagtatcagc aggaggatcgtagaagttaaagacttaaaacaaactttggcaattaagacaggataccaa gattcaaatgcctggttggaatggatcaaatattccatccacacgttaaacaaaagcaat tgttatgcttgtgtgcacagcaggccagaggcccagactgtcccctttccaccagggtgg tcctccagtcgatcacatgtggatttccaggattctacggcctggagtaacaagttgtgc caagctctctctctgctatatcccaaagtccagcaccctgtgggtcagcccctgagggtc atccagcttctgtctcccaacactcagttcactttgtgtctctcacaacaaggaggaaac ttagcgttccttggagacctgaagggatgcagtgagcttaagaattttcaagagcttatc aatcagtcagcccttgttcatccctga >gi568815591f:10874835_11162085|GENSCAN_predicted_peptide_4|79_aa MESEIIDSGDSEGWEGAAGEAPYSRSARPVRPRSDRIPQYVAPLESACLVNRRVLVSTKK IPLRNMHGTAMKAFHYEVE >gi568815591f:10874835_11162085|GENSCAN_predicted_CDS_4|240_bp atggagagtgaaataatagactctggagactcagaaggatgggaaggcgccgcgggagag gccccctactcacggagcgcgcggcccgtgcgcccacgctccgaccgcattcctcagtac gtggcacccttggaaagtgcctgtctggtcaatcgaagagtcctcgtttcgaccaaaaaa attcccctcagaaatatgcatggcacggcaatgaaggcatttcactatgaagtggaatga >gi568815591f:10874835_11162085|GENSCAN_predicted_peptide_5|624_aa MCLSLAESEVFMGSECRKCMLIGLWVGPEKAPSDWPKGIKELLTPGHGLYPEQHLSDLSS RPQSPTSALPGLLQPLPKSSPNDHLTDSLVSVSEASAARVDRSSKRRQVKPLAASLLEAL DYDSSDDSDFKVGDASALYLPFSWFADIWYFQRREGEGKGDSEGSGNGSEDASKDSGEGS CSDSEENILEEELNEDIKVKEEQLKNSAEEEVLSSEKQLIKMEKKEEEENGERPRKKKEK EKEKEKEKEKEKEREKEKEKATVSENVAASAAATTPATSPPAVNTSPSVPTTTTATEEQV SEPKKWNLRRNRPLLDFVSMEELNDMDDYDSEDDNDWRPTVVKRKGRSASQKEGSDGDNE DDEDEGSGSDEDENDEGNDEDHSSPASEGGCKKKKSKVLSRNSADDEELTNDSLTLSQSK SNEDSLILEKSQNWSSQKMDHILICCVCLGDNSEDADEIIQCDNCGITVHEGVKLQTFAV SVTAHKGSAYPKSEQQQDLLQRAKEQSFHSVEGDPSGLPLLAGAACSYPLLWPHPHPADR SILQRADWEAAEARREFESSAGGPALLGDPVHPPQLLAQVLSPSLPGAGSARQAHAHPEL ALARKRLAQPRFPPVPLPPHLPAS >gi568815591f:10874835_11162085|GENSCAN_predicted_CDS_5|1875_bp atgtgtctgagtctggctgagtctgaggtttttatgggctcagaatgtagaaagtgcatg ctgattggtctatgggtgggcccagaaaaagcaccatctgattggccaaaaggcatcaaa gaacttctcactcctggtcatggactctacccagaacagcatctttcggacttgtcttcg cggccccagtccccgacctcggcgctgcctgggctcctgcagcctctccctaagtcttct ccaaacgaccacctcacggattccttagtaagtgtatccgaggcctctgcggcgagagtg gatcgcagctccaagaggaggcaggtgaagcctttggcagcttctctgctggaagctctt gattatgatagttcagatgacagtgattttaaagttggagatgcctcagctctttatctc ccatttagttggtttgctgatatttggtatttccaaagaagggaaggggaagggaaagga gattctgaagggagtggtaatggaagtgaagatgcttcaaaggacagtggagaaggttcc tgtagtgattctgaagaaaatattttagaagaagaactgaatgaagatattaaagtaaaa gaagaacaacttaaaaattctgcagaggaagaagtactatcatcagaaaaacaattaatt aaaatggaaaagaaggaagaagaagaaaatggagaaagacctagaaagaaaaaggagaaa gagaaggaaaaagaaaaggaaaaggagaaagagaaggaaagagagaaggaaaaagaaaaa gcaacagtatctgagaatgtggctgcttctgctgctgccaccacaccagccacaagtcct cctgctgttaacacatccccttctgttcccactacgacaaccgctacagaggaacaagtc agcgagccaaaaaaatggaaccttcgacgaaaccgaccacttctggattttgtgtccatg gaagagctgaatgacatggatgactatgacagtgaggatgacaatgattggcgacctact gtagtaaagagaaaagggagatctgcgtctcagaaagagggaagtgatggagacaatgag gatgatgaagatgagggaagcgggagtgatgaagacgagaatgatgaaggcaatgatgaa gatcatagtagccctgccagtgaagggggttgcaagaagaagaagagtaaagttcttagc agaaacagtgctgatgatgaggaactgaccaatgatagcctgaccctatctcaaagcaag agtaatgaggactcgctgattcttgagaagagtcaaaactggagctctcaaaaaatggac catattctgatttgctgtgtttgtctgggagataatagtgaggacgctgatgaaataatt cagtgtgacaattgtggcattacagtccatgaaggagtgaagctgcagaccttcgcggtg agtgttacagctcataaaggcagcgcgtacccgaagagtgagcagcagcaagatttattg caaagagcgaaagaacaaagcttccacagtgtggaaggggacccgagcgggttgccattg ctggctggggcagcttgctcttatccccttctctggccccacccacatcctgctgatagg tccattttacagagagctgattgggaggcagctgaggcccggcgagaatttgagagcagc gccggcgggccagcactgctgggggacccggtgcaccctccacagctgctggcccaggtg ctaagcccctcactgcccggggctggcagtgcccgccaagcccatgcccacccggaactc gcgctggcccgcaagcgccttgcacagccccggttcccacccgtgcctctccctccacac ctccccgcaagctga >gi568815591f:10874835_11162085|GENSCAN_predicted_peptide_6|86_aa MSPTFQQPKTLRLWRQPRYPRKSSPRRNKLDYYTIIKFLLTTESAMKKIEDNNMLVFTEE VKANKHQIKQAVKELCDIDVAKVSTP >gi568815591f:10874835_11162085|GENSCAN_predicted_CDS_6|261_bp atgtcacccaccttccagcagcccaagacactgagactctggaggcagcccagatatcct cggaagagctcccccaggagaaacaagcttgactactataccatcatcaagtttctgctg accactgagtctgccatgaagaagatagaagacaacaacatgcttgtgttcactgaggaa gttaaagccaacaagcaccagatcaaacaggctgtgaaggagctctgtgacattgatgtg gccaaggtgtcaacaccctaa >gi568815591f:10874835_11162085|GENSCAN_predicted_peptide_7|673_aa MRFHRVGQAGLELLTSGCYGVDGESDSIMSSASENSTEPWFCDACKCGVSPSCELCPNQD GIFKETDAGRWVHIVCALYVPGVAFGDIDKLRPVTLTEMNYSKYGAKECSFCEDPRFART GVCISCDAGMCRAYFHVTCAQKEGLLSEAAAEEDIADPFFAYCKQHADRLDRKWKRKNYL ALQSYCKMSLQEREKQLSPEAQARINARLQQYRAKAELARSTRPQAWVPREKLPRPLTSS ASAIRKLMRKAELMGISTDIFPVDNSDTSSSVDGRRKHKQPALTADFVNYYFERNMRMIQ IQENMAEQKNIKDKLENEQEKLHVEYNKLCESLEELQNLNGKLRSEGQGIWALLGRITGQ ISYSTLITILSTCYPAFSSSTSTVFKETLVLDLDAFFSPSQSSVDCAGQVKTCDIKEKMD TDTEVIIGGRQCSECDQAGSSDMEADMAMETLPDGTKRSRRQIKEPVKFVPQDVPPEPKK IPIRNTRTRGRKRSFVPEEEKHEERVPRERRQRQSVLQKKPKAEDLRTECATCKGTGDNE NLVSQRQFKETSETLLSLREHKSTASNHYLQLYSNSSSNKLEQIVVIDVLIVKFLNANTD LTVPLTGERELDAIFQFIYGYFISCDECRLCYHFGCLDPPLKKSPKQTGYGWICQECDSS SSKMKERNEIQRS >gi568815591f:10874835_11162085|GENSCAN_predicted_CDS_7|2022_bp atgaggtttcaccgtgttggccaggctggtctcgaactcctcacctcaggttgttatgga gttgatggagagagtgactctattatgagttcagcttctgaaaactccactgaaccttgg ttttgtgatgcctgtaaatgtggtgtttctcctagctgtgaactgtgtcctaatcaggat ggaattttcaaggagacagatgctggaagatgggttcatattgtttgtgccctgtatgtt cctggagtagcctttggagatattgacaaattacgaccagtaacactaacggaaatgaac tattccaaatatggtgccaaggagtgtagcttttgtgaagaccctcgctttgctagaact ggggtttgcattagctgtgatgcagggatgtgcagagcctatttccatgtgacctgtgct caaaaggaaggtctgctttcagaggcagcggcggaagaggatatagcagatccattcttt gcttattgtaagcaacatgcagataggttagacagaaagtggaagagaaaaaactacttg gctctacagtcctattgtaaaatgtctttgcaagagagagagaagcaactatcaccagaa gcacaggcaaggatcaatgcccggcttcagcagtatcgtgccaaagcagaactagctcga tctaccagaccccaggcctgggttccaagggaaaaattgcccagaccactcaccagcagt gcttcagctattcgtaaacttatgcggaaagcagaactcatggggatcagtacagatatc tttccagtggacaattcagatactagttctagtgtggatggaaggagaaaacataagcaa ccagctctcactgcagattttgtgaattattattttgagagaaatatgcgcatgattcaa attcaggaaaatatggctgaacaaaagaatataaaagataaattagagaatgaacaagaa aagcttcatgtagaatataataagctatgtgaatctttagaagaactacaaaacctgaat ggaaaacttcgaagtgaaggacaaggaatatgggctttactaggcagaatcacagggcag atatcatattctactctaataactattttaagcacttgctatccagccttttcttccagt accagcactgttttcaaggagacactagttctagatttggatgctttcttcagtccatcc cagagttctgtggactgtgctgggcaggttaagacatgtgacatcaaagaaaagatggat acagatacagaagtaattattggtggaaggcagtgctcggaatgtgaccaggcagggagc agtgacatggaagcagatatggccatggaaaccctaccagatggaaccaaacgatcaagg aggcagattaaggaaccagtgaaatttgttccacaggatgtgccaccagaacccaagaag attccgataagaaacacgagaaccagaggacgaaaacgaagcttcgttcctgaggaagaa aaacatgaggaaagagttcctagagagagaagacaaagacagtctgtgttgcaaaagaag cccaaggctgaagatttaagaactgaatgtgcaacttgcaagggaactggagacaatgaa aatcttgtcagccagaggcagttcaaagaaacttcagaaactttactgagcctgagggaa cacaagtcaactgccagtaatcactaccttcagctttatagtaatagcagtagtaataaa ctagaacaaattgtagtgatagatgttcttatcgtaaagtttttgaatgctaatacagac ctgacagttccacttacaggagaaagggaactggatgccattttccagttcatttacggg tatttcatatcgtgtgatgaatgcagactctgctaccattttggctgtttggatcctcct ttgaaaaagtctcctaaacagacaggctacggatggatatgtcaggaatgtgattcttca tcttccaagatgaaagaaagaaatgagattcagagaagttaa >gi568815591f:10874835_11162085|GENSCAN_predicted_peptide_8|56_aa MAANWQALATSSKDFLESWNESQKGLGSMVAYGGSITLTGFLKLGLTASSEECGLE >gi568815591f:10874835_11162085|GENSCAN_predicted_CDS_8|171_bp atggcagccaattggcaggcactggccacctcttctaaagatttccttgaatcctggaat gaatcccagaaaggtcttggaagtatggtagcatatggtggtagcattacacttactggc ttcttaaagttaggtttgactgcgtcatctgaggaatgtggcctggagtga