GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:15:04 Sequence gi568815597f:50044956_50301175 : 256220 bp : 41.67% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2986 3123 138 2 0 55 81 69 0.531 2.41 1.02 Intr + 9838 9939 102 2 0 94 87 69 0.935 6.63 1.03 Term + 9982 10085 104 2 2 101 49 43 0.656 -0.84 1.04 PlyA + 10917 10922 6 1.05 2.00 Prom + 14171 14210 40 -4.65 2.01 Init + 22828 22997 170 0 2 62 78 160 0.841 11.45 2.02 Intr + 28170 28266 97 0 1 22 92 58 0.572 -1.31 2.03 Intr + 30510 30582 73 2 1 92 103 81 0.971 8.06 2.04 Intr + 33461 33630 170 0 2 116 22 79 0.071 2.84 2.05 Term + 46308 46418 111 2 0 94 39 82 0.041 1.48 2.06 PlyA + 47192 47197 6 1.05 3.06 PlyA - 48132 48127 6 1.05 3.05 Term - 59072 58858 215 0 2 44 38 164 0.344 3.41 3.04 Intr - 64465 64307 159 2 0 74 44 105 0.326 3.74 3.03 Intr - 77647 77593 55 1 1 37 110 75 0.189 2.23 3.02 Intr - 84984 84889 96 0 0 91 87 41 0.196 3.59 3.01 Init - 91691 91683 9 2 0 86 89 0 0.140 0.41 3.00 Prom - 93027 92988 40 -2.45 4.00 Prom + 97715 97754 40 -4.25 4.01 Init + 100014 100242 229 2 1 107 85 173 0.811 17.48 4.02 Intr + 100625 100799 175 0 1 67 53 109 0.481 3.38 4.03 Intr + 119090 119135 46 2 1 72 92 52 0.004 1.49 4.04 Intr + 119738 119795 58 1 1 50 82 55 0.002 -1.36 4.05 Intr + 132134 132237 104 0 2 115 116 80 0.332 12.47 4.06 Intr + 144353 144452 100 1 1 50 77 99 0.086 3.76 4.07 Intr + 145792 145886 95 2 2 84 84 11 0.808 -0.94 4.08 Intr + 148810 148963 154 0 1 92 42 204 0.990 14.92 4.09 Intr + 150606 150831 226 1 1 92 96 240 0.978 21.32 4.10 Intr + 152474 152512 39 2 0 114 76 25 0.544 0.32 4.11 Term + 155896 156223 328 1 1 123 38 465 0.991 38.20 4.12 PlyA + 156632 156637 6 1.05 5.03 PlyA - 156803 156798 6 1.05 5.02 Term - 163874 163656 219 2 0 108 55 73 0.385 2.06 5.01 Init - 164387 164337 51 2 0 105 80 43 0.531 6.42 5.00 Prom - 165315 165276 40 -6.75 6.00 Prom + 166665 166704 40 -4.75 6.01 Init + 172209 172279 71 2 2 46 79 84 0.317 3.87 6.02 Intr + 174290 174372 83 2 2 73 94 58 0.491 3.26 6.03 Intr + 177470 177474 5 0 2 116 119 0 0.553 -2.17 6.04 Intr + 178161 178207 47 2 2 96 41 97 0.013 2.09 6.05 Intr + 179865 180158 294 0 0 1 32 249 0.021 5.80 6.06 Intr + 180478 181243 766 1 1 79 89 200 0.535 9.74 6.07 Term + 182156 182167 12 1 0 126 33 4 0.278 -4.07 6.08 PlyA + 182771 182776 6 1.05 7.00 Prom + 183501 183540 40 -5.75 7.01 Init + 189405 189594 190 2 1 53 65 212 0.941 14.62 7.02 Intr + 190017 190117 101 1 2 51 45 109 0.318 1.81 7.03 Intr + 194873 195055 183 1 0 116 30 100 0.562 6.06 7.04 Intr + 195134 195263 130 1 1 27 56 116 0.311 1.65 7.05 Intr + 195269 195359 91 0 1 58 51 66 0.039 -1.97 7.06 Intr + 198739 198851 113 1 2 135 97 59 0.059 10.60 7.07 Intr + 199645 199750 106 2 1 77 78 97 0.191 5.95 7.08 Term + 203698 203914 217 1 1 61 48 123 0.101 1.23 7.09 PlyA + 204361 204366 6 1.05 8.03 PlyA - 204603 204598 6 1.05 8.02 Term - 206856 206528 329 1 2 97 32 151 0.787 4.29 8.01 Init - 208987 208882 106 1 1 59 111 87 0.869 8.54 8.00 Prom - 211258 211219 40 -6.55 9.00 Prom + 221761 221800 40 -1.25 9.01 Init + 222209 222262 54 1 0 47 113 43 0.106 4.03 9.02 Intr + 233820 233994 175 2 1 21 86 126 0.078 4.29 9.03 Intr + 237337 237450 114 2 0 117 19 63 0.234 1.80 9.04 Intr + 239762 239922 161 0 2 65 39 128 0.373 4.39 9.05 Term + 243266 243388 123 1 0 60 55 91 0.082 0.40 9.06 PlyA + 245995 246000 6 1.05 10.03 PlyA - 246482 246477 6 1.05 10.02 Term - 247672 247497 176 2 2 15 46 147 0.055 0.14 10.01 Intr - 255057 255004 54 1 0 121 78 13 0.093 1.63 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 144331 144452 122 1 2 93 77 119 0.855 10.52 S.002 Sngl - 197473 196757 717 1 0 58 43 277 0.807 16.17 S.003 Init + 198603 198699 97 2 1 72 97 61 0.912 5.92 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:50044956_50301175|GENSCAN_predicted_peptide_1|114_aa XRCQGHHRNLTRRSRPPSLREAPPRGGQGVAAATVAEPQSELESESGLTFIASSTSSLAL LLLEGFSQYGALTGYLRQKKSYWLAVASSGCVEATSPIQSTATAFSLSMGSLKI >gi568815597f:50044956_50301175|GENSCAN_predicted_CDS_1|345_bp ngtaggtgccagggccaccacaggaacctgactcgcaggagtcgtcctccgtctctacgc gaagcgccccctcgaggtgggcagggggtggcggcggcgacggtggcggagccgcagagc gagctagagagcgagagcgggctgaccttcatagcttcatcaacaagctcccttgcactc ttgcttctggaaggattcagccaatacggagcacttacaggatatctcagacagaagaaa agttactggctagcagtggctagcagtggctgcgtggaggccacatcacctatccaatct acagctacagctttctctctctctatgggttctctcaagatctag >gi568815597f:50044956_50301175|GENSCAN_predicted_peptide_2|206_aa MAVEASGNLHSCQKAKHTCPSSHDGRKKCQAKGGKAPYKTIKSPENSLAITRTAAWSLCT DTQSPSSLMVPGSGGYISTEEETFCQSFIKSTEEEGEGEVGALALVSSFTVIAGCRLPNC NECFTSTSLNNCASRVAHFTDVKFEAHLQGSQYQGKLGLNKGNRGLTPKPIRLQETVSGS AFDAGSVLLKCFCWHHGTGAEFSWDQ >gi568815597f:50044956_50301175|GENSCAN_predicted_CDS_2|621_bp atggctgtggaagcctcaggaaacttacattcgtgtcagaaggcgaagcacacatgtcct tcttcacatgatggcaggaagaagtgccaagcaaaaggtggaaaagccccttataaaacc atcaaatctcctgagaactcactcgctatcacaagaacagcagcatggagcctttgcaca gacacacagtctccctctagcctcatggtaccagggagtggagggtacatctctacagag gaagaaacattctgccagtccttcataaagagtactgaagaggaaggagaaggggaagtg ggagctctagccctagtctcatcatttactgtcatcgcaggttgccgtctgccaaactgt aatgagtgtttcacaagtacttcccttaacaattgtgctagcagagtggcccattttaca gatgtcaaattcgaagctcacttacaaggtagtcagtaccaggggaaactgggactgaac aaagggaacagaggtctgactccaaagccaatacgcttacaggaaaccgtaagtggatct gcttttgatgctggcagcgtcttactaaaatgcttttgctggcatcatggaacaggagca gaattcagctgggaccaataa >gi568815597f:50044956_50301175|GENSCAN_predicted_peptide_3|177_aa MSQGCCEKNNKSEGTWFRAQYTVSTQYMFIFLVQKELQKALNGFGTDGSAHIAGKTDNDL LKMTILYPHCFSIKGPTRTLYTKEIVNVGTKQQHRTKLLCKHKTGQGYPVMWLNRPVPYD CSRLIFSRLGQFQSIHVQQHKTPSQTEPLSSHQSLRTQLQTFSSCSGCLDYLFLPLN >gi568815597f:50044956_50301175|GENSCAN_predicted_CDS_3|534_bp atgtctcagggttgctgtgagaaaaacaacaaatctgaaggcacctggttcagagctcag tacacagtaagcactcagtacatgttcattttccttgtccaaaaggagctccaaaaggca ctgaatggatttgggacagatggatctgcccacattgcagggaaaactgacaacgacttg ctaaaaatgacaatcttatatccccattgcttttcaataaaaggacctacccgcactctc tacacaaaagagatagtaaatgtaggaaccaaacagcagcacagaactaaactcttgtgc aagcataagactggtcaaggctatccagtgatgtggctaaacagacctgttccatatgac tgtagccggctcattttcagtcgattagggcagttccagagcattcacgtgcaacagcac aaaactccaagtcagactgagcctctctcttcacaccagtccctccgaacacagctgcag accttttcctcctgttccggctgtctggattacctcttcttgcccctcaattga >gi568815597f:50044956_50301175|GENSCAN_predicted_peptide_4|517_aa MEPQVSNGPTSNTSNGPSSNNRNCPSPMQTGATTDDSKTNLIVNYLPQNMTQEEFRSLFG SIGEIESCKLVRDKITVFFHVPETSVLKGATVDVWTVDTTALVPGEKQALRIADYSGIQL LCSRREMLNEDLHLSKTDIKIPASEGCKDYVFHMCLMGSGSAYQLPMDMGQSLGYGFVNY IDPKDAEKAINTLNGLRLQTKTIKVKGVVGAAWEEAALSPRPDAPSVMNDNACFTHLGFT LFPLYPSSGCAELMGVKVMTIHYAFIGCQVSYARPSSASIRDANLYVSGLPKTMTQKELE QLFSQYGRIITSRILVDQVTGVSRGVGFIRFDKRIEAEEAIKGLNGQKPSGATEPITVKF ANNPSQKSSQALLSQLYQSPNRRYPGPLHHQAQRFRLDNLLNMAYGVKRFSPITIDGMTS LVGMNIPGHTGTGWCIFVYNLSPDSDESVLWQLFGPFGAVNNVKVIRDFNTNKCKGFGFV TMTNYDEAAMAIASLNGYRLGDRVLQVSFKTNKAHKS >gi568815597f:50044956_50301175|GENSCAN_predicted_CDS_4|1554_bp atggagcctcaggtgtcaaatggtccgacatccaatacaagcaatggaccctccagcaac aacagaaactgtccttctcccatgcaaacaggggcaaccacagatgacagcaaaaccaac ctcatcgtcaactatttaccccagaatatgacccaagaagaattcaggagtctcttcggg agcattggtgaaatagaatcctgcaaacttgtgagagacaaaattacagtgttcttccat gtgcctgaaacaagtgttttaaagggggccactgtggatgtgtggacagttgacacaact gcccttgtccctggggagaaacaggccctaaggatagcagattattctggaatacaacta ctctgcagtaggagggaaatgcttaatgaagatctgcatttgagtaaaacagatatcaaa atacctgcttcagagggttgtaaagattatgtctttcacatgtgtcttatgggatcgggt tccgcttaccaacttcctatggacatgggacagagtttagggtatggatttgttaactat attgatccaaaggatgcagagaaagccatcaacactttaaatggactcagactccagacc aaaaccataaaggtaaagggagttgtgggtgctgcatgggaagaggcagccctcagccct cgtcctgatgccccatctgtgatgaatgacaacgcctgctttacccaccttgggttcacc ctgttccctctgtaccctagctctggctgtgcagaactaatgggagttaaggtcatgact atccattatgcatttattgggtgccaggtctcatatgcccgtccgagctctgcctcaatc agggatgctaacctctatgttagcggccttcccaaaaccatgacccagaaggaactggag caacttttctcgcaatacggccgtatcatcacctcacgaatcctggttgatcaagtcaca ggagtgtccagaggggtgggattcatccgctttgataagaggattgaggcagaagaagcc atcaaagggctgaatggccagaagcccagcggtgctacggaaccgattactgtgaagttt gccaacaaccccagccagaagtccagccaggccctgctctcccagctctaccagtccccc aaccggcgctacccaggtccacttcaccaccaggctcagaggttcaggctggacaatttg cttaatatggcctatggcgtaaagaggttctccccaattaccattgatggaatgacaagc cttgtgggaatgaacatccctggtcacacaggaactgggtggtgcatctttgtctacaac ctgtcccccgattccgatgagagtgtcctctggcagctctttggcccctttggagcagtg aacaacgtaaaggtgattcgtgacttcaacaccaacaagtgcaagggattcggctttgtc accatgaccaactatgatgaggcggccatggccatcgccagcctcaacgggtaccgcctg ggagacagagtgttgcaagtttcctttaaaaccaacaaagcccacaagtcctga >gi568815597f:50044956_50301175|GENSCAN_predicted_peptide_5|89_aa MPDSSVPMEAACSSTNMPRGVGLGDICPLPDIYERRALTSLLKSLLVLCFPKDVSCLSLH DQPFSSYSCALPFHCCPPKLLSVASTPLP >gi568815597f:50044956_50301175|GENSCAN_predicted_CDS_5|270_bp atgcctgattcctctgtccccatggaggctgcatgcagcagtacaaatatgccaagagga gtagggcttggagatatttgccctttaccagatatttatgaacgaagggccctcacttcc ttgctcaaaagccttctggtactctgctttcctaaagatgtttcctgcctcagcctgcat gaccaacccttttcctcttattcctgtgctctccctttccactgctgcccacctaagctc cttagtgtggcctccactcctttaccctga >gi568815597f:50044956_50301175|GENSCAN_predicted_peptide_6|425_aa MSLMGSRKQQMASVAKEPREKGESVSSGQKPMEMMSWEKHEGVHKYFGGGKGLVRTFPEI EDVTSSHIVDWNSLEGSEKDKNMWKSLELPRDSLNGCDQNADDDMNNEVQAEVVSGGDEE LTENWELTGNLLGSKVHFCYALAKRLVALCPCSRHLWSFELERDDLGPGGLEGKNGSVGQ AQGPAALCSLRTCMAPCIPNAPAQPWLKGAKIKLRPLLQRVQASSLGGFHMVLGLWVCKR QELRVGVLCLDFRGCMETSGYPGRSLLQGQSPHGENSTRAMQRGNMGLDPPHRVPTGTLP RGAVRRGPPFSRPPNGRSTNSLHHTSRKATGTQCQALKADTGDVPCRATEVELPKALGAH PLHLHALDVRHGVKGDYFRTLRFNDYPAGFWTCMGPIAPLFWPISSFWNGRIYPMPAPPL YLASG >gi568815597f:50044956_50301175|GENSCAN_predicted_CDS_6|1278_bp atgagcttaatgggttctaggaaacaacagatggccagcgtggcaaaagaaccgagagag aagggagaaagtgtaagttctggtcagaaacccatggagatgatgtcttgggaaaagcat gagggagttcacaaatactttggaggaggcaaaggactggtgaggacgttcccggagatt gaagatgtgacaagttcccacatcgtagattggaacagtctggagggctcagaaaaagat aagaacatgtggaaaagtttggaacttcctagagactcattgaatggttgtgaccaaaat gctgatgatgacatgaacaatgaagttcaggctgaggtggtttcaggtggagatgaggaa cttactgagaactgggaacttactgggaacttactgggaagtaaagttcacttttgctat gctttagcaaagagactggtggcactgtgcccctgctctagacatctgtggagctttgaa cttgagagagatgatttagggccaggaggcctagaagggaaaaatggttctgtgggccag gcccagggccctgctgctctgtgcagcctcaggacatgcatggcaccctgcatcccaaat gctccagctcagccatggctaaaaggggccaagataaagctcaggccattgcttcagagg gtgcaagcctcaagccttggtggcttccacatggtgttaggcctgtgggtgtgcaaaagg caagagttgagggttggagtcctctgcctagatttcagaggatgtatggaaacgtctgga tatccaggcagaagtctgctacagggtcagagccctcatggagagaactctactagggca atgcagaggggaaatatggggttggatcccccacacagagtccccactggaacactgcct agaggagctgtgagaagagggccaccattctccagacccccaaatggtagatccactaac agcttgcaccatacgtctagaaaagccacaggcactcaatgccaggccctgaaagcagac acaggggatgtaccctgcagagccacagaagtggagctgcccaaggccttgggagcccac cccttgcatctgcatgccctggatgtgagacatggagtcaaaggagattatttcagaact ttaagatttaatgactaccctgctgggttttggacttgcatggggcctatagcccctttg ttttggccaatttcttccttttggaatggaagaatttacccaatgcctgcacccccattg tatcttgcttctggctag >gi568815597f:50044956_50301175|GENSCAN_predicted_peptide_7|376_aa MWESLELPGDLESSEDRKMQESLELPRDLLNGFDQNAVSDMDNKVQAEVVPDRDEELVGN WSKATPAMAKRNQRTAQTRASEDASPKTWQLTCGVQPIFIFKAQKCKDKQVESSPRGAQI LMGETSCQQRMTTHWYRHRALIYTPVAQISAMTEGSTVKSGKKRCEAGGVGCSKGAQTQN QESQVQSPPSCVTRCDSCPPFCRIIIPAEQGGHRFNGNTESTWCKVGVKEIRFETPRHKK KPSLTFFFPFQTEQVPELVKHKCKIMKNNRSLPGEPTTTSGASSYDYAFRCPSLPSASEH LEGRVCQNQNSRAHVSQGLTQGPSMGLLRKGVMILSQPLTDATAHKGWKGHWKYTGPTLE DVNCGGICLFWSPYTG >gi568815597f:50044956_50301175|GENSCAN_predicted_CDS_7|1131_bp atgtgggaaagtttggaacttcctggagacttggaaagctcagaagacaggaagatgcag gaaagtttggaacttcctagagacttgttgaatggctttgatcaaaatgctgttagtgat atggacaataaagtccaggctgaggtggtcccagatagagatgaggaacttgttgggaac tggagtaaagccactccagccatggctaaaaggaaccaacgcacagctcagaccagggct tcagaagatgcaagccctaagacttggcagcttacatgtggtgttcagcctatatttatt ttcaaggcccaaaaatgcaaagataaacaagtagagtcctcaccccgaggagctcagatt ctaatgggagagacatcatgtcaacagaggatgacaacccactggtacagacacagagca ctgatctacactcccgtagcacagatcagtgccatgacagagggaagcacagtgaagagt ggaaagaagcggtgtgaagcagggggagtgggatgcagtaaaggagcacagacacagaat caggagagccaagttcagagccctcccagctgtgtgacccgatgtgactcatgtcccccc ttctgcagaataataatacctgctgaacaaggtggtcacaggttcaatggcaatacagaa agtacgtggtgtaaagtaggtgtcaaggaaatcagatttgagacaccacgacacaagaag aaaccatccctgactttcttcttcccttttcaaactgaacaagttcctgaacttgtcaaa cataaatgcaaaattatgaaaaacaatcgaagtcttcctggagagcccactaccacctct ggtgcctcaagttatgattacgccttcagatgtccatctctcccatctgcaagtgagcac ctggaaggcagggtatgtcagaaccaaaattctcgggctcatgtcagtcaaggcctcacc cagggaccctccatgggccttctcaggaaaggagtaatgattctcagccagcccctcaca gatgccacagcacataaaggttggaaaggccattggaaatacacgggccccacactagag gatgtgaactgtggaggcatttgcctcttctggtctccttacacaggctag >gi568815597f:50044956_50301175|GENSCAN_predicted_peptide_8|144_aa MESLAQGHPTELVPFALPKLLLSGFGKGFLSLRETESQVPTVFTGVWGKEAEVGKYGLTS IPRKPGAGLKGKRGSKQHGQHGQTEASVCPEDPSHRFLQSCGRLMKATGLNTPPLPVGLL FASPDSCREKIRAGLGVFRPGFKS >gi568815597f:50044956_50301175|GENSCAN_predicted_CDS_8|435_bp atggagtcacttgcccagggtcatccaactgagctggtgccctttgcactccccaagctg cttctctctgggtttggtaaaggttttttgtctttacgagaaacagaaagccaggttccc actgtatttacaggggtctggggaaaggaagcagaagtggggaagtacggcttgacttca atccccagaaagccaggggctgggctgaaggggaaaaggggaagtaaacagcatgggcag catggacaaacagaggcctcagtgtgccctgaggacccctcccacagattcctccaatct tgtggaaggctgatgaaagcaactggtctgaacacccctcctctgccagtgggtcttctc tttgcctctcctgacagctgcagggagaagatacgtgctggacttggcgtcttcaggcct gggttcaagtcctag >gi568815597f:50044956_50301175|GENSCAN_predicted_peptide_9|208_aa MLSEDSMLEERDEFCFGKCNISGKFQQVGWALECDHLHKASLGSAADAVIAATKGPERMT QSPKFGGQGHLRGVKEGPQRIGRCLPTLIREIFFTRLLIQMFTSSGNTLTNTTRKWEILW GTDLHLRHLWVSGASHRILHEYKLRDFLLKWMTKRASIQKRLQRHEKNPSPELLAFLRIQ TLLLLSGVTNTLSLEDFTPSRDSGSSLL >gi568815597f:50044956_50301175|GENSCAN_predicted_CDS_9|627_bp atgctatctgaagacagcatgttggaggaaagagatgaattctgttttggaaagtgcaat atttctggaaagttccaacaagtgggctgggcactggagtgtgaccatctccataaagct tccttaggctctgcggcagacgctgtcatagccgccacaaaaggcccagagagaatgacg cagtctccaaaatttgggggccaagggcacctcaggggtgtcaaagaaggccctcagcgg attggacgatgcctgcccacactgataagggagatattttttactcgtctactgattcaa atgtttacgtcttctggaaacaccctcacaaacacaacaagaaagtgggagattctctgg ggcacagacctacatctgaggcatctgtgggtctctggggcctcgcacagaatcctgcat gagtacaagctcagggactttttgctgaagtggatgacaaagagagcaagcatccaaaaa cgactgcagaggcatgagaagaacccctcccctgaactgctggcatttctcaggatccag acactgcttctcctttctggtgtcaccaacacactctccttggaggatttcacaccttcc agggattcaggttccagcctcctctga >gi568815597f:50044956_50301175|GENSCAN_predicted_peptide_10|76_aa XTLESQTVMTCLMCFLAHDEETEAEMQGIDFVPFDTASEWQQQDPWPRTHSVAQPQTTAF RVHALQLFSDGSNDKC >gi568815597f:50044956_50301175|GENSCAN_predicted_CDS_10|231_bp ngaaccctggaatcacagactgtcatgacttgtctaatgtgctttctggcacatgatgag gaaactgaggctgagatgcaggggattgactttgtcccctttgacacagccagtgagtgg cagcagcaggacccctggcccagaactcattccgtggctcagcctcaaaccacggccttc cgagtccatgctttgcagctgttctctgatgggtcgaatgacaaatgctga