GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:55:09 Sequence gi568815591r:20684293_20885762 : 201470 bp : 39.37% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1404 1544 141 2 0 71 80 69 0.830 4.03 1.02 Intr + 4961 5123 163 1 1 102 53 127 0.023 9.23 1.03 Intr + 14227 14258 32 2 2 3 103 13 0.006 -8.77 1.04 Intr + 15533 15637 105 1 0 101 115 31 0.219 6.59 1.05 Intr + 20432 20515 84 1 0 109 30 73 0.167 2.70 1.06 Intr + 38724 38927 204 2 0 32 93 147 0.022 8.07 1.07 Intr + 42748 42848 101 0 2 38 60 101 0.965 0.29 1.08 Intr + 44023 44163 141 1 0 62 103 64 0.898 3.85 1.09 Intr + 54691 54847 157 1 1 49 74 206 0.100 14.39 1.10 Intr + 58585 58782 198 0 0 90 83 147 0.992 13.03 1.11 Intr + 60940 61146 207 0 0 71 94 254 0.999 22.65 1.12 Intr + 69068 69214 147 1 0 48 87 145 0.981 9.91 1.13 Term + 71135 71332 198 1 0 115 44 111 0.994 5.82 1.14 PlyA + 72358 72363 6 1.05 2.00 Prom + 75752 75791 40 -7.05 2.01 Init + 76173 76313 141 2 0 52 83 131 0.929 9.18 2.02 Intr + 76578 76799 222 2 0 115 72 44 0.815 2.90 2.03 Term + 81032 81091 60 1 0 4 43 173 0.915 1.23 2.04 PlyA + 81106 81111 6 1.05 3.00 Prom + 92637 92676 40 -7.05 3.01 Init + 93954 94163 210 2 0 95 36 150 0.908 7.36 3.02 Term + 99305 99925 621 1 0 86 45 470 0.998 35.92 3.03 PlyA + 99934 99939 6 -10.43 4.08 PlyA - 99958 99953 6 -10.44 4.07 Term - 101503 99998 1506 1 0 156 48 1496 0.902 141.93 4.06 Intr - 109241 109116 126 2 0 62 36 87 0.524 0.86 4.05 Intr - 110820 110662 159 0 0 27 45 195 0.429 8.36 4.04 Intr - 113172 113037 136 2 1 72 -34 194 0.234 5.25 4.03 Intr - 113768 113693 76 0 1 63 69 84 0.993 1.65 4.02 Intr - 114896 114716 181 2 1 55 28 190 0.531 8.22 4.01 Init - 135227 135174 54 2 0 60 76 42 0.347 1.53 4.00 Prom - 137470 137431 40 -3.65 5.00 Prom + 137765 137804 40 -7.65 5.01 Init + 142207 142267 61 0 1 59 49 36 0.470 -1.74 5.02 Intr + 143026 143258 233 2 2 104 83 297 0.564 27.37 5.03 Term + 143950 144048 99 0 0 95 44 91 0.952 2.55 5.04 PlyA + 146323 146328 6 1.05 6.00 Prom + 153048 153087 40 -6.95 6.01 Init + 159422 159671 250 1 1 27 75 238 0.047 14.07 6.02 Intr + 181621 181698 78 2 0 98 99 27 0.012 3.40 6.03 Term + 198448 198470 23 2 2 119 53 19 0.017 -1.00 6.04 PlyA + 199585 199590 6 1.05 7.02 PlyA - 200377 200372 6 1.05 7.01 Sngl - 200914 200498 417 1 0 60 42 205 0.813 9.15 7.00 Prom - 201217 201178 40 -3.65 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 4961 5149 189 1 0 102 53 153 0.970 9.67 S.002 Term + 51278 51368 91 2 1 86 40 139 0.872 5.11 S.003 Init + 54719 54847 129 1 0 82 74 169 0.875 15.11 S.004 Sngl + 159422 159691 270 1 0 27 49 258 0.874 10.83 S.005 Init - 182952 182868 85 0 1 57 98 61 0.872 5.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:20684293_20885762|GENSCAN_predicted_peptide_1|625_aa DIKKADEQMESMTYSTERKTNSLPLHSVKSIKSDFIDKAEESTQSKEQGGEAGLNGKGTA VVHWMAEQLLWYLAGGACWKSALKGGKGDLPQDGAELQNLSVFSIIFAKIITMFGNNDKT TLKHDAEIYSMIFVILGVICFVSYFMQDIAWFDEKENSTGGLTTILAIDIAQIQGATGSR IGVLTQNATNMGLSVIISFIYGWEMTFLILSIAPVLAVTGMIETAAMTGFANKDKQELKH AGKIATEALENIRTIVSLTREKAFEQMYEEMLQTQHRNTSKKAQIIGSCYAFSHAFIYFA YAAGFRFGAYLIQAGRMTPEGMFIVFTAIAYGAMAIGETLVLAPEYSKAKSGAAHLFALL EKKPNIDSRSQEGKKPDTCEGNLEFREVSFFYPCRPDVFILRGLSLSIERGKTVAFVGSS GCGKSTSVQLLQRLYDPVQGQVLFDGVDAKELNVQWLRSQIAIVPQEPVLFNCSIAENIA YGDNSRVVPLDEIKEAANAANIHSFIEGLPEKYNTQVGLKGAQLSGGQKQRLAIARALLQ KPKILLLDEATSALDNDSEKVVQHALDKARTGRTCLVVTHRLSAIQNADLIVVLHNGKIK EQGTHQELLRNRDIYFKLVNAQSVQ >gi568815591r:20684293_20885762|GENSCAN_predicted_CDS_1|1878_bp gatattaaaaaagctgatgaacagatggagtcaatgacatattctactgaaagaaagacc aactcacttcctctgcactctgtgaagagcatcaagtcagacttcattgacaaggctgag gaatccacccaatctaaagagcaaggcggagaggcaggcctcaatggaaagggcacagct gtggttcactggatggcagagcagttgctgtggtaccttgctggtggggcttgctggaaa tctgccctcaagggtggcaagggagaccttccacaggacggtgctgagcttcagaacctg tcagtattttccatcatctttgcaaaaattataaccatgtttggaaataatgataaaacc acattaaagcatgatgcagaaatttattccatgatattcgtcattttgggtgttatttgc tttgtcagttatttcatgcaggatattgcctggtttgatgaaaaggaaaacagcacagga ggcttgacaacaatattagccatagatatagcacaaattcaaggagcaacaggttccagg attggcgtcttaacacaaaatgcaactaacatgggactttcagttatcatttcctttata tatggatgggagatgacattcctgattctgagtattgctccagtacttgccgtgacagga atgattgaaaccgcagcaatgactggatttgccaacaaagataagcaagaacttaagcat gctggaaagatagcaactgaagctttggagaatatacgtactatagtgtcattaacaagg gaaaaagccttcgagcaaatgtatgaagagatgcttcagactcaacacagaaatacctcg aagaaagcacagattattggaagctgttatgcattcagccatgcctttatatattttgcc tatgcggcagggtttcgatttggagcctatttaattcaagctggacgaatgaccccagag ggcatgttcatagtttttactgcaattgcatatggagctatggccatcggagaaacgctc gttttggctcctgaatattccaaagccaaatcgggggctgcgcatctgtttgccttgttg gaaaagaaaccaaatatagacagccgcagtcaagaagggaaaaagccagacacatgtgaa gggaatttagagtttcgagaagtctctttcttctatccatgtcgcccagatgttttcatc ctccgtggcttatccctcagtattgagcgaggaaagacagtagcatttgtggggagcagc ggctgtgggaaaagcacttctgttcaacttctgcagagactttatgaccccgtgcaagga caagtgctgtttgatggtgtggatgcaaaagaattgaatgtacagtggctccgttcccaa atagcaatcgttcctcaagagcctgtgctcttcaactgcagcattgctgagaacatcgcc tatggtgacaacagccgtgtggtgccattagatgagatcaaagaagccgcaaatgcagca aatatccattcttttattgaaggtctccctgagaaatacaacacacaagttggactgaaa ggagcacagctttctggcggccagaaacaaagactagctattgcaagggctcttctccaa aaacccaaaattttattgttggatgaggccacttcagccctcgataatgacagtgagaag gtggttcagcatgcccttgataaagccaggacgggaaggacatgcctagtggtcactcac aggctctctgcaattcagaacgcagatttgatagtggttctgcacaatggaaagataaag gaacaaggaactcatcaagagctcctgagaaatcgagacatatattttaagttagtgaat gcacagtcagtgcagtga >gi568815591r:20684293_20885762|GENSCAN_predicted_peptide_2|140_aa MKAAVQRDQDPSESSVNVSVLQQGSPIPRPRTSTNPWPVRNRAAEQEMGPSSCRKTSSGL PLILHYGDLYNYFIICYNVIIIEIKYSINVKMHLNHPKTTAPLPRSVEKLSSRKLVPGAK KNNKKKKKKKKKKKKKKKKK >gi568815591r:20684293_20885762|GENSCAN_predicted_CDS_2|423_bp atgaaggcagctgtccagagagatcaagatccatcagagtcatctgtaaatgttagtgtg ttacagcaggggtccccaatccccagaccacggaccagtaccaatccatggcctgttagg aaccgggctgcagagcaggagatgggaccatctagttgcaggaaaacaagctcaggtctc ccactgatcctacattatggtgatttgtataattattttattatatgctacaatgtaatc ataatagaaataaagtactcaataaatgtaaaaatgcacttgaatcatcccaaaaccact gccccacttccccggtccgtggaaaaactgtcttccaggaaactggtccctggtgccaaa aagaataataagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaaa taa >gi568815591r:20684293_20885762|GENSCAN_predicted_peptide_3|276_aa MGCSGPQPEGSGGPGPGPGLAGRPQTTTSLFAATCKRTVRIASSEPSPGGPRGLSQCTGH FEELGRGKRRGTKKKRQTDIGRSKLYKLWVSGCQKSPPPSPDSLSRTSGDAQGPGRVLRS RGAQYSGSGTKQPSARAALFCPGLQPLPLHRSAQACQPSRSCPREARCLPGTGLVQLYLQ WPFPLKVTPFTTWGVEEGRTRKREREIKARRGAGSGCNGFRQRYPWKGKARARDSDACYL LVHIPFTKLSHRRKKEEGQKQKETESESDAIGKGWS >gi568815591r:20684293_20885762|GENSCAN_predicted_CDS_3|831_bp atgggctgctccggcccgcagcccgaggggagcggcgggcccgggccggggccagggctc gcggggagaccccagaccaccacctcactcttcgctgctacgtgcaagcggaccgtgcgg atcgccagttcggaaccctccccgggagggccccggggtctttctcaatgtactgggcac ttcgaggagctgggaagaggaaagcggaggggaacaaagaaaaagaggcagactgacatt gggagatctaaactctacaaactctgggtctcgggctgccaaaagtcacccccaccaagc cccgacagcctgagtaggacgtccggcgatgcccaggggcctggcagggtgctccgaagc cgaggtgctcagtacagcggctctggaaccaagcagccgtctgcccgggcggcgctgttc tgcccaggcctccagcctctccctctccaccgctccgctcaggcttgtcagccgagccgc tcctgcccgcgcgaggcccgctgtctaccaggcactggattagtccagctctacctccag tggccgttccctctaaaagttacgcccttcacaacctggggggtggaggaggggaggaca aggaagagagaacgagaaataaaggcccgacgaggcgcgggttccggctgcaacgggttc aggcagcgttacccgtggaaagggaaagccagggcccgggacagcgatgcgtgttactta cttgtccatatcccctttacaaagttaagtcacagaaggaagaaggaagaggggcagaaa cagaaagagacagagagcgagtcggatgcaatagggaaaggctggagttga >gi568815591r:20684293_20885762|GENSCAN_predicted_peptide_4|745_aa MKEDGPGLENSGSDSMLLHQQREGSTLAYGQTSIEKIQYVFNAELTFKRDKTTLAMRALA LSLGSLREILSPLLQLAGARALRAGEQAGTRVCPGRRPAPGAGRWVKALKRDKPVWQRIP GIRETGSRSTSEVRWTIAGKRSSSRLRGLNSPVSSKTVDVFLAGENCECAAHWDKIFVKS CKTPELSREKCFKNERGQSFRWPPPLPPHPLLFLDSESNDSKARDFVRVHWAEITWPLWP EMGQEEPRLGSTPLAMLAATCNKIGSPSPSPSSLSDSSSSFGKGFHPWKRSSSSSSASCN VVGSSLSSFGVSGASRNGGSSSAAAAAAAAAAAAAALVSDSFSCGGSPGSSAFSLTSSSA AAAAAAAAAAASSSPFANDYSVFQAPGVSGGSGGGGGGGGGGSSAHSQDGSHQPVFISKV HTSVDGLQGIYPRVGMAHPYESWFKPSHPGLGAAGEVGSAGASSWWDVGAGWIDVQNPNS AAALPGSLHPAAGGLQTSLHSPLGGYNSDYSGLSHSAFSSGASSHLLSPAGQHLMDGFKP VLPGSYPDSAPSPLAGAGGSMLSAGPSAPLGGSPRSSARRYSGRATCDCPNCQEAERLGP AGASLRRKGLHSCHIPGCGKVYGKTSHLKAHLRWHTGERPFVCNWLFCGKRFTRSDELQR HLRTHTGEKRFACPVCNKRFMRSDHLSKHVKTHSGGGGGGGSAGSGSGGKKGSDTDSEHS AAGSPPCHSPELLQPPEPGHRNGLE >gi568815591r:20684293_20885762|GENSCAN_predicted_CDS_4|2238_bp atgaaggaggatggacctgggttagaaaattcaggttccgattcaatgctcttgcaccag cagcgtgagggttccactctagcatatggtcaaacttctatcgagaagatccagtacgta ttcaatgcagagcttaccttcaaaagagataaaactaccttggcaatgagagcactagca ctgtctttgggatccctgcgggagatcctctccccacttctacagttagcgggtgcccgg gccttgcgggccggcgagcaggcggggacgcgtgtgtgccctggccggcgacctgcgccg ggtgccgggagatgggtcaaagcattaaaacgggacaaacctgtatggcaacgcatccca ggtatacgtgaaaccggaagcaggagcacttccgaagttcgttggaccatcgcggggaag cgcagctcctcaaggctccgtgggttgaactctccagtcagcagcaaaaccgtcgatgtt ttcttagcaggagaaaactgcgaatgtgcggcccactgggacaagatctttgtgaagtcc tgcaaaacccctgaactcagtcgtgaaaaatgtttcaaaaatgagcggggccaaagcttt cgatggccacccccactgccaccacaccccctgcttttcctggactcagagtcgaatgat tctaaagcacgcgattttgttcgtgttcactgggccgaaataacgtggccattatggcct gaaatgggccaggaagaaccgaggttgggatcgactcctctggccatgcttgccgctacc tgtaataagataggcagccccagcccgtctccctcctccctctcggacagctcttcttcc ttcggcaaaggcttccacccctggaaacgctcctcgtcctcttcttccgccagctgcaac gtagtgggttccagtctctcaagcttcggcgtgtccggggcctccaggaacggcggctcg tcctcggcggctgcggcggccgcggcagcagccgcggctgccgcggccctggtgtccgac tcgttcagctgcggcggctcgcctggctccagcgccttctccctcacctccagcagcgcc gcagccgccgccgccgccgccgcagccgccgcctccagctcgcccttcgccaacgactac tctgttttccaggcccccggagtttccgggggcagcggcggcggcggcgggggcggcggc ggcggctcctccgcgcactcgcaggacggctcccaccagccggtgttcatctccaaggtg cacacctctgtggacgggctgcagggcatctacccgcgggtgggcatggcgcacccgtac gagtcgtggtttaagccctcgcacccgggcctgggtgctgcgggcgaggtgggctcggcc ggcgcctccagctggtgggacgtgggggccggctggatcgacgtgcagaacccgaacagc gcggctgcgctgcccggctcgctgcaccctgccgccggggggctccaaacctcgctgcac tcgccgctcggaggctacaactcggattactcgggcctgagtcactcggccttcagcagc ggcgcctcctcgcacctgctcagccccgccgggcagcacctcatggacggcttcaagcca gtgctacccggctcctacccggactcggccccgtcgccgctggccggcgcggggggctcc atgttgagcgctgggccttcggcgccgctggggggctccccgcgctcctcagctcgccgc tactccggccgcgccacctgcgactgccccaactgccaggaggcagagcggctgggccct gccggggcgagcttgcggcgcaagggcctgcacagctgccacatcccgggctgcggcaag gtgtacggcaagacttcgcacctcaaggcgcacctgcgctggcacacgggcgagcggccc ttcgtgtgcaactggcttttctgcggcaagcgcttcacgcgctccgacgagctgcagcgg cacctgcggacccacaccggcgagaagcgcttcgcctgtccagtttgcaacaagcgcttc atgcgcagcgaccacctcagcaagcacgtgaagacgcacagtggcggcggcggcggcggc ggctcggcgggctcgggcagcggcggcaagaagggcagcgacaccgacagcgagcacagc gccgcgggcagcccgccctgccactccccagagctgctgcagccccccgagcccgggcac cgcaacggcctagagtga >gi568815591r:20684293_20885762|GENSCAN_predicted_peptide_5|130_aa MGKKELKRIITRKEAFGVLGAFKMWKRGRGGSSGVNFRISVGLPVGAVINCADNTGAKNL YIISVKGIKGRLNRLPAAGVGDMGMATVKKGKLELRKKCSGTLVKGTPLENTEGGMSGGS ASPWLPTMNA >gi568815591r:20684293_20885762|GENSCAN_predicted_CDS_5|393_bp atgggaaaaaaagaacttaagaggattataacaagaaaggaagcttttggagtcctggga gcgttcaagatgtggaagcgaggacgtggtgggtcctctggtgtgaacttccggatttct gtgggtcttccggtaggagctgtgatcaactgtgctgacaacacaggagccaaaaacctg tatatcatctccgtgaaggggatcaagggacggctgaacagacttcccgctgctggtgtg ggtgacatggggatggctacagtcaagaaagggaaactagagctcagaaaaaagtgctca ggaactctggtcaaaggaacaccactagaaaacactgaaggaggcatgtcaggtggttct gcttctccatggctccctacgatgaatgcctag >gi568815591r:20684293_20885762|GENSCAN_predicted_peptide_6|116_aa MPNAIKKFVIRNTVEATAVRDISKVSIFNSEVLPKLYGKLRCAVSCAIHSKVVRNRSCEA CKDRTPSPSLDLRVLPHDPRQSPFSKGCKSRSTILPKAVSQVLIPHVTLGVNEASS >gi568815591r:20684293_20885762|GENSCAN_predicted_CDS_6|351_bp atgcccaatgctattaaaaagttcgtcattcgaaacacagtagaggccacagctgtcagg gacatttctaaagtgagtatcttcaacagcgaagtgcttcccaagctgtatgggaagtta cgttgcgctgtgagttgtgccattcacagcaaggtagtcaggaatcgatcttgtgaagcc tgcaaggaccgaacaccttcacccagtttagacctgcgggtgctgccccatgacccccgc caaagcccattttctaaaggttgtaaaagcaggtcaacaatactgccaaaagcagtttct caggtgctgattccccacgttacccttggagtcaatgaagcaagttcatga >gi568815591r:20684293_20885762|GENSCAN_predicted_peptide_7|138_aa MSELPFTIATKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIV KMVILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKTILSKNNKTGGIMLP DFKPYYKATVTKTACYWY >gi568815591r:20684293_20885762|GENSCAN_predicted_CDS_7|417_bp atgagtgaactcccattcacaattgctacaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactactcaatgaaataaaa gaggacacaaacaaatggaagaatattccatgctcatggataggaagaatcaatatcgtg aaaatggtcatactgcccaaagtaatttatagattcaatgccattcccatcaagctacca atgactttcttcacggaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcattgccaagacaatcctaagcaaaaacaacaaaactggaggcatcatgctacct gacttcaaaccatactacaaagctacagtaaccaaaacagcatgttactggtactaa