GENSCAN 1.0 Date run: 3-Nov-116 Time: 01:15:38 Sequence gi568815584r:74563723_74784681 : 220959 bp : 45.67% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 10821 10889 69 2 0 78 65 151 0.827 12.85 1.02 Intr + 11527 11660 134 0 2 59 50 66 0.227 -0.66 1.03 Intr + 21927 22416 490 0 1 115 46 331 0.930 24.61 1.04 Intr + 23068 23148 81 0 0 84 111 79 0.996 9.63 1.05 Term + 24953 25108 156 1 0 75 51 86 0.674 1.53 1.06 PlyA + 25269 25274 6 1.05 2.00 Prom + 32335 32374 40 -3.46 2.01 Init + 33460 33664 205 0 1 75 80 82 0.291 4.27 2.02 Intr + 46746 46973 228 1 0 80 52 139 0.035 7.34 2.03 Intr + 47393 47507 115 0 1 92 91 39 0.075 4.11 2.04 Intr + 47871 48148 278 0 2 58 84 224 0.066 16.16 2.05 Intr + 48832 48908 77 2 2 92 18 19 0.010 -5.47 2.06 Intr + 49845 49975 131 2 2 94 90 -5 0.018 -0.21 2.07 Intr + 52105 52239 135 1 0 59 70 78 0.106 2.78 2.08 Intr + 53678 53780 103 2 1 33 116 73 0.112 4.78 2.09 Intr + 60471 60654 184 2 1 60 115 123 0.927 11.56 2.10 Term + 68309 68391 83 0 2 88 48 104 0.882 4.26 2.11 PlyA + 69253 69258 6 1.05 3.00 Prom + 71227 71266 40 -3.66 3.01 Init + 84513 84546 34 2 1 87 72 43 0.243 2.63 3.02 Intr + 88543 88659 117 2 0 46 115 64 0.722 5.34 3.03 Intr + 88688 88810 123 0 0 33 101 47 0.625 1.06 3.04 Term + 90049 90212 164 2 2 54 51 77 0.329 -1.30 3.05 PlyA + 93182 93187 6 1.05 4.19 PlyA - 95952 95947 6 1.05 4.18 Term - 100100 99998 103 1 1 110 47 153 0.996 11.15 4.17 Intr - 100223 100177 47 2 2 94 78 12 0.584 -1.99 4.16 Intr - 101203 101114 90 1 0 93 94 34 0.955 4.69 4.15 Intr - 103655 103597 59 0 2 107 107 55 0.999 7.90 4.14 Intr - 103872 103743 130 0 1 114 81 78 0.998 9.97 4.13 Intr - 106052 105927 126 2 0 102 100 116 0.999 15.08 4.12 Intr - 106404 106225 180 0 0 95 87 142 0.959 14.86 4.11 Intr - 107149 107040 110 2 2 66 64 72 0.993 2.60 4.10 Intr - 107761 107686 76 1 1 92 86 55 0.977 4.79 4.09 Intr - 109230 109109 122 1 2 113 86 60 0.633 8.51 4.08 Intr - 109496 109355 142 2 1 72 66 120 0.940 8.13 4.07 Intr - 110389 110312 78 1 0 80 80 125 0.997 10.65 4.06 Intr - 112224 111977 248 1 2 86 53 216 0.853 15.08 4.05 Intr - 112599 112419 181 0 1 92 103 83 0.999 9.64 4.04 Intr - 113030 112861 170 0 2 86 99 29 0.977 3.47 4.03 Intr - 119835 119574 262 0 1 92 82 168 0.717 13.66 4.02 Intr - 120958 120732 227 2 2 88 75 222 0.962 18.60 4.01 Init - 139280 139148 133 2 1 53 38 138 0.561 5.60 4.00 Prom - 140646 140607 40 -1.66 5.04 PlyA - 140946 140941 6 1.05 5.03 Term - 147071 147009 63 2 0 138 43 -5 0.043 -2.11 5.02 Intr - 149254 149211 44 2 2 85 64 31 0.011 -1.64 5.01 Init - 166127 165062 1066 2 1 66 40 405 0.619 27.99 5.00 Prom - 166278 166239 40 -2.16 6.00 Prom + 187146 187185 40 -3.56 6.01 Init + 199768 200640 873 0 0 36 110 434 0.903 33.73 6.02 Intr + 214725 214961 237 2 0 65 91 146 0.757 10.41 6.03 Term + 216683 216928 246 1 0 68 48 153 0.709 5.09 6.04 PlyA + 220715 220720 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 74027 74136 110 0 2 86 53 85 0.870 3.47 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584r:74563723_74784681|GENSCAN_predicted_peptide_1|309_aa MPETQQGAINPTDDEDDDDELEKVFVWVGEMHVNTTKAQRSIPLSGEPRRDLNSQTQLER TALKQYTLLAKPMPAPTPTLLLISLAMPGKAKEGGEERREDSAAFSTSLHQTVQRKKPLQ KETEPQTPSPMEKGTYPAGGDCGAGGDCGAGGCALAKVPSPAALLLRRLGERSAQRRGAS LEFWGSNSSSGMTSSQRAPRKPERQTQSCGRLQEPRFWHGGSQTAEDTGREQGRTLDCGQ RKHGGQNIAAGTTAAAGQILSDECDNRPGTVLCIHALTEASRQFYAAGVTMIPILQIMLL KLREVKRLA >gi568815584r:74563723_74784681|GENSCAN_predicted_CDS_1|930_bp atgcctgaaacacagcagggtgcaataaatcctactgatgatgaagatgatgatgatgaa ctggaaaaggtatttgtgtgggttggagagatgcacgttaacactacaaaggcccagaga tccatccccctttcaggggaacctcgaagggacctcaacagccaaacccagctggaaagg acggccttgaagcaatacactttgctggcaaagcccatgccagccccaacacctactctc ctgctcatatctctggccatgccagggaaagccaaggagggtggggaggagagaagggag gactctgcggccttctccaccagccttcaccaaacggtccaaaggaagaagcccctccaa aaagagactgagccccagaccccaagccccatggagaaggggacttacccagctggtggc gactgtggtgcgggcggcgactgtggtgctggcggctgtgctctggccaaggtgccctct ccagccgcactgctcctgcgcaggttgggtgaacgctcggcccagcgtcgaggtgccagc ctggagttctgggggtcaaattcctcatcgggaatgacctcctcgcagcgggctccacgg aaaccagagcggcagacacagagctgcgggcggctgcaggagccccggttctggcacggc ggctcgcaaacggctgaggacacagggagagagcaaggcagaactctcgactgcggccag agaaagcacggtggccagaatatagctgccggcaccacagccgctgccggccagatactg tctgatgagtgtgacaataggcctggcaccgtgctctgcatacacgctctcactgaagcc tctcgacaattctatgcagcaggtgtgactatgattcccattttacagataatgctgctg aagctcagagaggttaagagacttgcctga >gi568815584r:74563723_74784681|GENSCAN_predicted_peptide_2|512_aa MAAFGTSRQSEPRALLLVLHVNCRAQCLLALNTQGQFMLRLAPMASLFGVRDLEGLGKVL GKPNIKTSGLNPTGKSGPGKERLPKPFPDFPMVPPPSPSLPRAQCPPFQRHLPYPGKAER ARSHDDPIGDGDYWGIFSLCDGARDPGSAARVYKSSLVAGRRNAGEWRGALRKDLTSGLR ESWWLDTPRLLRAGRPPGLRLGGPPGAPPARLGPLHGLQARDRRVLLPEQTVHLGCSRCR VAAPRAPQSIRVSTSRLVSPYGVPLGMTRAHEESQGTVFGSGDPGCSLAGLKEAGRGAEA RPKGLDQQGCPWHICTAYFPERQSEGAGMWAPVHNLPHHTKDKPSHYYCSVSSVCISGSL PPIHPSPFASAKTLTQTNFPSPSGAPSTPQIHNHPALVCRCLSPPSGYDMLAGGDSHGMD VPAKIEKPYVLDIKQIPEECLVHFSHYFLSIPLSGIRVAKKNKAKDGPYPQAAASIVGKA DSTYHENSYQKQSLWIQFDPVHSWFSINIGCR >gi568815584r:74563723_74784681|GENSCAN_predicted_CDS_2|1539_bp atggcagcatttggcacatccaggcagagtgagccccgggcattactgttagtattgcat gtcaattgtcgggcacagtgcctgctggcactcaatacccaggggcagttcatgctccgc ttggctcctatggcctccctctttggggtgagggatttggaaggtctggggaaagtcctg gggaagcccaacattaagacgtcaggactaaacccaactggcaaatcgggcccaggcaaa gagaggctgccaaagcccttcccagacttcccgatggtgcccccacccagcccctcactg cccagggctcagtgccctccattccagaggcacctgccatacccaggcaaggcggagcgt gcccgcagccatgatgatccgattggggatggggattactggggcatcttctcactctgc gacggcgcccgcgatcccggcagcgctgcaagagtctacaagagctcgttagttgcaggc aggaggaacgcgggcgagtggagaggcgccttgcgaaaggatttaacgtcggggctccgg gagagttggtggctggacacgccgcgactgctgcgcgcgggacggcctcctggcctccgc ctcggtgggcctcctggggctcccccagcccggctgggcccgctccacgggctgcaagcc cgcgacaggcgcgtcctgctcccggaacagactgtacaccttggctgcagccgctgccgg gtagctgcccccagggcgccgcagtcgattcgcgtctccaccagccggctcgtatctccc tacggggtccctttgggcatgacccgcgcccacgaagagagccaggggaccgtctttgga tcgggggacccaggctgcagccttgcaggtctgaaggaagccgggaggggcgccgaggcc cgccctaaaggcttggatcaacagggctgcccctggcacatttgtacagcctatttccca gagaggcagagtgaaggagctggcatgtgggcaccagttcacaacctcccacaccacaca aaagacaagcccagtcattattactgctccgtctcctccgtctgcatttctggaagtctg ccgcccattcatccctcacccttcgcctccgccaaaaccctcacgcagaccaactttccc tcaccatcgggcgccccttccactcctcaaattcacaaccaccctgccctggtctgccgg tgcctgtctcctccaagtgggtatgacatgctggctggtggtgatagccatggtatggat gttcctgcaaagatagagaagccttatgtgcttgacatcaaacagatacctgaggagtgc ctggtgcatttcagccattatttcttgagtatccctttgtcaggcatcagagttgccaag aagaataaggctaaggacggtccctaccctcaagcagctgccagtatagtaggaaaggca gacagtacatatcatgaaaattcctaccagaagcagagcctgtggatccagtttgaccct gtgcactcctggttcagcatcaacattggctgcaggtga >gi568815584r:74563723_74784681|GENSCAN_predicted_peptide_3|145_aa MVTITPKDTGPECTEQVKEQRQERAEMVQTTFTESWWCFTATVLGPLAWSVKMGCAHGKW PLAQPWKQQLEEACGLSLSTRLLLCKTEARHGPETKPVSSFFQSLPLLPQSTPADLPLTR ATRGLYRNQTRRQGPELEADVWDDE >gi568815584r:74563723_74784681|GENSCAN_predicted_CDS_3|438_bp atggtcacaattacccccaaggacacgggcccagagtgcacagaacaagtaaaggaacaa aggcaggaacgagctgagatggttcagaccacgttcactgagagctggtggtgcttcaca gccaccgtccttgggcctctggcctggtcagtaaaaatgggctgtgcccatggaaagtgg cctctggctcagccctggaagcaacagctggaagaggcctgtggcctgtcactatccaca cggttgcttctctgcaagacagaggcccgccacggacctgaaactaaacctgtaagcagc ttcttccagagccttcctcttcttccccagtccacacctgcagacctgcccctcacccgt gccacccgcggtttgtacaggaatcagacaagaaggcaagggcccgaattggaagcagat gtttgggacgatgagtga >gi568815584r:74563723_74784681|GENSCAN_predicted_peptide_4|827_aa MWESLKLPGDLLNDFEQNGDDMDIEIQAEVVSDGDEEFVGNWSKGGITVSVVAFFFTIKF LFELAARVVSFLQNEDRERRGDRTIYDYVRGNYLDPRSCKVSWDWKDPYEVGHSMAFRVH VSFFSPWQLFYKNGQPFPAHRPVGLRVHISHVELAVEIPVTQEVLQEPNSNVVKVAFTVR KAGRYEITVKLGGLNVAYSPYYKIFQPGMVVPSKTKIVCHFSTLVLTCGQPHTLQIVPRD EYDNPTNNSMSLRDEHNYTLSIHELGPQEEESTGVSFEKSVTSNRQTFQVFLRLTLHSRG CFHACISYQNQPINNGEFDIIVLSEDEKNIVERNVSTSGVSIYFEAYLYNATNCSSTPWH LPPMHMTSSQRRPSTAVDEEDEDSPSECHTPEKVKKPKKVYCYVSPKQFSVKEFYLKIIP WRLYTFRVCPGTKFSYLGPDPVHKLLTLVVDDGIQPPVELSCKERNILAATFIRSLHKNI GGSETFQDKVNFFQRELRQVHMKRPHSKVTLKVSRHALLESSLKATRNFSISDWSKNFEV VFQDEEALDWGGPRREWFELICKALFDTTNQLFTRFSDNNQALVHPNPNRPAHLRLKMYE FAGRLVGKCLYESSLGGAYKQLVRARFTRSFLAQIIGLRMHYKYFETDDPEFYKSKVCFI LNNDMSEMELVFAEEKYNKSGQLDKVVELMTGGAQTPVTNANKIFYLNLLAQYRLASQVK EEVEHFLKGLNELVPENLLAIFDENELELLMCGTGDISVSDFKAHAVVVGGSWHFREKII AAPTHSTLPTAHTCFNQLCLPTYDSYEEVHRMLQLAISEGCEGFGML >gi568815584r:74563723_74784681|GENSCAN_predicted_CDS_4|2484_bp atgtgggaaagtttgaaacttcctggagacttgttgaatgactttgaacaaaatggtgat gatatggacattgaaatccaggctgaggtggtctcagatggagatgaggaatttgttggg aactggagcaaaggtggaatcacagtgtctgtggttgcattcttcttcacaattaagttc ctctttgagcttgccgcacgtgtagtcagcttcctccagaatgaggaccgcgagcgccga ggggaccggactatttatgactacgtgcggggaaattacctggatccccggtcttgcaaa gtctcctgggattggaaggacccctatgaggtgggccacagcatggccttccgagtgcat gtgtccttcttctccccatggcagttattctataagaacgggcagcctttccctgcacat cggcctgtgggactaagagttcacatctctcatgtcgagctagcagtggaaattccagtg acccaggaagtccttcaggagcccaattccaacgtagtaaaagtggccttcactgtgcgc aaggctgggcgttatgaaatcacagtgaagcttggtggattaaatgtggcatatagtccc tactacaaaatttttcaacctggaatggtggttccttctaagaccaaaattgtgtgccac ttttctactcttgtattgacctgtgggcagccgcacacccttcaaatagtaccccgagat gagtatgataatcccaccaacaattccatgtccttgagagatgagcacaattacaccttg tccattcatgagctcggccctcaagaagaagagagtactggtgtctcatttgagaaatca gtaacatccaacaggcagactttccaggtgttcttgcgactcaccctgcattctcgaggc tgcttccatgcttgcatttcataccaaaatcagccaatcaataatggtgaatttgacatt attgtcctaagtgaggatgagaagaatatcgtcgaacgcaatgtgtccacttcaggcgtg agcatttactttgaggcttatctttataatgctaccaactgtagcagcactccatggcac ctgccacccatgcacatgacctcttcccagcgccggccatccactgctgttgacgaggaa gatgaagactcgccctctgagtgccacacccctgagaaggtgaagaaaccgaagaaggtg tactgctatgtgtcaccaaagcaattctcagtgaaggagttctacctgaagatcatcccc tggcgcctttacaccttccgagtgtgtccaggaacaaaattttcataccttggtcctgac cctgtccataagctgctcacactggtggtggatgatggcattcaacctcctgtggagctc agctgtaaggagaggaacattctagcagccacttttatccgctccctgcataagaacata ggaggctctgagacctttcaggacaaggtgaactttttccagcgagagcttcggcaggta catatgaaaagaccacattccaaagtcaccctgaaggtcagcagacatgccttgttggaa tcgtctctgaaagccactcggaatttctccatctcagattggagcaagaactttgaggtt gttttccaggatgaagaagctctggactggggagggcctcgccgggaatggtttgagcta atctgcaaagcactatttgataccaccaatcagctcttcacccggttcagtgacaacaac caagcattagtgcatcccaaccctaatcgccccgctcatctgcgcctgaaaatgtatgag tttgcgggacggctcgtgggcaagtgtctctatgagtcctctctaggaggagcctacaag cagttggtccgagctcgcttcacccgctctttcctggcccaaatcataggactgcgtatg cattacaagtactttgaaacagatgacccagaattctacaaatctaaagtttgttttatc ctcaacaatgacatgagtgagatggagctggtctttgcagaagagaaatataataaatca ggtcaattggataaggttgtagaactcatgacaggtggagctcaaactccagtcaccaat gcgaataaaatcttctatttaaatttgctggcccaatatcggctggccagtcaagtgaaa gaggaggtggaacatttcctaaaaggcctgaatgaattggtccctgagaaccttttggct atttttgatgagaatgagcttgagctgctgatgtgtgggactggagacatcagtgtgtct gacttcaaagcccatgcagtagttgttggtggctcatggcatttcagagaaaagattatt gccgctccgacccatagcacgctgcctactgcacacacatgttttaaccagctgtgcctc cctacatatgactcctatgaagaggtgcacaggatgctgcagctggccatcagcgagggt tgcgagggctttggcatgctctga >gi568815584r:74563723_74784681|GENSCAN_predicted_peptide_5|390_aa MKAEINMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELE KQEQTHSKASRSQEITKIRAGLKEIETQKTLQKINESRSWFFETINKTDKPLPRLIKKKR QKNQIDAIKNDKGDIITDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEE VESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYHRYKEELVPFLLKLFQSTEKEGIL PNSFYEASIILIPKPGRDTTKKENFRPISLMNTDAKILNKIRENQIQQHIKKLIHHDQVG FIPGMQGWFNICKSINVIQHINRTKDKNHMIISIDAEKAFGKSQHRFMLKTLNKLGQRRW NFVALQPPARAHIHLDGLIVYISVGTEFLM >gi568815584r:74563723_74784681|GENSCAN_predicted_CDS_5|1173_bp atgaaggcagaaataaatatgttctttgaaaccaacgagaacaaagacacaacataccag aatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcactaaatgcccac aagagaaagcaggaaagatcgaaaattgacaccctaacatcacaattaaaagaactagaa aaacaagagcaaacgcattcaaaagctagcagaagtcaagaaataactaaaatcagagca ggactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatccaggagctgg ttttttgaaacgatcaacaaaactgataaaccgctaccaagactaataaagaaaaaaaga cagaagaatcaaatagacgcaataaaaaatgataaaggggatatcatcaccgatcccaca gaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaactagaaaat ctagaagaaatggataaattcctcgacacatacactctcccaagactaaaccaggaagaa gttgaatctctgaatagaccaataacaggagctgaaattgtggcaataatcaatagctta ccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccacaggtacaag gaggaactggtaccattccttctgaaactattccaatcaacagaaaaagagggaatactc cctaactcattttatgaggccagcatcatcctgataccaaagccgggcagagacacaacc aaaaaagagaattttagaccaatatccttgatgaacactgatgcaaaaatcctcaataaa atacgggaaaatcaaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggc ttcatccctgggatgcaaggctggttcaatatatgcaaatcaataaatgtaatccagcat ataaacagaaccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggccttt ggcaaaagtcaacatcgcttcatgctaaaaactctcaataaattaggacagcggaggtgg aacttcgtcgcgctgcaacccccggctcgggcacacattcatttagatggcttaatagtt tacatatctgtgggcactgaatttttgatgtag >gi568815584r:74563723_74784681|GENSCAN_predicted_peptide_6|451_aa MYPNWGRYGGSSHYPPPPVPPPPPVALPEASPGPGYSSSTTPAAPSSSGFMSFREQHLAQ LQQLQQMHQKQMQCVLQPHHLPPPPLPPPPVMPGGGYGDWQPPPPPMPPPPGPALSYQKQ QQYKHQMLHHQRDGPPGLVPMELESPPESPPVPPGSYMPPSQSYMPPPQPPPSYYPPTSS QPYLPPAQPSPSQSPPSQSYLAPTPSYSSSSSSSQSYLSHSQSYLPSSQASPSRPSQGHS KSQLLAPPPPSAPPGNKTTVQQEPLESGAKNKSTEQQQAAPEPDPSTMTPQEQQQYWYRQ HLLSLQQRTKVHLPGHKKGPVVAKDTPEPVKEEVTVPATSQVPESPSSEEPPLPPPNEEV PPPLPPEEPQSEDPEEDARLKQLQAAAAHWQQHQQHRVGFQYQGIMQKHTQLQQILQQYQ QIIQPPPHIQASASIVHNRKLPQDIERICYS >gi568815584r:74563723_74784681|GENSCAN_predicted_CDS_6|1356_bp atgtacccgaattggggccggtatggcgggagcagccactatccgccgccaccggtccca ccgccgccgccagtggcgcttcctgaggcctcgccggggcccgggtactcgagctcgacg actcccgcggccccctcctcctcgggcttcatgagcttccgcgaacagcacttggcgcag ctccagcagctgcagcagatgcaccagaagcaaatgcagtgcgtgcttcagccccaccac cttcctccgccccctctgccgcccccgccagtgatgccggggggcggctacggagactgg cagccgccaccgccaccgatgcccccgccacccgggccggccctcagctatcagaagcag cagcagtacaaacaccagatgctccaccaccaacgagacgggcctcctggtttggttcca atggagctggaatccccccctgaatctccccctgtgccgcctgggtcctatatgccccca tctcagtcttacatgcccccacctcagccgccaccctcttactaccccccgacctcatct cagccctacctgcctcctgctcagccgtccccttcgcagtccccaccttcccaatcctac ctggcgcccaccccttcttactcatcctcctcctcttcctcgcaatcctatttgagccat tcccagtcctacttgccctcttctcaggcatctccttcccgcccctcccagggccattct aaatcccaactactagctccaccaccaccgtccgccccccctggaaataagacaactgtc cagcaagagcctttggagagtggggccaaaaacaagagtactgaacagcagcaagccgcc cctgagccagatccctctacgatgactccacaggaacagcagcagtattggtatcgacag cacttgcttagtttgcaacagaggacaaaagttcatttgccaggacacaaaaagggtcct gtggtagcaaaggatacaccagagccggtaaaagaagaagttacagtacctgccaccagt caagttccagaatctccttcttctgaggagcccccattgccacctccaaatgaggaagtg ccacctcctctcccacctgaggaaccccagtctgaggacccagaagaagatgccaggtta aagcagttgcaggctgcagcagcacactggcagcagcaccagcagcatcgagtcggtttc cagtatcagggaataatgcagaagcacactcagttacagcagattctacaacagtatcag cagattatacagcccccaccacatatacaggcaagtgcttccatagtccacaataggaag ttgccccaagacattgaaaggatttgttattcttga