GENSCAN 1.0 Date run: 6-Nov-116 Time: 22:41:44 Sequence gi568815584f:63501509_63701808 : 200300 bp : 42.20% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 26822 26861 40 -1.45 1.01 Sngl + 41299 41715 417 0 0 72 49 375 0.860 26.05 1.02 PlyA + 43115 43120 6 1.05 2.00 Prom + 53580 53619 40 -6.15 2.01 Init + 59008 59033 26 0 2 77 58 43 0.237 -0.75 2.02 Intr + 61939 62054 116 1 2 21 91 115 0.649 4.27 2.03 Intr + 66374 66750 377 0 2 69 65 329 0.481 22.51 2.04 Term + 74879 74977 99 1 0 93 32 84 0.428 0.45 2.05 PlyA + 75413 75418 6 1.05 3.03 PlyA - 77738 77733 6 1.05 3.02 Term - 93474 92850 625 2 1 -11 36 760 0.249 53.77 3.01 Init - 93766 93555 212 1 2 66 -3 318 0.999 16.81 3.00 Prom - 95875 95836 40 -6.65 4.02 PlyA - 96161 96156 6 1.05 4.01 Sngl - 98434 97271 1164 1 0 70 44 806 0.999 70.28 4.00 Prom - 107844 107805 40 -5.45 5.00 Prom + 111628 111667 40 -6.45 5.01 Sngl + 117543 117740 198 2 0 78 49 184 0.931 8.42 5.02 PlyA + 118628 118633 6 1.05 6.00 Prom + 120228 120267 40 -5.05 6.01 Init + 134199 134424 226 2 1 84 7 104 0.041 0.58 6.02 Intr + 139881 140143 263 1 2 36 7 203 0.112 3.28 6.03 Term + 149824 149949 126 0 0 120 39 123 0.503 7.90 6.04 PlyA + 151252 151257 6 1.05 7.03 PlyA - 152456 152451 6 1.05 7.02 Term - 161887 161382 506 2 2 40 53 237 0.734 9.02 7.01 Init - 163078 163072 7 1 1 102 100 1 0.701 3.88 7.00 Prom - 175656 175617 40 -4.55 8.04 PlyA - 176666 176661 6 1.05 8.03 Term - 182614 182537 78 1 0 76 51 92 0.456 1.08 8.02 Intr - 185148 184657 492 0 0 103 12 278 0.451 13.97 8.01 Init - 198523 198458 66 1 0 68 116 -5 0.353 1.43 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 100001 100222 222 1 0 86 38 275 0.989 17.20 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:63501509_63701808|GENSCAN_predicted_peptide_1|138_aa MICGLEGRETGEVAAAGAGGGTGTCRGRSGQLRGAWGDGCPVRGPSGSAAAEPLGLTAGP EPQRSGGGGGLTILPRGPVPSNRHRPRCGSAGGSSLTFLAGGASARCPGPGGGGDDVGRG GREGGRLDLQRQQEKGTE >gi568815584f:63501509_63701808|GENSCAN_predicted_CDS_1|417_bp atgatctgtggcttggaggggcgggagacgggcgaggtggccgcagcgggggcgggcggc gggaccgggacgtgcagggggaggtccgggcagctgcggggagcctggggcgacggctgt ccggtacggggtccctccggttccgcggcggcggagcctctaggcctcacggccggaccc gaacctcagcgttccggtggcggcggcggcctcacgatcctcccccgcgggcccgtccca tcaaatcggcaccgaccccgttgcggctccgccggtggctcctcgctcacattcctggcg ggcggcgcctcagctcgttgccccggacccggcggcggcggcgacgacgtggggaggggg ggaagggagggaggaagactggatctgcagcggcagcaagagaaggggacagaataa >gi568815584f:63501509_63701808|GENSCAN_predicted_peptide_2|205_aa MLPVQPAEPTLKTLTKIVKINFFGTLYNNKRFAASPGAFIREKWLNLVFQEIPSGNRCFL SPTICSSLAFQLGPGRIALAKKDGKKKSLSAISIHKRIHGVGFKKCAPRALEEIQKCAMK EMGTPDVHIDTRLNKAVWAKGIRIVPYRIPVWLSRKRNEDENSTNKLYTLVTHEPHYLAM TDKDRVRVLNLDINLAMQMYSLIDA >gi568815584f:63501509_63701808|GENSCAN_predicted_CDS_2|618_bp atgcttcctgtacaacctgcagaaccaacactgaaaacactgacaaaaattgtcaaaatc aatttctttggaacactgtacaataacaaacgttttgcagcaagtccaggagcatttatt cgagaaaaatggctgaatcttgtgtttcaagaaataccttcagggaaccgctgctttctt tctcccacaatctgtagttctcttgctttccaacttgggcccggcaggatcgctcttgca aagaaggatggcaagaagaagagcctttctgccatcagcattcacaagcgcatccatggt gtgggcttcaagaagtgtgcccctcgggcactcgaagagatccaaaaatgtgccatgaag gagatgggcactccagatgtgcacattgatactaggctcaacaaagctgtttgggccaaa ggaataaggattgtcccataccgtatccctgtgtggttgtccagaaaacgtaacgaggat gaaaattcaacaaacaagctctatactttggttacccatgagcctcattacttggccatg actgataaggaccgagttagagtgctgaaccttgacattaatctggccatgcaaatgtat tcattaatagatgcttaa >gi568815584f:63501509_63701808|GENSCAN_predicted_peptide_3|278_aa MWAGNAWRAALSGVPCGRSAQSVLAQLRGILEGELEGIRGAGTWKSERVITSRQGPHIHV DGVSGGILNLTSVRFIRGTQSIHKNLEAKIARFHQREDAILYPSCCDANAGLFEVLLRPE DAVLSDELNCASIIHGICLCKAHKYHYCHLDVAYLETKLQEAQKHRLFLVATDGAFSMDG DIVPLQKICRLASRYGALVFVDECHATGFLGLTGQGTDELLGVMGQVTIINSTLGKALGG ASGGYTTGPGPLVSLLLPSPISSPTVCHLLLLAAPLRP >gi568815584f:63501509_63701808|GENSCAN_predicted_CDS_3|837_bp atgtgggctggaaacgcctggcgcgccgcgctttccggggtgccgtgcggccgcagcgcg cagtcagtactggcccagctgcgcggcattctggagggggagctggaagggatccgcgga gctggcacctggaagagtgagcgggtcatcacgtcccgtcaggggccgcacatccacgtg gacggcgtctccggaggaatcctcaacttaacctcggtccgcttcatccgtggaacccaa agcatccacaagaatctagaagcaaaaatagcccgcttccaccagcgggaggatgccatc ctctatcccagctgttgtgacgccaacgccggcctctttgaggtcctgctgagacccgag gacgcagtcctgtcggacgagctgaactgtgcctccatcatccacggcatctgcctgtgc aaggcccacaagtaccactattgccacctggacgtggcctacctagaaaccaagcttcag gaggcccagaagcatcggctgttcctggtggccaccgatggggccttttccatggatggc gacatcgtgcccctgcagaagatctgccgcctcgcctctagatatggcgccctggtcttt gtggatgaatgccatgccactggtttcctgggactcacaggacagggcacagatgagctg ctgggtgtgatgggccaggtcaccatcatcaactccaccctggggaaggccctgggtggt gcatcagggggctacacgacagggcctgggcccctggtgtcactgctgctgcccagccct atctcttctccaacagtctgccacctgctgttgttggctgcacctctaaggccctag >gi568815584f:63501509_63701808|GENSCAN_predicted_peptide_4|387_aa MEKIEEQFANLHIVKCSLGTKEPTYLLGIDTSKTVQAGKENLVAVLCSNGSIRIYDKERL NVLREFSGYPGLLNGVRFANSCDSVYSACTDGTVKCWDARVAREKPVQLFKGYPSNIFIS FDINCNDHIICAGTEKVDDDALLVFWDARMNSQNLSTTKDSLGAYSETHSDDVTQVRFHP SNPNMVVSGSSDGLVNVFDINIDNEEDALVTTCNSISSVSCIGWSGKGYKQIYCMTHDEG FYWWDLNHLDTDEPVTRLNIQDVREVVNMKEDALDYLIGGLYHEKTDTLHVIGGTNKGRI HLMNCSMSGLTHVTSLQGGHAATVRSFCWNVQDDSLLTGGEDAQLLLWKPGAIEKTFTKK ESMKIASSVHQRVRVHSNDSYKRRKKQ >gi568815584f:63501509_63701808|GENSCAN_predicted_CDS_4|1164_bp atggaaaagattgaggaacaatttgctaatctgcacattgttaaatgttccttaggaacc aaagagcccacttaccttcttggtatagacacatcaaagactgtccaagcaggaaaggaa aacttggttgctgttttatgttctaatggatcaatcagaatatatgataaagaaaggtta aatgtactacgagaatttagtggatatcctggacttcttaatggagtcagatttgcaaat tcctgtgacagtgtatattcagcatgtactgatggcactgtgaaatgctgggatgctcga gtagccagagaaaaacctgttcagctcttcaagggttacccttccaatatttttatcagt tttgatattaattgtaatgatcatattatttgtgctggtacagaaaaagttgatgatgat gcattgttggtgttttgggatgcaaggatgaattctcagaatttatctacaactaaagac tcacttggtgcatattcagagacacatagtgatgatgtcactcaagtacgtttccatccc agcaatcccaacatggtagtctcaggttcatctgatggcctggtaaatgtatttgatatt aatattgataatgaggaggatgcactggttacaacctgtaactcaatttcatcagtaagc tgtattggttggtctgggaaaggttataaacagatttactgcatgacacatgatgaagga ttttattggtgggatcttaatcatctggacactgatgaaccagttacacgtttgaacatc caggatgtcagagaagtagttaacatgaaagaagatgctttggactatttgattggtggc ctatatcatgaaaagacagacacattgcatgttattggaggaacaaacaaaggaaggatt catttgatgaactgcagcatgtcaggactgacccatgtgactagccttcagggagggcat gctgctacagtccgttctttctgttggaatgtgcaagatgattctttgttgactggagga gaagatgcacagttgttactttggaaacctggagctatagagaagacctttacaaagaaa gagagtatgaaaatagcatcctctgtgcaccaacgagtacgagttcatagtaatgattct tataaaagaaggaaaaagcagtga >gi568815584f:63501509_63701808|GENSCAN_predicted_peptide_5|65_aa MERMPPYFCQALRCGKSTNGRQVVGKSMERNLLLGPAGTEWSSNKEKTLQHPRPDAKHKI VTAHS >gi568815584f:63501509_63701808|GENSCAN_predicted_CDS_5|198_bp atggagcgaatgcctccctacttctgccaagccttacgctgtggaaagagcaccaacggt cggcaggtggtggggaagagcatggagagaaacctcctgttggggcctgctgggactgag tggagcagcaacaaggagaaaactctccagcatcccaggcctgacgctaagcacaagata gtaacagctcatagctga >gi568815584f:63501509_63701808|GENSCAN_predicted_peptide_6|204_aa MVNYLENCKDSSRKLLELIKESSKTFGYKINVHKSVALLYINSDQAENQIKNSTPFTIAA KTNKQTKLRNIPNRLVKICFLAYKRIILNPFSGGESIRGEERAGGNRSFPENEQGRLAFI EKRPRGSTGNCTGNSSLTRRSFFGQGQHRFDLVAPTAERDCQHEENDHEGQMAVKQEELA SGSEENAVPLPHGLRCPACSEKLL >gi568815584f:63501509_63701808|GENSCAN_predicted_CDS_6|615_bp atggtcaattaccttgaaaactgtaaggactcctccagaaagctcctagaactgataaaa gaatccagcaaaacttttggatacaagattaatgtacacaaatcagtagctcttctatac atcaacagcgaccaagcagagaatcaaatcaagaactcaaccccttttacaatagctgca aaaacaaacaaacaaacaaaacttaggaatatacctaacaggctggtgaaaatctgcttt ctggcctacaaaaggattatactaaacccttttagtggtggggagagtatccgtggagag gaaagagcaggaggaaacaggtctttccctgaaaatgagcaagggcgtttggcttttatt gaaaaacgccccagaggaagtactgggaactgcaccggcaacagcagcctgacccgcaga agtttctttggccagggccagcacaggtttgacctggtggctccgacggcggagcgagac tgccaacacgaagagaatgatcatgaagggcaaatggcagtgaagcaagaagagctggct tcaggctcagaagagaatgctgtgcctcttcctcatggtttgcggtgccctgcatgttca gagaaacttctctag >gi568815584f:63501509_63701808|GENSCAN_predicted_peptide_7|170_aa MTIKLFFALLTLHLSTYLILSGHRTRTQDLPHGGSKRAVTQTGLKHTPCLPHCRRQGEKS CGPLMSSDLEAPQAKAVTYSLTLHFLASPSFWAAPRSPVPAVEATCGTPGSAAASQGASA RGGAWSCLPYCSRCAWLFAVAGPRAHLLTHLSPLHSTHLRRCGILAGSMS >gi568815584f:63501509_63701808|GENSCAN_predicted_CDS_7|513_bp atgaccataaagctcttcttcgccttgctcaccctccacttgtccacgtacctcattctt tctggacacaggacgagaactcaggacctgccgcatggcgggtctaaaagagctgtaaca caaacagggctgaaacacaccccttgcttgccacattgcaggagacaaggagagaagagc tgtggccctttgatgagctcagacctagaagctccccaagccaaggctgtgacatactct ttgacactgcatttcctggcatctccaagcttctgggcagcaccacgttccccagtgcca gctgtggaagctacttgtggtacacctggttcagctgcagcttcgcagggagccagtgcc cgtggtggtgcctggagctgcctgccctactgcagccgatgtgcctggctgtttgcagtg gctggaccccgtgctcacttgctcacacacctgtcaccactccactccactcatctcaga aggtgtgggatcttggcaggtagcatgagctga >gi568815584f:63501509_63701808|GENSCAN_predicted_peptide_8|211_aa MIMLPHISLGDRVRPCLKKKEKDIIAGFLYTILILAVFYPFVDLIDNFNQTHKYAPFIII GLHLALGIFSFTLDTWSTSRGDTAEILGSGAGIACGSHVTYNMGLVLDPSLDTLPLAGPP ITVTLFGKAILRILIGMVFVLIIRDVMKKITIPLACKIFNIPCDDIRKARQHMEVELPYR YITYGMHIFPLKRFVQNQSAVLAIEDIEEVL >gi568815584f:63501509_63701808|GENSCAN_predicted_CDS_8|636_bp atgattatgctgccgcacatcagccttggtgacagagtaagaccctgtctcaaaaaaaag gaaaaggatattattgctggattcctatataccattttaatcttagctgtcttctatcca tttgtggacctgattgacaacttcaaccaaactcacaaatatgctccattcatcatcatc gggcttcatttagctttggggatcttttctttcactcttgacacctggagcacatcccga ggagacacagccgagatactaggaagtggtgctggaattgcatgtggatctcatgttact tataacatgggtctagtattagatccttctctagatacattacctttagctgggcccccc attactgtgactctgtttggaaaagccatattgcggatcctcatagggatggtatttgta ctaataatcagagatgtaatgaaaaagatcaccattcctttagcctgcaaaatcttcaat ataccgtgtgatgatattcgaaaagcaagacagcacatggaagttgaacttccttatcgg tatattacctatggaatgcacatatttcctctgaagcgttttgtccaaaatcaaagtgct gttttggcaatagaagacattgaagaagttctttga