GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:51:29 Sequence gi568815575f:91335578_91536723 : 201146 bp : 35.88% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 780 819 40 -3.65 1.01 Init + 3029 3133 105 1 0 61 81 94 0.608 6.27 1.02 Term + 11526 11675 150 2 0 74 38 119 0.741 2.43 1.03 PlyA + 13609 13614 6 1.05 2.03 PlyA - 13883 13878 6 1.05 2.02 Term - 20323 20167 157 0 1 47 54 104 0.871 -0.68 2.01 Init - 23875 22821 1055 1 2 37 39 443 0.632 28.01 2.00 Prom - 26258 26219 40 -3.65 3.00 Prom + 41185 41224 40 -6.05 3.01 Sngl + 61227 61628 402 2 0 66 54 172 0.508 7.62 3.02 PlyA + 61947 61952 6 1.05 4.00 Prom + 62535 62574 40 -6.25 4.01 Init + 64335 64437 103 2 1 78 73 100 0.251 7.95 4.02 Intr + 78549 78643 95 1 2 72 98 56 0.866 3.76 4.03 Term + 78800 78889 90 1 0 121 38 59 0.880 0.94 4.04 PlyA + 80626 80631 6 1.05 5.03 PlyA - 80889 80884 6 1.05 5.02 Term - 86827 86655 173 2 2 43 49 191 0.491 7.71 5.01 Init - 88469 88412 58 2 1 54 100 26 0.765 1.92 5.00 Prom - 88543 88504 40 -5.45 6.00 Prom + 92065 92104 40 -6.35 6.01 Sngl + 100001 101149 1149 1 0 80 54 1142 0.945 106.07 6.02 PlyA + 101381 101386 6 1.05 7.00 Prom + 106832 106871 40 -3.95 7.01 Init + 128065 128215 151 0 1 63 81 114 0.985 8.45 7.02 Term + 135066 136276 1211 1 2 14 48 723 0.997 51.72 7.03 PlyA + 136343 136348 6 1.05 8.02 PlyA - 139778 139773 6 1.05 8.01 Sngl - 143127 142732 396 0 0 70 37 164 0.877 5.50 8.00 Prom - 143851 143812 40 -6.15 9.00 Prom + 153984 154023 40 -5.15 9.01 Init + 157486 157699 214 0 1 80 75 82 0.633 5.09 9.02 Intr + 160730 160810 81 0 0 41 103 140 0.446 9.49 9.03 Intr + 175892 176053 162 0 0 73 71 90 0.024 4.83 9.04 Intr + 178364 178611 248 0 2 41 41 202 0.002 7.06 9.05 Term + 198613 198729 117 0 0 9 48 112 0.002 -3.14 9.06 PlyA + 200594 200599 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 176139 176273 135 2 0 48 35 212 0.949 8.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:91335578_91536723|GENSCAN_predicted_peptide_1|84_aa MGDITTDTKEASLSEVNIRGSLSHAKKIKDVDTQEAPEHSGHGGRNGGYTLAQQHGLQLT KVDMATATTEYHLSGAETNTEPSI >gi568815575f:91335578_91536723|GENSCAN_predicted_CDS_1|255_bp atgggagatattacaactgatactaaagaggcttcattgtctgaggtaaatatccgaggt tcattgtctcatgccaagaaaatcaaggacgtggacacacaagaagcccctgaacacagt ggccatggtggcaggaatggaggttacacattggctcagcaacatggacttcaactaacc aaggtggacatggctacagccaccactgagtaccatttgtcaggagcagagaccaacact gagccctcaatatag >gi568815575f:91335578_91536723|GENSCAN_predicted_peptide_2|403_aa MNIDAKILNKILAKRIQQHIKKLIHHDQVGFIPGMQGCFKIRKSINIIQHINRAKDKNHM IISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKT GTRHGCPLSPLLFNIVLEVLARAIRQEMEIKGIQLGKDKVKLSLFADDMIVCLENPIVSA QNLLKLISNFSKVSGCKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTR DVKDLFKNYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMPF FTELEKTTLKFIWNQKRAHIAKSILSQKSKAGGITLPDFKLYYKATVTKTAWFFCGHKFS THLSQYQGVWLLDDMARVCFVLKETAKQSSKVAAIFCIPISDE >gi568815575f:91335578_91536723|GENSCAN_predicted_CDS_2|1212_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaaacgaatccagcagcacatc aaaaaacttatccaccatgatcaagtgggcttcatccctgggatgcaaggctgcttcaaa atacgcaaatcaataaacataatccagcatataaacagagccaaagacaaaaaccacatg attatctcaatagatgcagaaaaggcctttgacaaaattcaacagcccttcatgctaaaa actctcaataaattaggtattgatgggacgtatttcaaaataataagagctatctatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaact ggcacaagacatggatgccctctctcaccactcctattcaacatagtgttggaagttctg gccagggcaattaggcaggagatggaaataaagggtattcaattaggaaaagacaaagtc aaattgtccctgtttgcagacgacatgattgtgtgtctagaaaaccccattgtctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatgcaaaatcaatgta caaaaatcacaagcattcttatacaccaacaacagacaaacagagagccaaatcatgagt gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg gatgtgaaggacctcttcaagaactacaaaccactgctcaatgaaataaaagaggataca aacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggcc atactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatgcctttc ttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccacatc gccaagtcaatcctaagccaaaagagcaaagctggaggcatcacactacctgacttcaaa ctatactacaaggctacagtaaccaaaacagcatggttcttctgtggacataagttttca actcacttgagtcaataccaaggagtctggttgctggatgacatggcaagagtatgtttc gttttgaaagaaactgccaaacagtcgtccaaagtggctgcaatattctgcattcccatc agtgatgaatga >gi568815575f:91335578_91536723|GENSCAN_predicted_peptide_3|133_aa MVLNQVKLAEMTDVEFRIWIEMKIIEIQEKVSIQSMESKKTIQEMKDKRTILRKHQTDLI GLKNSLQEFYNTIRSINSRIDQAEERTSELKYWFCESTQSAKEYKVMNKASKRHEIMYRD QNYDSLVSQKERE >gi568815575f:91335578_91536723|GENSCAN_predicted_CDS_3|402_bp atggttcttaaccaggttaaattggctgaaatgacggatgtagaattcagaatctggata gaaatgaaaatcattgagattcaggaaaaagtctcaatccaatccatggaatccaagaaa acaatacaagagatgaaagacaaaagaaccattttaagaaagcaccaaactgatctgata gggctcaaaaactcactacaagaattttataatacaatcagaagtattaacagtagaata gaccaagctgaggaaagaacttcggagctcaaatactggttctgtgaatcaactcaatca gccaaagaatacaaagtaatgaacaaagcctctaaaagacatgagattatgtacagagac caaaactatgactcactggtgtcccagaaagagagggaatga >gi568815575f:91335578_91536723|GENSCAN_predicted_peptide_4|95_aa MDPNQEEISDLPEKEFRSLVIKLIREAPEKGEAQSKLKRLKSWVYKSQLKRAPPGSWNCT PVGDSKENSWDNNLQMALAKACALARNQIIVGFVI >gi568815575f:91335578_91536723|GENSCAN_predicted_CDS_4|288_bp atggatccaaaccaagaagaaatatctgatctacctgaaaaagaattcagaagtttagtt attaagctaatcagagaggcaccagagaaaggggaagcccaatcaaagctcaagagactc aaatcttgggtctacaagtcacaacttaagagggcccctccaggctcctggaactgcaca cctgttggagactctaaggaaaattcatgggacaataacctgcagatggctttagcaaaa gcttgtgctctagcaagaaaccaaataattgttggttttgtgatctaa >gi568815575f:91335578_91536723|GENSCAN_predicted_peptide_5|76_aa MSQLSSPGPALDTGIITIQARSSSPSALVSAPSESPPLLSQHASDDSSSHPGIRGLLYLV VYGHRAGSGGTDFKAS >gi568815575f:91335578_91536723|GENSCAN_predicted_CDS_5|231_bp atgagtcaattatcttcacctggtcccgcccttgacacggggattattacaattcaagct aggagttcatctccatcagctctagtatctgctccttcagaatctccacctcttctctca cagcatgcatcagatgattcttcatcacatccagggatccgtggcttgctctatcttgtt gtctacggccaccgtgctggctctggaggcactgactttaaagctagctag >gi568815575f:91335578_91536723|GENSCAN_predicted_peptide_6|382_aa MGSGEPNPAGKKKKYLKAALYVGDLDPDVTEDMLYKKFRPAGPLRFTRICRDPVTRSPLG YGYVNFRFPADAEWALNTMNFDLINGKPFRLMWSQPDDRLRKSGVGNIFIKNLDKSIDNR ALFYLFSAFGNILSCKVVCDDNGSKGYAYVHFDSLAAANRAIWHMNGVRLNNRQVYVGRF KFPEERAAEVRTRDRATFTNVFVKNIGDDIDDEKLKELFCEYGPTESVKVIRDASGKSKG FGFVRYETHEAAQKAVLDLHGKSIDGKVLYVGRAQKKIERLAELRRRFERLRLKEKSRPP GVPIYIKNLDETINDEKLKEEFSSFGSISRAKVMMEVGQGKGFGVVCFSSFEEATKAVDE MNGRIVGSKPLHVTLGQARRRC >gi568815575f:91335578_91536723|GENSCAN_predicted_CDS_6|1149_bp atggggagcggggagcctaatcctgctggcaagaaaaagaagtatctcaaggccgctctg tacgtgggtgacttggacccagatgtcaccgaggacatgctctataagaagttcaggcct gctggccctctgcgattcacccgaatctgccgtgatccggtgacccgcagccccctgggc tatgggtatgttaacttccgctttcccgcggatgcagagtgggccttgaacaccatgaat tttgatttgattaatggaaaaccattccgccttatgtggtctcagccagatgaccgctta agaaagtctggagtgggaaatatattcatcaaaaacctggacaaatccatagacaatagg gccctgttttacttattttctgcttttgggaacattctgtcctgcaaagtcgtatgcgat gacaacggctctaagggttatgcctatgttcactttgacagcctggccgctgccaataga gccatctggcacatgaatggagtgcggctcaacaaccgccaggtgtatgttggcagattc aaattcccagaagagcgggcggctgaggtcagaaccagggatagagcaactttcaccaat gttttcgttaaaaacattggagacgacatagatgacgaaaaactgaaggaacttttctgt gaatatgggccaactgagagtgttaaagtaataagagatgccagtgggaaatctaaaggc tttggatttgtgagatatgagacacacgaggctgcccaaaaggctgtgctagacttgcat ggaaagtccatcgatggaaaagtcctctatgtagggcgagcacagaagaaaattgaacgc ctggctgagttgaggcggagatttgaacggctgaggttaaaagaaaaaagtcggccccca ggggtgcctatctatattaagaacttggatgagacaatcaatgatgaaaaactgaaggag gaattttcttcctttgggtcaattagtcgggccaaagtgatgatggaagtggggcaaggc aaaggatttggtgtggtctgcttttcctcttttgaagaggctaccaaagcagtggatgag atgaatggccgcatagtgggctccaagcccctgcatgtcaccctgggccaggccaggcgc aggtgctga >gi568815575f:91335578_91536723|GENSCAN_predicted_peptide_7|453_aa MVILTAAEKAFDKIKHTFMIKTLSKLGIRGNFLILIKNIYEKPTAKIIINANRFWSGPSA NSSKVAEEGPDCWKENQQTESNKININKKDTHAKTPSDRHQHQRSKVDKSTKMRKNQHKN AENTKNQNASFLPKDHNSLPAREQNWMEIEFDELTEVGFRKQVITNSIELKEHVLTKCKE AKNLDKRLRELLTRVTSLEKNINDLMELKNTARQLLEAYTSVNTQTDQAEERISEIEDQL NEIKCEDKIREKRLERNEQSLQEIWDYLKRSNLRLIGVPESDRENGTRLENTFQNIIWKN FPNLARQDNIQIQEIQRTSQRCSLRRATSRHIIVRFTQVEMKEKMLRVVREKGQVTHKGK PMRLTADLSAETLQATREWGPIFNNLKEKNFQPGISYPAKLSFISEGEIKSFTDKQMLRD LVTTSTALQEIPKDALNMEKKNWYQPLQKHTKI >gi568815575f:91335578_91536723|GENSCAN_predicted_CDS_7|1362_bp atggtcatattaacagctgcagaaaaagcatttgacaaaatcaaacacacgtttatgata aaaactctcagcaaactaggaatacgagggaacttcctcatcttgataaagaacatctat gaaaaacctacagctaaaattataattaatgcaaacaggttctggagtggaccttcagca aactccagtaaagttgcagaagaggggcccgactgttggaaggaaaaccaacaaacagaa agcaataaaatcaacatcaacaaaaaggacacccatgcaaaaaccccatccgatcgtcat caacatcaaagatcgaaggtagataaatccacgaagatgaggaaaaaccagcacaaaaat gctgaaaataccaaaaaccagaatgcctcttttcttccaaaggatcacaactccttgcca gcaagggaacaaaactggatggaaattgagtttgacgaattaacagaagtaggcttcaga aaacaggtaataacaaactccattgagctaaaggagcatgttctaacaaaatgcaaggaa gctaagaaccttgataaaaggttacgggaactgctaactagagtaaccagtttagagaag aacataaatgacctgatggagctgaaaaacacagcacgacaacttcttgaagcatacaca agtgtcaatacccaaactgatcaagcagaagaaaggatatcagaaattgaagatcaactt aatgaaataaagtgtgaagacaagattagagaaaaaagattggaaaggaacgaacaaagc ctccaagaaatatgggactatctgaaaagatcaaacctacgtttgattggtgtacctgaa agtgacagggagaatggaactaggttggaaaacacatttcagaatattatctggaagaac ttccccaacctagcaagacaggacaacattcaaattcaggaaatacagagaacatcacaa agatgctccttgagaagagcaacctcaagacacataatcgtcagattcacccaggttgaa atgaaggaaaaaatgttaagggtagtcagagagaaaggtcaggttacccacaaagggaag cccatgagactaacagcagatctctctgcagaaaccctacaagccacaagagagtggggg ccaatattcaacaatcttaaagaaaagaattttcaacctggaatttcatatccagccaaa ctaagctttataagtgaaggagaaataaaatcctttacagacaagcaaatgctgagggat ttggtcaccaccagtactgccttacaagagatcccgaaggatgcactaaatatggaaaag aaaaactggtaccagccactgcaaaaacataccaaaatctga >gi568815575f:91335578_91536723|GENSCAN_predicted_peptide_8|131_aa MIQKDKEGHYIIIKHSIQREELTFLNIFAPNTGVHRFVKQVLKVLKETDKHTIIVGDFDN PLTVLDGSLRQKINKDIHDLNSTLDQMNLTDISRTLYQQQHDIHTSHLHRAHTLKLTTQL AIKQQIQQRIK >gi568815575f:91335578_91536723|GENSCAN_predicted_CDS_8|396_bp atgatccaaaaagacaaagaagggcattacataatcataaagcattcaattcaacgagaa gagttaacctttctaaatatatttgcacccaacactggggtccacagatttgtaaaacaa gttcttaaagttcttaaagagactgataaacacacaataatagtgggagacttcgataac ccactgacagtattagatggatcattgaggcagaaaattaacaaagatattcatgaccta aattcaacacttgaccaaatgaacctaacagacatctccagaacactctaccaacaacag catgatatacatacttctcatctacacagggcacatactctaaaactgaccacacaattg gccataaaacaacaaattcaacaaagaatcaaataa >gi568815575f:91335578_91536723|GENSCAN_predicted_peptide_9|273_aa MEHRVEDTALGMSSEIVLCRVQPPKLYLAFMGQHLSACSFSRCTLQGVNGATILGSGGLW PSSHSSTKQLPVPNKIHESARTRTPTHAHADGSKSSKNASVADARQTGSGVDLQQTPTDL QLRVLTVRRKTNKQKGHPHQNPICTSPSSKTKEIQSTIREYHKHLYANKLENLEEMDKFL DTYTLPRLNQEEVESLNKPITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEELFLLGA STSQRLTKALTIVPGIAVGYSGPKALHLGGNEC >gi568815575f:91335578_91536723|GENSCAN_predicted_CDS_9|822_bp atggaacacagggtagaagacactgcactgggcatgagctctgagattgtactctgcagg gtacagccccccaagctgtaccttgctttcatgggtcagcatttgagtgcctgcagcttt tccaggtgcacactgcaaggtgtcaatggagctaccattctggggtctggaggattgtgg ccttcttctcacagctccactaagcaattaccagtaccgaacaaaattcatgaaagcgcg cgcacacgcacacccacacacgcacacgcagacggtagtaaaagcagtaaaaatgcctcc gttgctgatgccaggcaaacagggtctggagtggacctccagcaaactccaacagacctg cagctgagggtcctgactgttagaaggaaaactaacaaacagaagggacatcctcaccaa aaccctatctgcacgtcaccatcatcaaagaccaaagaaatacaatctaccatcagagaa taccataaacacctctatgcaaataaactagaaaatctagaagaaatggataaattcctc gacacatacaccctcccaagactaaaccaggaagaagttgaatctctgaataaaccaata acaggctctgaaattgaggcaataattaatagcttaccaaccaaaaagagtccaggacca gatggattcacagccgaattctaccagaggtacaaggaggagctgtttctcctgggagcc agcacatctcagagactcaccaaggccctcactatagtaccaggtattgctgttggttat tcagggcccaaggctcttcacttaggaggcaatgaatgctga