GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:52:13 Sequence gi568815593r:100711850_101002955 : 291106 bp : 35.41% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 312 307 6 1.05 1.07 Term - 40754 40714 41 0 2 110 48 48 0.051 -0.63 1.06 Intr - 63796 63569 228 2 0 56 82 107 0.098 3.82 1.05 Intr - 68363 68272 92 1 2 97 73 73 0.151 5.42 1.04 Intr - 78551 78425 127 0 1 78 100 66 0.142 5.62 1.03 Intr - 82803 82715 89 2 2 75 75 92 0.347 5.30 1.02 Intr - 83234 83087 148 0 1 95 53 42 0.376 -0.23 1.01 Init - 83936 83864 73 2 1 65 100 58 0.235 5.98 1.00 Prom - 91462 91423 40 -4.05 2.05 PlyA - 92179 92174 6 1.05 2.04 Term - 100280 99998 283 1 1 102 42 134 0.321 4.11 2.03 Intr - 144547 144254 294 0 0 111 110 271 0.449 26.80 2.02 Intr - 174751 174494 258 0 0 66 105 130 0.489 8.16 2.01 Init - 183980 183805 176 2 2 42 113 72 0.554 3.97 2.00 Prom - 186746 186707 40 -3.45 3.00 Prom + 186792 186831 40 -7.05 3.01 Init + 188266 188552 287 0 2 29 10 151 0.559 -2.00 3.02 Term + 188610 188865 256 0 1 43 48 296 0.882 15.17 3.03 PlyA + 188973 188978 6 1.05 4.04 PlyA - 189736 189731 6 -0.45 4.03 Term - 191487 191241 247 2 1 80 42 299 0.937 18.78 4.02 Intr - 201792 201743 50 0 2 69 64 47 0.047 -3.04 4.01 Init - 206201 205752 450 2 0 41 86 173 0.346 8.36 4.00 Prom - 212080 212041 40 -6.65 5.02 PlyA - 212112 212107 6 1.05 5.01 Sngl - 213055 212711 345 1 0 78 47 219 0.991 12.59 5.00 Prom - 213832 213793 40 -8.55 6.04 PlyA - 214737 214732 6 1.05 6.03 Term - 220271 219145 1127 0 2 48 48 277 0.759 10.87 6.02 Intr - 220519 220422 98 0 2 26 40 94 0.528 -2.87 6.01 Init - 221456 220834 623 2 2 19 87 168 0.328 4.66 6.00 Prom - 235006 234967 40 -3.25 7.00 Prom + 237895 237934 40 -4.55 7.01 Init + 238669 238681 13 0 1 69 111 -6 0.487 0.38 7.02 Intr + 239120 239261 142 0 1 62 54 107 0.382 3.19 7.03 Intr + 239402 239517 116 2 2 77 83 51 0.417 2.67 7.04 Intr + 254882 254969 88 0 1 84 63 55 0.083 0.71 7.05 Intr + 256878 256963 86 0 2 34 106 72 0.285 2.14 7.06 Intr + 264783 264825 43 1 1 110 62 29 0.016 -1.12 7.07 Intr + 270757 270839 83 1 2 58 84 93 0.132 4.26 7.08 Term + 271246 271382 137 2 2 84 43 80 0.356 0.30 7.09 PlyA + 272490 272495 6 1.05 8.04 PlyA - 272505 272500 6 1.05 8.03 Term - 275875 275662 214 0 1 57 47 184 0.745 6.92 8.02 Intr - 276241 276141 101 1 2 71 43 69 0.716 -1.31 8.01 Init - 281176 281081 96 1 0 67 103 110 0.917 10.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:100711850_101002955|GENSCAN_predicted_peptide_1|265_aa MVRESQRMAFLDFVVSFREEVQGEGCTQLALSAWIPHLPRASQVQSGKGWVNECKVWPLR TARHTGCGRAGSSSSFSPTIQWVPNPYPVSRKNEVCGHLQDEQANLPELFCPFLSGKADR HNLGQWNLKGGMLTSLSLLSVPFEMKYSGIETGCKRADPFFVTEIEQFLNMTRVAQWTSP TGSMGLSPCATMRECRNKDTRQRDKRKDSWAWGTTTTNARRPVVAPNVWLRCYLLDTKQK GQGKECESSPMIGSSGTCILTVADC >gi568815593r:100711850_101002955|GENSCAN_predicted_CDS_1|798_bp atggtaagagaaagtcagagaatggccttcctggattttgtggtcagcttcagggaagaa gttcaaggggaaggctgcactcagcttgcactatcagcctggatcccacacctgccaagg gctagccaggtgcagagtggcaagggatgggtgaacgaatgcaaggtctggccactgcgc acagccaggcacactggctgtggcagggcaggcagctccagctctttcagtcccaccatt cagtgggtcccgaatccttatcctgtgtccaggaagaatgaagtatgtggacacctgcag gatgagcaagccaatctccccgaacttttctgtcctttcctctccggaaaagctgacagg cacaaccttggacaatggaaccttaaaggaggcatgctgacttcattgtctcttttaagt gtcccttttgaaatgaaatattctgggatagagactggctgtaagcgtgcagatcccttt ttcgtcacagaaattgagcaatttctcaacatgaccagagtggcgcaatggaccagccct acagggtccatgggtctctccccgtgtgcgacgatgagagagtgtagaaataaagacaca agacaaagagataaaagaaaagacagctgggcctgggggaccactaccaccaacgcgcgg agaccagtagtggccccgaatgtctggctgcgctgttatttattggatacaaagcaaaag gggcagggtaaagagtgtgagtcatctccaatgataggttcatcaggaacctgtatctta actgttgctgactgttag >gi568815593r:100711850_101002955|GENSCAN_predicted_peptide_2|336_aa MFLTFTKLAILFCFRDGELSLSRSLVNSSDKIIRKAGSSIFQHNVEGWKINSSLVLEIRK NILRFLDAERDVSVVKSSFKPGDVIHYVLDRRRTLNISHDLHSLLPEVSPMKNRRFKTCA VVGNSGILLDSECGKEIDSHNFVIRCNLAPVVEFAADVGTKSDFITMNPSVVQRAFGGFR NESDREKFVHRLSMLNDSVLWIPAFMVKGGEKHVEWVNALILKNKLKVRTAYPSLRLIHA VRGYWLTNKVPIKRPSTGLLMYTLATRFCDEIHLYGFWPFPKDLNGKAVKYHYYDDLKYR YFSNASPHRMPLEFKTLNVLHNRGALKLTTGKCVKQ >gi568815593r:100711850_101002955|GENSCAN_predicted_CDS_2|1011_bp atgtttcttactttcacaaagctggcaattttgttttgtttcagagatggtgaattgtct ttgagtcggtcacttgtcaatagctctgataaaatcattcgaaaggctggctcttcaatc ttccagcacaatgtagaaggttggaaaatcaattcctctttggtcctagagataaggaag aacatacttcgtttcttagatgcagaacgagatgtgtcagtggtcaagagcagttttaag cctggtgatgtcatacactatgtgcttgacaggcgccggacactaaacatttctcatgat ctacatagcctcctacctgaagtttcaccaatgaagaatcgcaggtttaagacctgtgca gttgttggaaattctggcattctgttagacagtgaatgtggaaaggagattgacagtcac aattttgtaataaggtgtaatctagctcctgtggtggagtttgctgcagatgtgggaact aaatcagattttattaccatgaatccatcagttgtacaaagagcatttggaggctttcga aatgagagtgacagagaaaaatttgtgcatagactttccatgctgaatgacagtgtcctt tggattcctgctttcatggtcaaaggaggagagaagcacgtggagtgggttaatgcatta atccttaagaataaactgaaagtgcgaactgcctatccgtcattgagacttattcatgct gtcagaggttactggctgaccaacaaagttcctatcaaaagacccagcacaggtcttctc atgtatacacttgccacaagattctgtgatgaaattcacctgtatggattctggcccttc cctaaggatttaaatggaaaagcggtcaaatatcattattatgatgacttaaaatatagg tacttttccaatgcaagccctcacagaatgccattagaattcaaaacattaaatgtgcta cataatagaggagctctaaaactgacaacaggaaagtgtgtaaagcaataa >gi568815593r:100711850_101002955|GENSCAN_predicted_peptide_3|180_aa MGPPFAGGKGRRDKGTALYCKIGSRKAKYESGVVMQDSRTMEGPSLKLLPSHKARSAPQR PETALVTSLEYWTGIWSRTSIPVYSKLKTGLSTWLGSAVWELKRNNELGSQAFPRTFHPG EKGKCSGSLAALALFLAAPTPCTVLVSQETKAVRTLQGTGVSATFAASPLPCPTVIAGFD >gi568815593r:100711850_101002955|GENSCAN_predicted_CDS_3|543_bp atgggacccccatttgcgggggggaagggtcgaagggataaaggaactgccttgtattgc aaaattggtagcagaaaagctaaatatgaatctggggtagtgatgcaagattcaaggact atggaaggcccttctttgaaactgctgccaagccacaaggctcgcagtgccccccaaagg ccagagactgccttggttacctcacttgaatactggaccgggatctggagccgaacctcc attccagtttactcaaaactcaaaactggcctctccacgtggcttggaagcgctgtttgg gagttgaaacgaaataatgagctcgggtcacaagcttttccacgaacctttcatcctgga gaaaaaggaaagtgctcagggagcctagctgccctcgcattgttcctggcagctccaact ccttgcacagtcttggtctctcaggagaccaaggcagtcaggacgctacagggcacgggt gtaagcgcgacgttcgcggccagccctctgccttgccccactgtaattgcgggttttgat taa >gi568815593r:100711850_101002955|GENSCAN_predicted_peptide_4|248_aa MLEVLARAIRQEKEIRGIPLGKEEAKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSG HKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTREVKDLFKENYKPLLN KIKEDRSKWKNIPCSWIGRINMVKMAILPKKIKSRTAHVIHTHYRSYLVIGFAQRLSPSK REGPVTPPNPAAEAAATSNAQGVSSEKKPEPQGGGAERPWQSWRKQGERSLGSVRGPEGA RPQKTPVI >gi568815593r:100711850_101002955|GENSCAN_predicted_CDS_4|747_bp atgttggaagttctggccagggcaatcaggcaggagaaagaaataaggggtattccatta ggaaaagaggaagccaaattgtccctgtttgcagatgacatgattgtatatttagaaaac cccatcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcagga cacaaaatcaatgtgcaaaaatcacaagcattcctatacaccaataacagacaaacagag agccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctagga atccaacttacaagggaggtgaaggacctcttcaaggagaactacaaaccactgctcaat aaaataaaagaggacagaagcaaatggaagaacatcccatgctcatggataggaagaatc aatatggtgaaaatggccatactgcccaagaaaataaagtcacgcacagctcatgtaatc catactcattatcgttcttacctggtgattggctttgcgcagcgtttatctcctagcaag agggaagggccagtgacgcccccgaacccagctgcagaagctgccgccacctccaatgca caaggtgtctcatctgaaaagaaacctgagccccagggaggcggcgcggagcgaccctgg cagagctggcgcaaacagggcgagaggtcgctgggcagcgttcgaggaccagagggagct cggccacagaagaccccagtgatctga >gi568815593r:100711850_101002955|GENSCAN_predicted_peptide_5|114_aa MDPNQEEIPDLPEKEFRLVIKLIKEVPEKDEVQCKEIEKKMIEEMRGEIFSEIDSINRKQ SQLQEIKDTLREMQNVLESLSIRIEQAGERTSELEGKVFEVTQSKDKEKRIRKK >gi568815593r:100711850_101002955|GENSCAN_predicted_CDS_5|345_bp atggatccaaaccaagaagaaatccctgatttacctgaaaaagaattcaggttagttatt aagctaatcaaggaggtaccagagaaagatgaagttcaatgtaaggaaattgaaaaaaaa atgatagaagaaatgaggggagaaatcttcagtgaaatagatagcataaacagaaaacaa tcacaacttcaggaaataaaggacacacttagagaaatgcaaaatgttctagaaagtctc agcattagaattgaacaagcaggagaaagaacttcagagcttgaaggcaaggtttttgaa gtaacccaatccaaagacaaagaaaaaagaataagaaaaaaatga >gi568815593r:100711850_101002955|GENSCAN_predicted_peptide_6|615_aa MRTKKDTTYQNLWDAFKAVCRGKFIGLNAHKRKQEKSKVDTLTSQLKELEKQEQTQSKAS RRQEITKIRAELKEIETQKTLQQINETGSRFFEKINRIDRPLGRLIKMKRERNQIDTIKN DKGDITTVPIDIQTTIREYYKHIYANKLENLEEMDKFLDTCTLPRLSQEEIESLNRPITG SKIEAIINSLPTKKSPGPDGFTAEFYHRTKDKNHMIISIDAEKAFDKFQQPFMLKTLNKL VLGVLAKAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISSFSKVSE YKINVQKSQAFLYTNNRHTESQIMHELPFTIASKRIKYLGIQLTRVVKDLFNENYKPLLN EIKEDTKKWKNIPCSWIGRINIMKMATLLKVICRFNTIPIKLPMTFSTELEKTTLKFIWN QKRAHTAKSILIQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEASEIIPH IYNHLIFDKLDKNKQWGKDSLFNKWYWENWLSICRKLKLDPFLTPYTKINSRWIKVLSVR PKTIKTLEENLGNTIQDIGMGKDFMSKTAKAMATKAKIEKWDLIETKESLHGKRNYCQSE QASYRMGENFCNLPI >gi568815593r:100711850_101002955|GENSCAN_predicted_CDS_6|1848_bp atgagaacaaaaaaagacacaacataccagaatctctgggacgcatttaaagcagtgtgt agagggaaatttataggactaaatgcccacaagagaaagcaggaaaaatctaaagttgac accctaacatcacaattaaaagaactagagaagcaagagcaaacacagtcaaaagctagc agaaggcaagaaataactaagatcagagcagaactgaaggagatagaaacacaaaaaacc cttcaacaaatcaatgaaactgggagcaggttttttgaaaagatcaacagaattgataga ccgctaggaagactaataaagatgaagagagagaggaatcaaatagacacaataaaaaat gataaaggggatatcaccactgttcccatagatatacaaactaccatcagagaatactat aaacacatctatgcaaataaactagaaaatctagaagaaatggataaattcctggacaca tgcaccctcccaagactaagccaggaagaaattgaatccctgaatagaccaataacaggc tctaaaattgaggcaataattaatagcctaccaaccaaaaaaagtccaggaccagatgga ttcacagccgaattctaccacagaacaaaagacaaaaaccacatgattatctcaatagat gcagaaaaggccttcgacaaatttcaacagcccttcatgctaaaaactctcaataaatta gtgttaggagttctggccaaggcaatcaggcaggagaaagaaataaagggtattcaatta ggaaaagaggaagtcaaactgtccctgtttgcagatgacatgattgtatatttagaaaac cccattgtctcagcccaaaatctccttaagctgataagcagcttcagcaaagtctcagaa tacaaaatcaatgtgcaaaaatcacaagcattcctatacaccaataacagacacacagag agccaaatcatgcatgaactcccattcacaattgcttcaaagagaataaaatacctggga atacaacttacaagggttgtgaaggacctcttcaatgagaattacaaaccactgctcaac gaaataaaagaggacacaaaaaaatggaagaacattccatgctcatggataggaagaatc aatatcatgaaaatggccacactgctcaaggtaatttgtagattcaataccatccccatc aagctaccaatgactttctccacagaattggaaaaaactactttaaagttcatatggaac caaaaaagagcccacactgccaagtcaatcctaatccaaaagaacaaagctggaggcatc acgctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactgg taccaaaacagagatatagaccaatggaacagaacagaggcctcagaaataataccacac atctacaaccatctgatctttgacaaacttgacaaaaacaagcaatggggaaaggattcc ctatttaataaatggtactgggaaaactggctatccatatgtagaaagctgaaactggat cccttccttacaccttatacaaaaattaattcaagatggattaaagtcttaagtgtcaga cctaaaaccataaaaaccctagaagaaaacctaggtaataccattcaggacataggcatg ggcaaggacttcatgtctaaaacagcaaaagcaatggcaaccaaagccaaaattgagaaa tgggatctaattgaaactaaagagagtctgcacggcaaaagaaactactgtcagagtgaa caggcaagctacagaatgggagaaaatttttgcaatctacccatctga >gi568815593r:100711850_101002955|GENSCAN_predicted_peptide_7|235_aa MVRAGQNLSETVLHEDGWQLSPLEMPFAEGSWAIKDVMIHPWERSGSNDGLIFLCFLIDT ATENDPMPFLHKNVCLLVYFAGKMTKGLFKEPKLKPSNMGRRCCPETSAPLEYRESGEDR EIQATLNVVYVTMSSSPSLAMESSFFPQIIVDKQEVAKNITKSNMLRLLERGPDPDPKRV FLDLAQERFQGGTRDLEKKETQRQSIEKEKGAQGTGVQHKEDPRRHRPLSSLSIY >gi568815593r:100711850_101002955|GENSCAN_predicted_CDS_7|708_bp atggtaagagcaggacaaaatctatcagaaactgtattgcatgaagatggctggcagctg agtcctttagaaatgccttttgctgaagggagttgggccatcaaagatgtaatgatacat ccctgggagaggtcagggtccaatgacggtttaatcttcttgtgtttcctcatagatact gcgactgaaaacgacccaatgcctttcctgcataaaaatgtctgtctcttagtctacttt gcaggcaaaatgacaaagggattattcaaagaacccaaactgaagcctagtaacatggga aggagatgctgtccagaaacatcagccccactagaatacagagagtcaggggaagataga gagatccaagccacacttaatgtcgtctatgtgactatgagttcttccccaagccttgcg atggagtccagtttctttcctcaaataattgtagacaaacaagaagttgcaaaaaatatc acaaaaagcaatatgttacggttactggaaaggggtcctgatccagaccccaagagagtg ttcttggatcttgcacaagaaagatttcagggtggaacgagagacttggaaaagaaagag acacagagacaaagtatagagaaagaaaaaggggcccaggggactggcgttcagcataag gaggatccacgccggcaccgtcctctgagttcccttagtatttattga >gi568815593r:100711850_101002955|GENSCAN_predicted_peptide_8|136_aa MAEKERKEGASEQGVWLGTEIEKIGCWMTVLQIASRLFFIKYKNPAQFMTRLAATLRRFT ALDPKRSLLHMLEICHWAKECPQHGIPPKPRPICVGPHRKSDCSSHLAATPRAPGTLAQG SLTDSFPDLLGLAAED >gi568815593r:100711850_101002955|GENSCAN_predicted_CDS_8|411_bp atggctgagaaggagagaaaagaaggagcatctgaacaaggagtttggctggggacagaa atagagaagattggctgttggatgaccgtacttcagatagcgtctaggctctttttcatc aaatataaaaatccagcccagttcatgactcgtttggcagcaaccctgagacgctttaca gccctagaccctaaaaggagcttgctacacatgctggaaatctgccactgggccaaggaa tgcccacagcacggtattcctcctaagccacgtcccatctgtgtgggaccccaccgaaaa tcggactgttcatctcacctggcagccactcccagagcccctggaactctggcccaaggc tctctgactgactccttcccagatcttctcggcttagcggctgaagactga