GENSCAN 1.0    Date run: 3-Nov-116    Time: 14:17:30

Sequence gi568815593r:88622607_88923788 : 301182 bp : 37.28% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +  18342  18381   40                              -3.85
 1.01 Init +  37063  37225  163  0  1   29   66   168 0.951   8.64
 1.02 Term +  38122  38183   62  2  2   20   50   124 0.355  -1.21
 1.03 PlyA +  38848  38853    6                               1.05

 2.08 PlyA -  40069  40064    6                               1.05
 2.07 Term -  45344  45121  224  0  2   80   41   157 0.629   6.30
 2.06 Intr -  49591  49304  288  2  0   48   48   122 0.398   0.49
 2.05 Intr -  52262  52064  199  2  1   86   11   193 0.637   9.40
 2.04 Intr -  53399  53247  153  2  0   34    2   234 0.688   8.75
 2.03 Intr -  53845  53574  272  2  2    8  -71   489 0.780  21.34
 2.02 Intr -  55543  55443  101  0  2   -9  102    78 0.830  -1.67
 2.01 Init -  55902  55742  161  0  2   83   86   171 0.955  13.74
 2.00 Prom -  56184  56145   40                              -4.75

 3.00 Prom +  56531  56570   40                             -11.64
 3.01 Init +  56596  56687   92  0  2    6   81   108 0.826   1.91
 3.02 Intr +  57607  57799  193  1  1   28   75   159 0.693   7.17
 3.03 Term +  57981  58232  252  2  0   59   40   223 0.985   9.25
 3.04 PlyA +  58597  58602    6                               1.05

 4.00 Prom +  58639  58678   40                              -6.95
 4.01 Init +  62325  62419   95  2  2   71   37   171 0.972  10.20
 4.02 Intr +  62867  62943   77  2  2   67   85    39 0.866  -0.26
 4.03 Intr +  65109  65227  119  1  2   26   81   173 0.820   9.66
 4.04 Intr +  66731  66881  151  1  1   -9   91   112 0.613   0.61
 4.05 Intr +  67018  67143  126  2  0   72   36   112 0.495   4.13
 4.06 Intr +  72139  72268  130  2  1   87   31   126 0.506   5.63
 4.07 Term +  72740  73016  277  2  1   65   38   275 0.887  14.15
 4.08 PlyA +  73904  73909    6                               1.05

 5.05 PlyA -  74159  74154    6                               1.05
 5.04 Term -  77208  76919  290  1  2   77   45   117 0.434   0.85
 5.03 Intr -  86024  85892  133  2  1   62   40   129 0.147   4.80
 5.02 Intr -  92630  92537   94  1  1  101   66    62 0.196   4.35
 5.01 Init -  98658  98642   17  0  2   78   89     9 0.178  -0.25
 5.00 Prom -  99334  99295   40                              -7.15

 6.13 PlyA -  99366  99361    6                               1.05
 6.12 Term - 100319  99998  322  1  1  120   43   176 0.948   9.61
 6.11 Intr - 106022 105887  136  0  1   56   86    88 0.951   4.11
 6.10 Intr - 106741 106612  130  1  1   44  117    13 0.793  -0.85
 6.09 Intr - 107628 107605   24  0  0   62  111    46 0.571   1.60
 6.08 Intr - 109295 109123  173  0  2  103   35   162 0.928  11.14
 6.07 Intr - 126511 126464   48  2  0  124  115    52 0.997   9.23
 6.06 Intr - 129437 129260  178  2  1  118   92    35 0.641   5.47
 6.05 Intr - 138519 138382  138  0  0   60   95   104 0.979   8.04
 6.04 Intr - 138762 138579  184  2  1   83   75   155 0.687  12.57
 6.03 Intr - 176493 176180  314  0  2   70   71   198 0.095  10.36
 6.02 Intr - 182195 181992  204  2  0  115   78   309 0.757  31.07
 6.01 Init - 201182 201129   54  2  0   80  109    59 0.889   8.53
 6.00 Prom - 207494 207455   40                              -7.05

 7.00 Prom + 210213 210252   40                              -6.65
 7.01 Init + 222558 222776  219  2  0   80   93   123 0.846  10.68
 7.02 Term + 228291 228341   51  2  0   75   32    81 0.347  -2.35
 7.03 PlyA + 229885 229890    6                               1.05

 8.00 Prom + 231853 231892   40                              -4.95
 8.01 Init + 234391 234556  166  0  1   87   53   111 0.202   7.39
 8.02 Term + 245153 245355  203  0  2   22   34   142 0.205  -1.23
 8.03 PlyA + 245602 245607    6                               1.05

 9.00 Prom + 250918 250957   40                              -5.65
 9.01 Init + 260607 260898  292  2  1   57   28   467 0.243  32.96
 9.02 Intr + 269502 269671  170  1  2   47   48   163 0.174   6.94
 9.03 Term + 270230 270433  204  1  0   48   48   121 0.650   0.49
 9.04 PlyA + 270470 270475    6                               1.05

10.04 PlyA - 270991 270986    6                               1.05
10.03 Term - 275428 275340   89  2  2   42   51    97 0.445  -1.76
10.02 Intr - 281396 281310   87  0  0   90  105    46 0.317   5.52
10.01 Init - 292445 292364   82  2  1   37   75    75 0.359   2.28

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:88622607_88923788|GENSCAN_predicted_peptide_1|74_aa
MQIDQQQESHYLWSPTTSPELAIDLVRPLVHGIKTVLTCWPVPFGVTEAQPADGGIPKQP
QTGGGAEAVTEVAD

>gi568815593r:88622607_88923788|GENSCAN_predicted_CDS_1|225_bp
atgcaaatcgaccagcagcaagaaagccactacctttggagtcccacgacttcacctgag
cttgcaatagaccttgtgcgccctttggttcatggaatcaaaactgttttgacatgttgg
ccagtgcccttcggtgtcactgaagctcagcccgcggacggcgggattcccaagcaaccg
cagaccggaggtggcgcagaggcagtgaccgaggtcgctgattag

>gi568815593r:88622607_88923788|GENSCAN_predicted_peptide_2|465_aa
MGVRAGWEGAPTLRARLPGWDRAPGPAAAAAATLPTLFPNTHLASESPSHRPSGRTRTRS
LLLPSKYMHRHSMPCNVAGVFVLILHKGARARRGGGGGGEGGGGGEVEEEEEEEEEEEEE
EEEEEEEEEEERMEAPRSPDYAWAGAASSNLFFSEPGSRRSSAAHRTGAETTVLEGYSPD
VEGDRELTSMDVTSQLNLEREAGMGENTERQPAEPESAGGGGSSRAQAGLRNGPRPLLGR
RADLRAGKKGAPVAVVATSPLCPRPPSVPFHRSTSQPQLCFPDMFVAIGPHACEGRTQGD
GNSGNVFALVSAAFYLANVPAVLEWGTVLRGQESSGIDQKLDRRLCQVTATAGGPFGAVA
ALGPAPQSRVPFPPFLGVHLCLPPSFLHLGTERDFFQSLDTRTIYTSTEEVKGRGLLFQE
QSVVLAEEMLIEKIGIVAMLPLLLLSLPLLGQQAWGSWLITDAYQ

>gi568815593r:88622607_88923788|GENSCAN_predicted_CDS_2|1398_bp
atgggggtgagggcggggtgggagggggctccgacactgcgcgctcggctgccgggctgg
gacagagcacccggacccgcggcggcagcggcagcgacgctacccaccctcttccccaac
acccacctagccagcgagtcaccgagccaccggccctcgggccgtacacgcacgcgttcc
ttgctactgccatctaaatatatgcaccgacactctatgccctgcaacgtggccggtgtg
tttgtcctgatcctacataaaggagcgagagcaagaagaggaggaggaggtggaggagaa
ggaggaggaggaggagaggtggaagaggaggaggaggaggaggaggaggaggaggaggag
gaggaggaggaggaggaggaggaggaggaggaaagaatggaggcgccgcggagccctgat
tatgcatgggctggagctgcctcctctaacctgttcttctccgagccgggcagccggcgg
tcctcagccgcccacagaacaggagccgaaacaaccgttctcgaaggctacagcccggac
gtagagggagacagggagctgacgtcaatggacgtgacctcgcagctaaaccttgaacgt
gaagccgggatgggggaaaacacggagcggcagccggcggagcccgagagcgcgggcggc
ggcggcagtagccgagcccaggccgggctaagaaatggtcctcggcctcttctgggccgg
agggcagatctacgagcaggcaagaaaggggcgccggtggctgtggtggcgacttctccg
ctctgccctcggccgccaagtgtgccgtttcatcgttccacctcgcagccccagctttgt
tttcctgacatgttcgtggccattgggccccacgcctgcgaggggagaacccagggtgat
gggaactcgggcaatgtatttgcattagtgtctgcggcgttttatcttgcaaatgtaccc
gcggtgcttgaatgggggactgtcttgaggggtcaggagagcagtggaattgatcagaag
ttagaccggaggctctgtcaagtcacggccacggccgggggcccctttggagctgtggct
gccttgggcccagcaccgcagagtcgggttccgtttcctccctttcttggtgtccacctc
tgtctaccaccctcctttctccaccttggcaccgagagagatttcttccagtctttggat
actagaacgatctacactagcacggaggaggtaaaaggccgcgggttgcttttccaggag
cagtcggtagtgctagcggaagagatgttgattgagaaaatagggattgttgcaatgctg
ccgctactgctgctgtcgctgccgctgctcgggcagcaggcgtggggcagttggctgatc
actgatgcatatcaataa

>gi568815593r:88622607_88923788|GENSCAN_predicted_peptide_3|178_aa
MTFKDDVGYLTVKIKACLNDSAVVQPCPLKCPYGTPVESPLFGFYQEALSAPPPPMAREE
KGGRVGSHERSRLGAASVGAHVSPQQTRLSFLSPKGNSDTYWPHGYRARAVRAALGRGAG
LRGLGRPRETSPKALRTALGLVRPGEDKATLSGDPWEDCRMELNTQAHTHAHFFWGTG

>gi568815593r:88622607_88923788|GENSCAN_predicted_CDS_3|537_bp
atgacatttaaagatgatgttggctacctcacggtaaaaattaaggcttgccttaacgac
tcggcagttgtccaaccatgtcctctgaaatgcccatatggaaccccggttgaatctccc
ctcttcggattttaccaggaggccctcagtgccccaccgccgcccatggcacgtgaagag
aaaggagggagagttgggagccacgagagaagccggctcggagccgcttccgtcggggcc
catgtttcgccccaacagacccgtctctcctttctctccccgaaggggaacagcgacacc
tactggccgcatgggtaccgcgcccgggccgtcagggctgccctgggacggggtgccggg
ctccgtggacttgggaggccgcgggaaacttcaccgaaggctctgcggactgctctagga
ctcgtccgccctggagaagacaaagccacgctttctggagacccctgggaggactgtagg
atggaactgaacacacaggcacacacgcatgcccattttttctggggaaccggataa

>gi568815593r:88622607_88923788|GENSCAN_predicted_peptide_4|324_aa
MEDEEEVENQKVVRALLEKEEEEEEEEEGWEGLPQPEPCVSVHIGTPPHLISACFYRGTD
GCVWGLVPTQISTGSSAGRRDPANGIHEETHVRLEKKFKLTPVPALELDQVLATAPTWNA
GSVAPKPKAKGKRNEVWDGGRPLIGGSDWGGLPGMPPPTPSPAEDSSAAGLPEALTPASP
LGAIDADLGVRLGEVLGIKRHELRFLLHNFSMREEDIEYLTQSSGMEPSKTLFPSLELLC
KKDFRSETPQCGSKTRLGCPGMERGLSLDWNSWNRRGLVSLSQALRTAYSSRQGHLGREK
VERVFGKGETLENAAPPHPVSVDE

>gi568815593r:88622607_88923788|GENSCAN_predicted_CDS_4|975_bp
atggaggatgaagaggaggtggagaatcagaaggtagtgcgggcgttgttagagaaggag
gaggaagaggaggaggaggaggagggatgggaggggctccctcagcccgaaccctgcgtc
tctgttcacattggaacaccccctcacctcatctcagcttgcttttaccgaggaacagac
ggctgcgtgtgggggctggtccctacccagatcagtacaggatcttcagctggcagacga
gatccagcaaatggaattcacgaagaaactcatgtgcgtttggaaaaaaagttcaagctg
acacctgtgcctgccctggagctagatcaggttctggcgactgctccaacctggaacgcc
ggctcagtagcgcccaagccgaaggcgaaggggaagaggaacgaagtgtgggatgggggg
cggcccctcattgggggttcagactggggcgggcttccagggatgccccctccgaccccg
tcaccagcggaggactcatctgccgcggggctgccggaagctctcactcccgccagcccc
ctgggtgctattgatgcggaccttggagtgcggcttggagaagtcttgggaatcaaacgg
catgaacttagattcctgttacacaacttttcaatgagggaggaggatatcgaatacctc
actcagtcttcgggaatggagccttcaaaaacacttttcccatccttggagttgctctgt
aagaaggactttcggagtgaaactccacagtgcggttcaaagacacgcttaggatgcccc
gggatggagagaggactgagcttggactggaattcctggaacagacgagggcttgtgagc
ttgtcccaggcactgaggacagcatactcttcccgacaagggcatctgggtcgagagaag
gtggagcgggtcttcggcaaaggcgaaaccctggagaacgctgccccaccccaccctgta
agcgtggacgagtaa

>gi568815593r:88622607_88923788|GENSCAN_predicted_peptide_5|177_aa
MHLGIQNITFPKDNLVGSYFTNAQSTYSTYRESRAVQSNLPSKKKEALDVGWLKAHKTKC
SDIDKDKKRGTPGPKHSPDARGFPGEKVRAKMKQKEEKKEEEQEGEKDQEEDEEKGEGGK
KKKGGRKEGKGKKKRKEHTKSILPPNVKCIEGTYLVLEVSMESHQLYLTEDSFCCSM

>gi568815593r:88622607_88923788|GENSCAN_predicted_CDS_5|534_bp
atgcatcttggaattcagaatatcacgttcccaaaagataatttggtaggatcctacttt
actaatgctcaaagcacctactccacttacagagaaagccgagctgtgcagtcaaacttg
cctagcaagaagaaagaggcactggatgttgggtggctgaaagctcacaagacaaaatgt
agtgatatagacaaggacaaaaagagagggacccctggacctaagcactctccagatgcc
agaggatttccaggagagaaagtaagagcaaaaatgaagcagaaggaagaaaaaaaggag
gaagagcaagagggggagaaggatcaggaagaggatgaggagaaaggagaaggaggaaag
aaaaagaagggaggaagaaaagaaggaaagggaaaaaaaaagagaaaggaacacaccaaa
agtatcctccctccaaatgttaaatgcattgaaggaacttacttagttttagaagtgtcc
atggagtctcaccaattatatcttacagaagactccttttgttgctcaatgtga

>gi568815593r:88622607_88923788|GENSCAN_predicted_peptide_6|634_aa
MGRKKIQITRIMDERNRQVTFTKRKFGLMKKAYELSVLCDCEIALIIFNSTNKLFQYAST
DMDKVLLKYTEYNEPHESRTNSDIVEETLPSRGRQTPHTRELRLATGGCHSGTKLPEEGT
GSNLCCSPASAGVNPGKWSEVDLQQTPADLQQRCLTVKRKTNKQKGIASTSTKRMSAPKP
HPKVTSIKDQSLCLGLNFPNYVFQTLRKKGLNGCDSPDPDADDSVGHSPESEDKYRKINE
DIDLMISRQRLCALNKKENKGCESPDPDSSYALTPRTEEKYKKINEEFDNMIKSHKIPAV
PPPNFEMPVSIPVSSHNSLVYSNPVSSLGNPNLLPLAHPSLQRNSMSPGVTHRPPSAGGL
MGGDLTSGAGTSAGNGYGNPRNSPGLLVSPGNLNKNMQAKSPPPMNLGMNNRKPDLRVLI
PPGSKNTMPSVSEDVDLLLNQRINNSQSAQSLATPVVSVATPTLPGQGMGGYPSAISTTY
GTEYSLSSADLSSLSGFNTASALHLGSVTGWQQQHLHNMPPSALSQLGACTSTHLSQSSN
LSLPSTQSLNIKSEPVSPPRDRTTTPSRYPQHTRHEAGRSPVDSLSSCSSSYDGSDREDH
RNEFHSPIGLTRPSPDERESPSVKRMRLSEGWAT

>gi568815593r:88622607_88923788|GENSCAN_predicted_CDS_6|1905_bp
atggggagaaaaaagattcagattacgaggattatggatgaacgtaacagacaggtgaca
tttacaaagaggaaatttgggttgatgaagaaggcttatgagctgagcgtgctgtgtgac
tgtgagattgcgctgatcatcttcaacagcaccaacaagctgttccagtatgccagcacc
gacatggacaaagtgcttctcaagtacacggagtacaacgagccgcatgagagccggaca
aactcagacatcgtggaggagacacttcccagcaggggtcgacagacacctcatacgaga
gagctccggctggcaactggtgggtgccactctgggacgaagcttccagaggaaggaaca
ggcagcaatctttgctgttctccagcctctgctggtgttaacccaggcaaatggtctgaa
gtagacctccagcaaactccagcagacctgcagcagaggtgcctgactgttaaaaggaaa
actaacaaacagaaaggaatagcatcaacatcaacaaaaaggatgtctgcaccaaaaccc
catccaaaggtcaccagcatcaaagaccaaagcctttgtctaggtttgaactttccaaat
tatgtatttcagacgttgagaaagaagggccttaatggctgtgacagcccagaccccgat
gcggacgattccgtaggtcacagccctgagtctgaggacaagtacaggaaaattaacgaa
gatattgatctaatgatcagcaggcaaagattgtgtgcattgaacaagaaagaaaacaaa
ggctgtgaaagccccgatcccgactcctcttatgcactcaccccacgcactgaagaaaaa
tacaaaaaaattaatgaagaatttgataatatgatcaagagtcataaaattcctgctgtt
ccacctcccaacttcgagatgccagtctccatcccagtgtccagccacaacagtttggtg
tacagcaaccctgtcagctcactgggaaaccccaacctattgccactggctcacccttct
ctgcagaggaatagtatgtctcctggtgtaacacatcgacctccaagtgcaggtggtctg
atgggtggagacctcacgtctggtgcaggcaccagtgcagggaacgggtatggcaatccc
cgaaactcaccaggtctgctggtctcacctggtaacttgaacaagaatatgcaagcaaaa
tctcctcccccaatgaatttaggaatgaataaccgtaaaccagatctccgagttcttatt
ccaccaggcagcaagaatacgatgccatcagtgtctgaggatgtcgacctgcttttgaat
caaaggataaataactcccagtcggctcagtcattggctaccccagtggtttccgtagca
actcctactttaccaggacaaggaatgggaggatatccatcagccatttcaacaacatat
ggtaccgagtactctctgagtagtgcagacctgtcatctctgtctgggtttaacaccgcc
agcgctcttcaccttggttcagtaactggctggcaacagcaacacctacataacatgcca
ccatctgccctcagtcagttgggagcttgcactagcactcatttatctcagagttcaaat
ctctccctgccttctactcaaagcctcaacatcaagtcagaacctgtttctcctcctaga
gaccgtaccaccaccccttcgagatacccacaacacacgcgccacgaggcggggagatct
cctgttgacagcttgagcagctgtagcagttcgtacgacgggagcgaccgagaggatcac
cggaacgaattccactcccccattggactcaccagaccttcgccggacgaaagggaaagt
ccctcagtcaagcgcatgcgactttctgaaggatgggcaacatga

>gi568815593r:88622607_88923788|GENSCAN_predicted_peptide_7|89_aa
MNITNNERLTATVISSRVNGAYNCCCQQLLQIAKRPQKQFRKEKFLNGSTLWKKLFLYSH
CSFPALKYDVEATPTQCKDEDEDLNDDPL

>gi568815593r:88622607_88923788|GENSCAN_predicted_CDS_7|270_bp
atgaacattacaaataacgaacgcttaactgcaactgtcatatcctcccgtgtcaatggt
gcctataattgctgttgtcaacagctgcttcaaattgccaaaagaccccagaaacaattc
aggaaggaaaagtttttaaatggatccacattatggaaaaaactcttcctttatagtcac
tgttcatttcctgcattgaaatatgacgtagaagcaactcctactcaatgtaaagatgag
gatgaagaccttaatgatgatccactttaa

>gi568815593r:88622607_88923788|GENSCAN_predicted_peptide_8|122_aa
MHLEKPQTLNASLRKQPGLGGWGAIPCKATGAELSKAMGAHLLHQHDLDVRHRVKEDQSY
FQPPSQQELKTNWSTYVDADGKRIQLSLNSLKKAEWRAFWSLQSNLSLRVQRDSLMDFLV
CE

>gi568815593r:88622607_88923788|GENSCAN_predicted_CDS_8|369_bp
atgcacctggaaaagccacagacactcaatgccagcctgagaaagcagccaggattgggg
ggctggggggctataccctgcaaagccacaggggcagagctgtccaaggccatgggagcc
caccttttgcatcagcatgaccttgatgtgagacatagagtcaaagaagatcagtcctac
tttcaaccaccaagccagcaagaattgaaaaccaactggtcaacatatgtcgatgcagat
ggaaaaaggatacaactgtctttaaattccctaaagaaagctgagtggagagccttctgg
tcactgcagtccaacctatcattacgcgtgcagcgagactctttgatggatttcttagtc
tgtgaataa

>gi568815593r:88622607_88923788|GENSCAN_predicted_peptide_9|221_aa
MCACVRAREGSAREGGGAGPRIRARRGRSEEEEEEEEGGGREGGGGGGGESGSQRPVGLK
VTRLVQKTASRKEVTEDEWEKENSSQRLKHRVKKTGGAFFPFSLKSSILRENVSLLQVLV
DMEEGLTFSMADNREGWSDLEEAVWPTANWLYHMVRVRSDCLLMSFQVSDGRLWQLGWAK
ACWEGVAHRMPNGAFCSRGPRSSSVRHMINTTAVLGDCTIS

>gi568815593r:88622607_88923788|GENSCAN_predicted_CDS_9|666_bp
atgtgtgcgtgtgtgcgcgcgcgcgaggggagcgcgcgcgagggggggggcgcggggccg
cgcattcgcgcgcgccgaggccgctcggaagaggaggaggaggaggaagaaggaggagga
agagaaggaggaggaggaggaggaggagaaagtggctctcagcggccggtcggattaaaa
gtaaccagactcgtccagaaaactgcgtccaggaaggaagtgacagaagacgaatgggaa
aaggagaacagctcacagcgtttgaaacatcgcgtaaaaaagactgggggtgcattcttc
ccattctccctaaaatccagtatccttagagaaaatgtttcacttcttcaggtgcttgta
gacatggaagaaggcctcaccttctccatggctgacaaccgagaaggctggtctgacttg
gaagaagctgtgtggccaacagccaattggctgtatcacatggtcagagtccgctcagat
tgtctcctgatgtcattccaagtgtctgatggccgtctatggcaactgggttgggcaaaa
gcctgctgggaaggggtagcccacagaatgcctaatggagcgttctgttcaagaggacct
agaagtagttctgttagacatatgatcaacactacagctgtgttaggggactgtaccatt
tcttga

>gi568815593r:88622607_88923788|GENSCAN_predicted_peptide_10|85_aa
MRKRESKAPNMLPIVAEFGPGTDRFAAGIFLPVRVWTTKPSAGADGYNFLEKQKGTESSM
QISAMEEPPEEPVIMQIPEIAAKWA

>gi568815593r:88622607_88923788|GENSCAN_predicted_CDS_10|258_bp
atgaggaaaagagagtcaaaggctccaaatatgcttcctattgttgctgagtttggtcca
gggactgataggtttgctgcaggaatttttttacctgtcagggtttggacaacaaagccc
tcagcaggtgctgacgggtacaacttcctggagaagcagaaaggcactgagagtagcatg
caaataagtgctatggaagaaccacctgaggagcctgttataatgcagattcctgagatt
gcagctaagtgggcctga