GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:54:45

Sequence gi568815590r:81593659_81794972 : 201314 bp : 38.28% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -    155    150    6                               1.05
 1.02 Term -  12023  11923  101  0  2   96   50    50 0.670  -0.69
 1.01 Init -  13033  12913  121  1  1   31   58   146 0.655   6.30
 1.00 Prom -  20355  20316   40                              -2.45

 2.06 PlyA -  21956  21951    6                               1.05
 2.05 Term -  23765  23676   90  2  0   40   48   102 0.654  -1.86
 2.04 Intr -  24461  24299  163  1  1   -9   94   161 0.479   6.06
 2.03 Intr -  25158  25038  121  1  1   10   53   119 0.462  -0.77
 2.02 Intr -  26455  26350  106  1  1   86  100    36 0.965   3.47
 2.01 Init -  27796  27659  138  1  0   57   88   100 0.774   7.09
 2.00 Prom -  31444  31405   40                              -9.05

 3.00 Prom +  31614  31653   40                              -7.35
 3.01 Init +  37102  37315  214  0  1   72    5   188 0.674   8.16
 3.02 Intr +  39766  39986  221  2  2   52   78   119 0.256   4.40
 3.03 Term +  49040  49225  186  1  0   25   38   237 0.311   8.91
 3.04 PlyA +  49384  49389    6                               1.05

 4.12 PlyA -  50265  50260    6                               1.05
 4.11 Term -  65808  65693  116  1  2  112   32   129 0.997   7.45
 4.10 Intr -  67009  66858  152  0  2   52   98   172 0.901  13.49
 4.09 Intr -  77389  77281  109  2  1   70   94    61 0.664   3.32
 4.08 Intr -  80348  80183  166  2  1   51   97   104 0.429   6.21
 4.07 Intr -  82621  82576   46  0  1   35   86    14 0.227  -6.61
 4.06 Intr -  85572  85468  105  2  0   24   77   118 0.300   2.71
 4.05 Intr -  87125  86992  134  2  2   73   68    93 0.683   4.32
 4.04 Intr -  89904  89821   84  0  0   60   38    98 0.445   1.10
 4.03 Intr -  92206  92105  102  1  0   36   87    89 0.050   3.05
 4.02 Intr -  94321  94278   44  2  2   91   96    22 0.067   0.34
 4.01 Init - 101254 100990  265  1  1   59  102   174 0.484  13.12
 4.00 Prom - 104090 104051   40                              -5.05

 5.08 PlyA - 105129 105124    6                               1.05
 5.07 Term - 109207 109037  171  1  0  108   38    89 0.956   2.74
 5.06 Intr - 109466 109311  156  2  0   97   87     1 0.509   0.19
 5.05 Intr - 120381 120260  122  1  2   32   77   117 0.980   4.29
 5.04 Intr - 121237 121146   92  0  2   76  111   107 0.981  10.52
 5.03 Intr - 134588 134416  173  2  2   74  105   116 0.855   9.72
 5.02 Intr - 138628 138541   88  0  1   34   84    49 0.772  -1.85
 5.01 Init - 139193 138820  374  2  2   49   61   300 0.911  17.88
 5.00 Prom - 140345 140306   40                              -6.45

 6.00 Prom + 146941 146980   40                              -2.15
 6.01 Init + 154315 154365   51  0  0   60  101    -3 0.213  -0.59
 6.02 Intr + 155765 155798   34  1  1   69   99    33 0.739  -0.62
 6.03 Intr + 159406 159583  178  2  1   99   84   164 0.984  15.16
 6.04 Intr + 161712 161826  115  0  1   38   94   163 0.999  11.43
 6.05 Intr + 164484 164637  154  2  1  112   62   145 0.953  13.02
 6.06 Term + 164822 164886   65  0  2   75   37    98 0.878   0.47
 6.07 PlyA + 165851 165856    6                               1.05

 7.00 Prom + 174309 174348   40                              -5.35
 7.01 Sngl + 175459 175908  450  0  0   88   53   443 0.919  36.76
 7.02 PlyA + 176337 176342    6                               1.05

 8.00 Prom + 176872 176911   40                              -6.15
 8.01 Init + 177824 179202 1379  1  2   70   39   471 0.061  30.56
 8.02 Term + 186970 187105  136  1  1   64   42   180 0.383   7.51
 8.03 PlyA + 189524 189529    6                               1.05

 9.04 PlyA - 190093 190088    6                               1.05
 9.03 Term - 192557 192441  117  2  0   69   48    86 0.357   0.26
 9.02 Intr - 192725 192632   94  1  1   23  103    38 0.385  -2.15
 9.01 Init - 193684 193464  221  1  2   65   84    93 0.577   4.75
 9.00 Prom - 194730 194691   40                              -3.25

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  58584  58639   56  1  2   96   53    74 0.907   1.44
S.002 Init -  92257  92105  153  1  0   58   87   136 0.831  10.54
S.003 Sngl + 177824 179209 1386  1  0   70   43   451 0.914  34.62

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:81593659_81794972|GENSCAN_predicted_peptide_1|73_aa
MCIAASGGAEAFYEMGIHCWDIAVAAIIVTEAGGVLMDVTGGPFHLMSRRIIAANCTALA
ERIAKEIQVAPFQ
>gi568815590r:81593659_81794972|GENSCAN_predicted_CDS_1|222_bp
atgtgcattgcggcaagtggaggagcagaggcattttatgaaatgggaattcactgctgg
gatattgcagtagctgccattattgttactgaagctggtggcgtgctaatggatgttact
ggtggaccattccatttaatgtcacggagaataattgctgcaaattgtacagcattagca
gaaaggatagccaaagaaattcaggtagcaccttttcaatga
>gi568815590r:81593659_81794972|GENSCAN_predicted_peptide_2|205_aa
MGGKKEKKKEKRFNPKKSLEKSKTMMCAWNSEMVMEREALEVAGAEIEFGVVYSCVEDKR
YTVRKGKGAFYNGQKLQVSQEELKEAGLLGASEGRKLHKDAARNARTSYRDGLRKTGRSL
CGKGPLRAGQGGHAYPKEWLWWARLLSMAAGEVANSAAYGFAPATLVTPLGTLSVLEWQD
MPIDDVTGTFDWLYNNRGDILVACF
>gi568815590r:81593659_81794972|GENSCAN_predicted_CDS_2|618_bp
atggggggaaaaaaggaaaagaaaaaagaaaagcgttttaatccaaagaagagcttggag
aagtcgaagaccatgatgtgtgcatggaacagtgagatggtcatggagagggaagcactg
gaagtggccggggcagagatagaatttggagttgtgtacagttgtgtggaagacaagagg
tacactgtcaggaaaggaaaaggtgccttttataatggtcaaaaactacaggtttcacaa
gaagaactgaaggaagcggggcttcttggcgcttctgaagggagaaagctgcacaaggat
gccgccaggaatgctcgcacctcctaccgggatggcctgcgtaagacggggcggtctctg
tgcggaaaaggtcctctgagagcaggtcaaggtggccacgcatatcctaaagaatggttg
tggtgggctagactgctgtcaatggcagctggcgaggtggccaactcagctgcatatggg
tttgcaccggccacactggtgactccactaggaactctcagcgtcctagagtggcaagat
atgcccattgatgatgtcactggtacttttgactggctttacaataatcgtggggatatt
cttgttgcatgcttttaa
>gi568815590r:81593659_81794972|GENSCAN_predicted_peptide_3|206_aa
MIPQKLPQGLSAVASANRYSNIGSVPGHPRAAPDNVAGKSHPDAAFAPGADRTPQTALAP
TPAERSAEPGEATVPNPKNVCRAQVLSGAGDSSTNSASWHDVQLLTLSTFPKGSSLLEAF
GILSCQLTPICERGCNYAEPQNEREPGPQEQNSVSGKKKKKKKKRKEKREEEGEEGEGEE
EEEDNNSGAKTLAKDHTENPKLVALF
>gi568815590r:81593659_81794972|GENSCAN_predicted_CDS_3|621_bp
atgattccacagaagctgccacaggggttaagtgcagtggcgtccgcgaaccgctactcc
aacatcgggagtgttcctggtcaccccagggcagctccagataacgtggcaggcaagagt
cacccagatgcggccttcgccccaggggctgaccgcacaccgcagaccgccctcgccccc
accccagcagagagaagcgccgaacccggagaggccactgttcctaatcctaagaacgtc
tgccgtgcccaggtgttgtcgggggcaggggacagcagcacaaactctgcttcctggcat
gatgtccagctcctcactctttcaaccttccctaagggaagctccctcttggaggccttt
ggtatcctttcctgtcagctgactcccatctgtgagagaggttgtaattacgctgaacct
cagaatgaaagagagcctgggccacaggagcaaaactctgtctcaggaaaaaaaaaaaag
aagaagaagaagaggaaagagaagagagaagaagagggagaggaaggagaaggagaagaa
gaagaagaagacaacaactcaggtgctaagacacttgccaaagatcacacagaaaatccg
aaacttgtggccttgttttga
>gi568815590r:81593659_81794972|GENSCAN_predicted_peptide_4|440_aa
MSSLSFLNIEKTEILFFTKTEETILVSSSYENKRPNSSHLFVKIEDPKILQMVNVAKKIS
SDATNFTINLVTDEEGETNVTIQLWDSEGPSLEWGSYDLNSDRVAKRKVAWLLRWKAVTR
TETAGNSSGVYGFGKMKTVTVIHTVARVGNLDIQIQRKSELFLSLVVCEAIKNEMNVMLK
SSPVDLVTATDQKVEKMLISSIKEKYPSHSFIGEESVAAGEKSILTDNPTWIIDPIDGTT
NFVHRFPFVAVSIGFAVNKKKKVDETYLGLVKETCFPLQIEFGVVYSCVEGKMYTARKGK
GAFCNGQKLQVSQQEDITKSLLVTELGSSRTPETVRMVLSNMEKLFCIPVHGIRSVGTAA
VNMCLVATGGADAYYEMGIHCWDVAGAGIIVTEAGGVLMDVTGGPFDLMSRRVIAANNRI
LAERIAKEIQVIPLQRDDED
>gi568815590r:81593659_81794972|GENSCAN_predicted_CDS_4|1323_bp
atgtcatcgctcagttttctgaatatagagaagactgaaatactatttttcacaaagact
gaagaaaccatccttgtaagttcaagctacgaaaataaacggcctaattccagccacctc
tttgtgaaaatagaagatcctaaaatactacaaatggtgaatgtggccaagaagatctca
tcagatgctacaaactttaccataaatctggtgactgatgaagaaggagaaacaaatgtg
actattcaactctgggattctgaaggaccctctttggaatgggggtcttatgacctaaac
tcagacagggtggcaaaaaggaaggtggcctggctcctacgctggaaggctgtgaccagg
acagaaacagcaggaaacagttcaggtgtttatggctttgggaaaatgaaaactgtgacg
gttatccatacagttgccagagtgggaaatctggacatccagattcagaggaaatctgaa
ctcttcctgtccctggtagtttgtgaagctataaaaaatgaaatgaatgttatgctgaaa
agttctccagttgatttggtaactgctacggaccaaaaagttgaaaaaatgcttatctct
tccataaaggaaaagtatccatctcacagtttcattggtgaagaatctgtggcagctggg
gaaaaaagtatcttaaccgacaaccccacatggatcattgaccctattgatggaacaact
aactttgtacatagatttccttttgtagctgtttcaattggctttgctgtaaataaaaag
aaaaaagtggatgaaacatacttaggtttggtaaaggaaacttgttttcctctgcagata
gaatttggagttgtgtacagttgtgtggaaggcaagatgtacactgccagaaaaggaaaa
ggtgccttttgtaatggtcaaaaactacaagtttcacaacaagaagatattaccaaatct
ctcttggtgactgagttgggctcttccagaacaccagagactgtgagaatggttctttct
aatatggaaaagcttttttgcattcctgttcatgggatccggagtgttggaacagcagct
gttaatatgtgccttgtggcaactggcggagcagatgcatattatgaaatgggaattcac
tgctgggatgttgcaggagctggcattattgttactgaagctggtggcgtgctaatggat
gttacaggtggaccatttgatttgatgtcacgaagagtaattgctgcaaataatagaata
ttagcagaaaggatagctaaagaaattcaggttatacctttgcaacgagacgacgaagat
taa
>gi568815590r:81593659_81794972|GENSCAN_predicted_peptide_5|391_aa
MGSRLWARPQPTSLILRAVLLGQGDFSLDSIFQVLLFLAQHLLSLPKSDQGLLGTGSGSA
LRRAAPFKELAQLAHCCGVRRASDPGGLSGAPRGRAEGTGEQGSPLGDPDRKGGPPGNQQ
VVLDRDGPRNEELVCEWLKEKQTDVQGSCAGPESSFTIPEYNHMRFVPNQEIHLISPILL
LYLLVNNSRLPFSQAKNLGVNLDPHPIHQQIPHRHQSDHECEKLEIPKPRMAATQKLVKD
IIDSKTGETASKRWKGAKNSETAAKVALMKLKMHADGDKSLPQTERIYFQVFLPKGSKEK
SKPMFFCHRWSIGKAIDFAASLARLKNDNNKFTAKKLRLCHITSGEALPLDHTLETWIAK
EDCPLYNGGNIILEYLNDEEQFCKNVESYLE
>gi568815590r:81593659_81794972|GENSCAN_predicted_CDS_5|1176_bp
atgggctcccgcctgtgggccagaccccagcctacctcgcttattctgcgtgccgtgctt
cttggccagggcgatttctctctggattcgattttccaggtactcttgtttcttgcccag
catctcctcagtctcccgaagtcggaccagggcctcctggggactgggagcggctcggct
cttagaagagccgccccctttaaagaacttgcccaacttgctcattgctgtggagttcgg
agggccagcgatcccgggggcctctcgggagctccccggggccgcgccgaggggacaggt
gagcaaggcagtcctctcggggacccagaccggaaaggcggccctcctgggaatcaacaa
gtcgtcctggacagggacggtcccaggaacgaggaactagtttgtgaatggctcaaagaa
aagcaaaccgatgtccaggggtcttgtgcaggacctgaaagtagtttcactataccggag
tacaatcacatgagatttgtacctaatcaagaaatacacctaatctctccaatcctgctc
ctctacctcttagttaataactccaggcttcctttttctcaggccaaaaaccttggagtc
aaccttgacccacatcccatccatcagcagatcccacaccgtcatcagtcagatcatgag
tgtgaaaaactggaaatcccaaagcctcgaatggctgccactcagaaacttgttaaagac
attattgattccaagacaggagaaacagcaagtaaacgatggaaaggtgccaaaaatagt
gaaacagctgcaaaggttgcattgatgaaattaaagatgcatgctgatggcgataagtca
ttaccacagacagaaagaatttactttcaggttttcttacctaaagggagcaaagagaag
agcaaaccaatgttcttttgccaccgatggagcattggaaaggccatagactttgccgct
tctctagccaggcttaaaaatgacaataacaaatttacagctaagaaattaaggctgtgt
cacattacttcaggagaagccttacccttggatcatactttggaaacctggattgctaag
gaggattgtcctttatataatggtggaaatataatcttggaatatcttaatgatgaagaa
caattctgtaaaaatgttgaatcttacttggaatag
>gi568815590r:81593659_81794972|GENSCAN_predicted_peptide_6|198_aa
MGDRSLSHLCNLDHKRQVPGNGQGAGETAALQALKRKKRFEKQLTQIDGTLSTIEFQREA
LENSHTNTEVLRNMGFAAKAMKSVHENMDLNKIDDLMQEITEQQDIAQEISEAFSQRVGF
GDDFDEDELMAELEELEQEELNKKMTNIRLPNVPSSSLPAQPNRKPGMSSTARRSRAASS
QRAEEEDDDIKQLAAWAT
>gi568815590r:81593659_81794972|GENSCAN_predicted_CDS_6|597_bp
atgggagaccggagtctatctcacctctgcaatctcgaccataaaagacaggtgccagga
aatggacaaggtgctggggaaacagctgcattacaggcactaaagagaaagaagaggttc
gagaaacagctcactcagattgatggcacactttctaccattgagttccagagagaagcc
ctggagaactcacacaccaacactgaggtgttgaggaacatgggctttgcagcaaaagcg
atgaaatctgttcatgaaaacatggatctgaacaaaatagatgatttgatgcaagagatc
acagagcaacaggatatcgcccaagaaatctcagaagcattttctcaacgggttggcttt
ggtgatgactttgatgaggatgagttgatggcagaacttgaagaattggaacaggaggaa
ttaaataagaagatgacaaatatccgccttccaaatgtgccttcctcttctctcccagca
cagccaaatagaaaaccaggcatgtcgtccactgcacgtcgatcccgagcagcatcttcc
cagagggcagaagaagaggatgatgatatcaaacaattggcagcttgggctacctaa
>gi568815590r:81593659_81794972|GENSCAN_predicted_peptide_7|149_aa
MGKKQNRKTGNSKKQSTSPPPKERSSSPAMEQSWMENDFDELREEGFRRSNYSELREDIQ
TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRSDQLEERVSA
MEDEMNEMKREGKFREKRIKRNEQSLQEI
>gi568815590r:81593659_81794972|GENSCAN_predicted_CDS_7|450_bp
atggggaaaaaacagaacagaaaaactggaaactctaaaaagcagagcacctctcctcct
ccaaaggaacgcagttcctcaccagcaatggaacaaagctggatggagaatgactttgac
gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa
accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata
accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta
cgtgaagaatgcagaagcctcaggagccgaagcgatcaactggaagaaagggtatcagcg
atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa
agaaatgagcaaagcctccaagaaatatga
>gi568815590r:81593659_81794972|GENSCAN_predicted_peptide_8|504_aa
MDTFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEW
VPFLLKLFQSIEKEGILPNSFYEASIILIPKTGRDTTKKENFRPISLMNIDAKILNKILA
KRIQQHIKKLIHHDQVGFIPGMQGWFNIHKSINVIQHINRAKDKNHMIISIDAEKAFDKI
QQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF
NIALEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISKFSKV
SGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPP
LKEIKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPIKLPMPFFTELEKTTLKLI
WNQKRARITKSILSQKNKAGGITLPDFKLYYKATVTKTACFFLYKMLSLLPLVVPTRRHS
KLIVNARTEHLLKSAAGISVVQNW
>gi568815590r:81593659_81794972|GENSCAN_predicted_CDS_8|1515_bp
atggatacgttcctcgacacatacactctcccaagactaaaccaggaagaagttgaatct
ctgaatagaccaataacaggctctgaaattgtggcaataatcaatagcttaccaacaaaa
aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaatgg
gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactct
ttttatgaggccagcatcattctgataccaaagacaggcagagacacaacaaaaaaagag
aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca
aaacgaatccagcagcacatcaaaaagcttattcaccatgatcaagtgggcttcatccct
gggatgcaaggctggttcaatatacacaaatcaataaatgtaatccagcatataaacaga
gccaaagacaaaaaccacatgattatctcaatagatgcagaaaaagcctttgacaaaatt
caacaacccttcatgctaaaaactctcaataaattaggtattgatgggacgtatttcaaa
ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg
gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattc
aacatagcgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtatt
caattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtatatcta
gaaaaccccattgtctcagcccaaaatctccttaagctgataagcaaattcagcaaagtc
tcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaa
acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatac
ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccaccg
ctcaaggaaataaaagaggatacaaacaaatggaagaacatcccatgctcatgggtagga
agaatcaatatcatgaaaatggccatactgcccaaggtaatttacagattcaatgccatc
cccatcaagctaccaatgcctttcttcacagaattggaaaaaactactttaaagttgata
tggaaccaaaaaagagcccgcatcaccaagtcaatcctaagccaaaagaacaaagctgga
ggcatcacgctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatgt
tttttcctctacaaaatgttgtcactgctgccgctcgttgttcccactaggcggcactct
aagctcatcgtcaatgcgaggaccgagcacctgctaaaatctgcggcaggcatctccgtt
gtgcaaaattggtaa
>gi568815590r:81593659_81794972|GENSCAN_predicted_peptide_9|143_aa
MAEEELEDRGRSEGLPPIELIKVCMVKEDKSYNIVWTQAAERERKMTRRIWQIKLNQTKA
RRHKSTQSGRFIICGANSFAFLSLSPHLKGDNNSAPQSGLLKEEQVSVDVVEVLSGAVRV
FHAHIKMFASQRFMPHPVPAKLL
>gi568815590r:81593659_81794972|GENSCAN_predicted_CDS_9|432_bp
atggctgaagaggagttagaagacagaggaagaagtgagggtttgcctccaattgaatta
ataaaagtatgcatggtaaaagaagacaaaagttacaatatcgtatggactcaagcagct
gagagggaaagaaagatgaccaggagaatatggcagataaaactgaaccagacaaaagca
agaaggcacaaaagcactcagtctggaaggtttataatctgtggggcaaattcctttgca
tttctgagcctcagtcctcatttaaagggagacaacaatagtgccccccaatctggttta
ttgaaagaagaacaggtcagtgtggatgttgtggaggtgttaagtggggcagtgcgtgtt
ttccatgcccacataaaaatgtttgccagtcagagatttatgcctcatccagtgcccgct
aagctgctatga