GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:46:58 Sequence gi568815581f:67278155_67787184 : 509030 bp : 44.60% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 15435 15578 144 1 0 110 9 169 0.143 11.45 1.02 Term + 19131 19261 131 1 2 103 54 106 0.971 7.04 1.03 PlyA + 19961 19966 6 1.05 2.13 PlyA - 20007 20002 6 1.05 2.12 Term - 20782 20691 92 2 2 38 48 61 0.087 -5.02 2.11 Intr - 22553 22520 34 2 1 125 108 35 0.220 7.00 2.10 Intr - 25247 25097 151 1 1 86 28 57 0.026 -0.34 2.09 Intr - 46197 46121 77 0 2 88 53 101 0.247 4.91 2.08 Intr - 53889 53680 210 0 0 92 22 110 0.136 3.91 2.07 Intr - 64109 64032 78 2 0 97 95 -2 0.542 1.15 2.06 Intr - 66626 66452 175 1 1 14 66 77 0.674 -1.96 2.05 Intr - 67703 67591 113 2 2 65 108 102 0.794 9.08 2.04 Intr - 69096 68962 135 0 0 80 110 -14 0.602 0.76 2.03 Intr - 69331 69182 150 1 0 22 83 93 0.853 2.56 2.02 Intr - 70500 70396 105 0 0 65 110 19 0.821 2.21 2.01 Init - 72170 72075 96 2 0 89 94 79 0.635 8.91 2.00 Prom - 77258 77219 40 -1.16 3.04 PlyA - 77452 77447 6 1.05 3.03 Term - 79277 79219 59 0 2 113 54 91 0.786 6.05 3.02 Intr - 81992 81956 37 2 1 83 100 10 0.690 -0.46 3.01 Init - 88365 88258 108 0 0 115 100 234 0.998 27.52 3.00 Prom - 89595 89556 40 -6.86 4.00 Prom + 97640 97679 40 -8.46 4.01 Init + 99067 99188 122 0 2 63 24 157 0.742 4.46 4.02 Intr + 99937 100048 112 1 1 39 99 138 0.700 10.38 4.03 Intr + 100460 101040 581 1 2 94 38 177 0.090 4.80 4.04 Term + 120330 120522 193 0 1 51 47 231 0.728 12.19 4.05 PlyA + 121417 121422 6 1.05 5.06 PlyA - 121503 121498 6 1.05 5.05 Term - 121866 121846 21 0 0 106 50 16 0.006 -2.09 5.04 Intr - 155098 155004 95 2 2 108 113 2 0.175 4.38 5.03 Intr - 158521 158390 132 2 0 37 104 31 0.075 0.22 5.02 Intr - 209940 209853 88 0 1 66 82 64 0.632 3.14 5.01 Init - 213273 212980 294 0 0 39 46 215 0.659 9.59 5.00 Prom - 224962 224923 40 -3.66 6.00 Prom + 242685 242724 40 -3.96 6.01 Init + 252905 252907 3 1 0 108 81 0 0.596 1.30 6.02 Intr + 254648 254796 149 1 2 85 109 250 0.833 25.83 6.03 Intr + 274103 274191 89 2 2 99 111 -2 0.290 2.81 6.04 Intr + 278870 279003 134 0 2 53 107 21 0.038 0.86 6.05 Intr + 300032 300103 72 1 0 105 84 77 0.251 8.60 6.06 Intr + 342380 342640 261 1 0 96 -12 148 0.028 3.28 6.07 Intr + 353989 354084 96 0 0 105 99 184 0.997 21.41 6.08 Intr + 391354 391509 156 0 0 120 115 66 0.471 12.71 6.09 Intr + 397325 397388 64 1 1 110 115 37 0.807 6.89 6.10 Term + 414418 414734 317 2 2 78 32 247 0.105 13.20 6.11 PlyA + 415347 415352 6 1.05 7.00 Prom + 417474 417513 40 -3.46 7.01 Init + 439794 440084 291 2 0 74 25 214 0.511 10.93 7.02 Intr + 441752 441808 57 1 0 37 98 59 0.646 0.88 7.03 Intr + 443224 443372 149 0 2 76 97 155 0.973 14.23 7.04 Intr + 444426 444483 58 0 1 111 106 -45 0.573 -1.51 7.05 Intr + 445895 446039 145 1 1 88 97 -14 0.427 -0.64 7.06 Intr + 456209 456285 77 0 2 98 108 -1 0.458 2.13 7.07 Intr + 459354 459538 185 2 2 92 70 165 0.918 13.69 7.08 Intr + 459693 459818 126 0 0 55 107 35 0.788 1.89 7.09 Intr + 459968 460178 211 2 1 55 32 163 0.981 6.32 7.10 Intr + 465325 465432 108 0 0 101 72 39 0.929 4.08 7.11 Intr + 465589 465621 33 0 0 77 100 34 0.480 1.92 7.12 Intr + 465771 465822 52 2 1 69 70 23 0.065 -2.92 7.13 Term + 472943 473091 149 0 2 85 45 112 0.824 4.66 7.14 PlyA + 473619 473624 6 1.05 8.00 Prom + 495644 495683 40 -3.26 8.01 Sngl + 506876 507151 276 1 0 36 38 227 0.672 7.88 8.02 PlyA + 507593 507598 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 18893 18902 10 1 1 66 113 2 0.861 1.08 S.002 Term + 100460 101089 630 1 0 94 35 171 0.901 6.72 S.003 Term + 342380 342667 288 1 0 96 42 150 0.805 6.58 S.004 Term - 361587 361477 111 0 0 96 49 106 0.947 6.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:67278155_67787184|GENSCAN_predicted_peptide_1|91_aa XLRPFLGLKLLGLPQGPERLTTHGSTGPLVTTERRLQLFLATLREIKERLRPFLGLKLLG LPQGPERLTTYGSTGPLVTTERRLQLFLATL >gi568815581f:67278155_67787184|GENSCAN_predicted_CDS_1|276_bp ngcctgcgtcccttcctggggctgaagcttctgggcctgccccagggtcctgaacggctg acaacacacggatcaactggaccactggtgaccacagaaaggaggctgcagctgtttctt gctacgcttcgagagatcaaagagcgcctgcgtcccttcctggggctgaagcttctgggc ctgccccagggtcctgaacggctaacaacatacgggtcaactggaccactggtgaccaca gaaaggaggctgcagctgtttcttgctaccctttga >gi568815581f:67278155_67787184|GENSCAN_predicted_peptide_2|471_aa MVQQCCTYVEEITDLPIKLRLIDTLRMVTEGKIYVEIERARLTKTLATIKEQNGDVKEAA SILQELQVETYGSMEKKERVEFILEQMRLCLAVKDYIRTQIISKKINTKFFQEENTEKLK LKYYNLMIQLDQHEGSYLSICKHYRAIYDTPCIQAESEKWQQALKSVVLYVILAPFDNEQ SDLVHRISGDKKLEEIPKYKDLLKLFTTMELMRWSTLVEDYGMELRKGSLESPATDVFGS TEEGEKRWKDLKNRVVEHNIRIMAKYYTRITMKRMAQLLDLSVDVTLMQEVASHGLGKLC PCGFAGYSPPPGCFHGLALSVAFPGARCKLSVDLPFWDLEDSGPLLTAPLGHAPETSQKQ LDHLDILQSIPLGHNIDDRRMTVPIIVQMIHQYLSCTTSGLFSPICYFSNYLFRVDFKQL KACTATPCLAVIGVQVMFGYTILSHWKVFRGNNMHGAVISYDNHYCFCNAS >gi568815581f:67278155_67787184|GENSCAN_predicted_CDS_2|1416_bp atggttcaacagtgctgtacttatgttgaggaaatcacagaccttcctatcaaacttcga ttaattgatactctacgaatggttaccgaaggcaagatttatgttgaaattgagcgtgcg cgactgactaaaacattagcaactataaaagaacaaaatggtgatgtgaaagaggcagcc tccattttacaggagttacaggtggaaacctacgggtcaatggaaaagaaagagcgagtg gaatttattttggagcaaatgaggctctgcctagctgtgaaggattacattcgaacacaa atcatcagcaagaaaattaacaccaaatttttccaggaagaaaatacagagaaattaaag ttgaagtactataatttaatgattcagctggatcaacatgagggatcctatttgtctatt tgtaagcactacagagcaatatatgatactccctgtatacaggcagaaagtgaaaaatgg cagcaggctctgaagagtgttgtactctatgttatcctggctccttttgacaatgaacag tcagatttggttcaccgaataagtggtgacaagaagttagaagaaattcccaaatacaag gatcttttaaagctttttaccacaatggagttgatgcgttggtccacacttgttgaggac tatggaatggaattaagaaaaggttcccttgagagtcctgcaacggatgtttttggttct acagaggaaggtgaaaaaaggtggaaagacttgaagaacagagttgttgaacataatatt agaataatggccaagtattatactcggataacaatgaaaaggatggcacagcttctggat ctatctgttgatgtcacgctgatgcaagaggtggcctcccatggccttgggaagctctgc ccttgcggctttgcagggtatagcccccctcctggctgctttcatgggctggcattgtct gtggcttttccaggtgcacggtgcaagctgtcagtggatttaccattctgggatctggag gacagtggcccacttctcacagccccactaggccatgccccagagacatcccaaaagcag ctggaccatctggacatcctgcagagcatcccactaggccacaatattgatgacaggaga atgacagttccaattattgttcagatgatacatcaatacctctcatgcaccacctcaggc cttttcagccccatctgttacttttccaactacctcttccgcgtagacttcaaacaactt aaagcttgcacggccacaccgtgtcttgccgttattggggtgcaggtgatgtttggttac acaatcttgtcccactggaaggtcttcaggggcaataacatgcatggagctgtcatctcc tatgataaccattactgcttctgtaatgcctcctga >gi568815581f:67278155_67787184|GENSCAN_predicted_peptide_3|67_aa MADGGSERADGRIVKMEVDYSATVDQRLPECAKLAKIVTIEMTLIKAKGFRYGIDIPYLS CSSEDVL >gi568815581f:67278155_67787184|GENSCAN_predicted_CDS_3|204_bp atggcggacggcggctcggagcgggctgacgggcgcatcgtcaagatggaggtggactac agcgccacggtggatcagcgcctacccgagtgtgcgaagctagccaagatagtaaccata gagatgactttgatcaaagcgaaaggcttccgatatggtatcgacatcccgtatcttagt tgcagtagtgaagatgtgctatga >gi568815581f:67278155_67787184|GENSCAN_predicted_peptide_4|335_aa MGLPSPPRPLLMSASLALAPPAPTSRPQAHCIGADAPAPASSLAGLGGAPRFPPRGSAAG RTMLLKEYRICMPLTVDEVGVRGGALRPRDLHAPSPAGGTSEPLGEVPAGPDFSPSPRAW AQLPARPPFEARDPAVSFRKPPGSETFCVSCFFRRGERRASRERRGLRRGHRRALQHVVP GPRLPAGGAGRGAGARPPSAALLSFSRSCEAASSSIQEVVTQIGLFRPRLPFLPLLPPHL CGDSTTAERLCPGTAREPKAKQGNVKLTSVVWHWLCDVQLGVVEKNWALPVDQCRLQELQ FLVHLIDLLSILLSCNGFPRIQKAVMDQTSSSDHQ >gi568815581f:67278155_67787184|GENSCAN_predicted_CDS_4|1008_bp atggggctaccctccccgccgcggccgctgctgatgtcagcctcgctcgcgctcgctcct cccgcacccacctcccggccccaggcacactgcatcggcgcggacgctccggccccggcg agcagccttgctggtcttgggggcgccccccgcttcccgccccgggggtccgcggccggc aggaccatgctgctgaaagagtaccggatctgcatgccgctcaccgtagacgaggtaggg gtgcgaggaggagccctgcgccctcgggatctgcacgccccgagccccgcgggaggaacc tctgagcccttaggggaggtccctgcggggcccgacttctcgccgtcgccgcgggcttgg gcgcagctcccggcacgtccgcccttcgaggctcgggacccggctgtgtcctttcgcaaa ccgcctggctcggaaactttctgcgtctcttgtttcttccgccgcggggagcggcgcgcg agccgggagcggcgggggctgcgacgcggccacaggagggcgctccagcacgtggtgccg gggccgcggctgccggctgggggcgccgggcgcggggcgggggctcgtcctccaagcgcg gctctgctgtccttctcccgatcctgcgaagccgcgagctccagtattcaggaagtggtg actcagataggattattccggccccggcttccctttcttccccttcttcctccccacctt tgtggtgattccacgactgctgagcgtctctgtccagggacagcgagggagcccaaagcc aagcaaggcaacgtgaaactcactagtgtggtttggcattggttgtgcgacgtgcagttg ggtgttgtggagaagaattgggcccttcctgttgaccagtgccggctgcaggagttgcag tttttggtgcatctcatcgatttgctgagcatacttctcagctgtaatggtttccccagg attcagaaagccgtaatggatcagaccagcagcagcgaccaccagtga >gi568815581f:67278155_67787184|GENSCAN_predicted_peptide_5|209_aa MPPTLDSREAAVPCTQMTQKAFLNERPGIVTHAGSHIWQIWVTSRTVLSVLRVPGNALPR VKCTSCSGRLRSHASLRHPDGHTLWGWLRLRAPEAEPTYSRGKEKLQQGENPASNSSFEL LGNPNLSTLHTAKKHESDYVTSGFRTLSGSHCKKDISQTPSQAPNAFHALSGLASLGLAQ LNSFLALTARRQENYRKQKRENHSRLRIA >gi568815581f:67278155_67787184|GENSCAN_predicted_CDS_5|630_bp atgcctccaaccctggacagcagggaagcagcagtgccctgcacccagatgacccagaaa gcatttcttaatgaacggcccggcattgtgactcatgctggctcccacatctggcagatc tgggttacatcacgcacggtcctcagtgtcctgcgggttcctgggaacgcgctgcccaga gtcaagtgcaccagctgcagtggccggctgcgttcacacgcctctctgcgtcatcctgat ggccacacactatggggctggctccggctccgggctccagaggctgagccgacttattct aggggcaaagaaaaacttcagcaaggagaaaatcccgccagcaacagctcttttgagctg ctcggtaatccaaacctttcaactctgcacacagccaagaaacatgaatcagattatgtc acttctggttttagaactctcagtggctcccattgcaaaaaggacatatcccaaactccc agccaggccccgaacgctttccacgccctgtccggtctggcttcccttggtctggcccaa ctaaattctttcctggcactgactgccaggagacaggaaaactatagaaagcagaagagg gagaaccacagtaggctccgtattgcatga >gi568815581f:67278155_67787184|GENSCAN_predicted_peptide_6|446_aa MYKIGQLYMISKHSHEQSDRGEGVEVVQNEPFEDPHHGNGQFTEKRVYLNSKLPSWARAV VPKIFYVTEKAWNYYPYTITEACSVLAHVALMKHHRLGGLSSRHLFLTVAEAESATGFLV RAVFLCSFLPKFSIHIETKYEDNKGSNDTACRHLTGDRIFSDTLRPETFAFSLLYNSLLA LEKQFSPLRQRPNDIVHQKGFSREALKFTQQLRRSGLTLMDEFPLLSLNTPEILWQIFDN EAKDVEREVCFIDIACDEIPERYYKESEDPKHFKSEKTGRGQLREGWRDSHQPIMCSYKL VTVKFEVWGLQTRVEQFVHKVVRDILLIGHRQAFAWVDEWYDMTMDEVREFERATQEATN KKIGIFPPAISISSIPLLPSSVRSAPSSAPSTPLSTDAPEFLSVPKDRPRKKSAPETLTL PDPEKKATLNLPGMHSSDKPCRPKSE >gi568815581f:67278155_67787184|GENSCAN_predicted_CDS_6|1341_bp atgtacaaaattggacagctgtacatgatcagcaaacacagccatgaacagagtgaccgg ggagaaggggtggaggtcgtccagaatgagccctttgaggaccctcaccatggcaatggg cagttcaccgagaagcgggtgtatctcaacagcaaactgcctagttgggctagagctgtt gtccccaaaatattttatgtgacagagaaggcttggaactattatccctacacaattaca gaggcctgttctgtgttagctcatgttgctctaatgaaacaccacagacttgggggcttg agcagcagacatttatttctcacagttgcagaagctgaaagtgcgactgggttcctggtg agggctgtcttcctgtgttcctttctgccgaaattctccattcatatagaaaccaagtat gaggacaacaaaggaagcaatgacaccgcttgtcgccacctcactggggacagaattttc tcagacactctcaggcctgaaacctttgccttcagtttgctttacaactccctgctggct ctggaaaaacaattttcccctctgcgccaaaggcctaatgatattgtccaccagaaaggg ttctccagagaagcactgaagtttacccagcagctaagacgatccggcctaaccctgatg gatgagtttcctttactcagccttaatacccctgaaatcctgtggcaaattttcgacaat gaagccaaagacgtggagagagaagtttgctttattgatattgcctgcgatgaaattcca gagcgctactacaaagaatctgaggatcctaagcacttcaagtcagagaagacaggacgg ggacagttgagggaaggctggagagatagtcatcagcctatcatgtgctcctacaagctg gtgactgtgaagtttgaggtctgggggcttcagaccagagtggaacaatttgtacacaag gtggtccgagacattctgctgattggacatagacaggcttttgcatgggttgatgagtgg tatgacatgacaatggatgaagtccgagaatttgaacgagccactcaggaagccaccaac aagaaaatcggcattttcccacctgcaatttctatctccagcatccccctgctgccttct tccgtccgcagtgcgccttctagtgctccatccacccctctctccacagacgcacccgaa tttctgtccgttcccaaagatcggccccggaaaaagtctgccccagaaactctcacactt ccagaccctgagaaaaaagccaccctgaatttacccggcatgcactcttcagataagcca tgtcggcccaaatctgagtaa >gi568815581f:67278155_67787184|GENSCAN_predicted_peptide_7|546_aa MAALEEEFTLSSVVLSAGPEGLLGVEQSDKTDQFLVTDSGRTVILYKVKAIGLGAPRLPS RPFLSAELELKFLTASERDRGEGLAESRLGLLGTLSEVLRIWNNEDVNLDKVFKATLSAE VYRILSVQGTEPLVLFKEGAVRGLEALLADPQQKIETVISDEEVIKWTKFFVVFRHPVLI FITEKHGNYFAYVQMFNSRILTKYTLLLGQDENSVIKSFTASVDRKFISLMSLKCLSVWN IKFQTLQTSKELPQGTSGQKDSEKHIEVEVRKFLALKQTPDFHTVIGDTVTGLLERCKAE PSFYPRNCLMQLIQTHVLSYSLCPDLMEIALKKKDVQLLQLCLQQFPDIPESVTCACLKI FLSIGDDSLQETDVNMESVFDYSINSVHDEKMEEQTEILQNGFNPEEDKCNNCDQELNKK PQDETKESTSCPVIMDWICLLLDANFTVVVMMPEAKRLLINLYKLVKSQISVYSELNKIE ARTVFCLRPSQCQEKRVRDKNLGPTKHGYKEGCTPSTSGEQLPHVMGSSSRAKPALEPWA RAGQQD >gi568815581f:67278155_67787184|GENSCAN_predicted_CDS_7|1641_bp atggcagcgctggaggaagaattcacgttgtcttcggtagtcctgagcgccgggcctgaa ggactcctaggcgtggagcagagcgacaaaacagaccagtttctagtgacagacagcggc aggacagtcatcctctataaggtgaaggcaataggtttgggagcgccccgactgccttct cgcccctttctgagcgcggagctcgagctcaagtttctcacagcttcagaaagagatcgt ggagaaggcttagcagagagccggctgggcctgttggggacgctgagtgaggttttaaga atatggaataatgaagatgtaaacctggataaagtatttaaagctacattgtcagcagaa gtatataggatactttcagtgcaagggacagaacccttggtgctcttcaaggaaggtgct gttcgtggtttagaggccttgcttgcagacccccagcagaaaattgaaactgttatctct gatgaagaagtgattaaatggacaaagtttttcgtagtattcagacatcctgttttaatt tttattactgaaaaacatggaaattactttgcttacgtgcaaatgtttaactcacgtatc ttaaccaaatatacactcttacttggacaagacgaaaactctgttataaagagttttact gcatctgtagatcggaaattcatctctttgatgtcattaaaatgcctctctgtatggaac ataaaatttcaaacactacagacttcaaaagagttaccacaagggaccagtggtcaaaaa gattcagaaaaacacattgaagtagaagtacggaaatttttggctctgaagcagacacct gactttcatactgtcattggggacacagtaacaggacttctggaaaggtgtaaagcagaa ccatcattttatccccggaactgtctgatgcagcttatccaaacgcatgtgctttcttac agtttgtgccccgacttaatggagattgccttaaaaaagaaagatgtacagttgttacaa ctctgtctacagcagttccctgacattcctgaatcagtcacctgtgcttgcttaaaaatt ttcttgagcattggtgatgacagtcttcaagaaacagatgttaatatggagtcagttttt gactatagtataaattctgtacatgatgagaaaatggaagagcaaactgaaattcttcaa aatggcttcaatcctgaagaagataaatgcaataactgtgatcaagagttaaataaaaag ccccaggacgaaacaaaggagagcacttcatgccctgtgattatggattggatatgtcta cttctggatgcaaattttactgttgttgtaatgatgccagaagcaaagaggctactgata aatctttacaagcttgtaaaatctcagatatctgtttattctgagctcaacaagattgaa gcaagaacggtgttttgtttgcgaccatctcagtgtcaagagaaacgtgtcagggataag aacttgggacccaccaagcacgggtacaaagaaggctgtacaccctccaccagtggagag cagctgcctcatgtgatgggaagcagcagcagggccaagccagccctggagccttgggcc agagcagggcaacaggactga >gi568815581f:67278155_67787184|GENSCAN_predicted_peptide_8|91_aa MLKNAEFKGLDIDSLVIEHIQVNKAPKKGCWTYRAHGWINPCMSSPCHTEIILTEKEQIV PKPEDEVAQKKKISQKKLKKQKLMTQDYIQH >gi568815581f:67278155_67787184|GENSCAN_predicted_CDS_8|276_bp atgcttaaaaatgcagaatttaagggtttagatatagattctctggtcattgagcatatc caagtgaacaaagcacctaagaagggctgctggacctacagagctcatggttggattaac ccatgcatgagctctccctgccacactgagattatccttactgaaaaggaacagattgtt cctaaaccagaagacgaggttgcccagaagaaaaagatatcccagaagaaactgaagaaa caaaaactgatgacacaggactacattcagcattaa