GENSCAN 1.0	Date run: 8-Nov-116	Time: 02:04:57

Sequence gi568815597f:45227149_45428261 : 201113 bp : 43.82% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +  15804  15843   40                              -1.16
 1.01 Sngl +  58443  58736  294  2  0   88   54   270 0.984  19.30
 1.02 PlyA +  59322  59327    6                               1.05

 2.00 Prom +  59857  59896   40                              -4.96
 2.01 Sngl +  59950  60852  903  0  0   49   49   247 0.992  13.12
 2.02 PlyA +  61260  61265    6                               1.05

 3.00 Prom +  63079  63118   40                              -2.46
 3.01 Init +  66555  66632   78  2  0   69   72    13 0.611  -1.14
 3.02 Intr +  68915  69013   99  1  0  100   60    87 0.794   7.41
 3.03 Term +  69837  70007  171  2  0  -14   45   136 0.385  -3.07
 3.04 PlyA +  72114  72119    6                               1.05

 4.03 PlyA -  72859  72854    6                               1.05
 4.02 Term -  77955  77858   98  1  2   72   44   123 0.083   4.43
 4.01 Init -  91814  91751   64  2  1   47  111    62 0.419   5.72
 4.00 Prom -  96673  96634   40                              -4.36

 5.00 Prom +  99103  99142   40                              -6.26
 5.01 Sngl + 100001 101116 1116  1  0  106   49  1075 0.999 101.18
 5.02 PlyA + 101336 101341    6                               1.05

 6.11 PlyA - 103192 103187    6                              -0.45
 6.10 Term - 104186 103977  210  2  0   85   42   164 0.942   8.79
 6.09 Intr - 104408 104272  137  0  2  125  107   114 0.982  17.29
 6.08 Intr - 104701 104513  189  2  0   39   82   129 0.989   6.96
 6.07 Intr - 104938 104875   64  1  1  123   77    44 0.998   5.09
 6.06 Intr - 105162 105018  145  2  1   69   63   124 0.996   8.28
 6.05 Intr - 105340 105243   98  1  2   80   62   142 0.984   9.61
 6.04 Intr - 105539 105426  114  2  0   57   58   159 0.999  10.44
 6.03 Intr - 105686 105615   72  2  0   90  116    52 0.995   7.80
 6.02 Intr - 106022 105949   74  0  2   70   62    88 0.917   3.53
 6.01 Init - 107357 107243  115  2  1  100   91    72 0.910   9.08
 6.00 Prom - 110860 110821   40                              -6.46

 7.00 Prom + 111031 111070   40                              -9.75
 7.01 Init + 113105 113156   52  1  1   84  101    79 0.851  10.13
 7.02 Intr + 113925 114195  271  1  1   85  100   189 0.818  16.40
 7.03 Intr + 114325 114421   97  1  1   22   86   113 0.909   4.51
 7.04 Intr + 114801 114959  159  2  0   83  102   178 0.999  18.88
 7.05 Intr + 115236 115495  260  2  2  101   77   318 0.997  28.26
 7.06 Intr + 115695 115854  160  0  1   78   89   162 0.998  15.29
 7.07 Term + 115934 116554  621  1  0   91   44   308 0.992  21.12
 7.08 PlyA + 116790 116795    6                              -0.45

 8.10 PlyA - 116852 116847    6                              -3.24
 8.09 Term - 118410 117692  719  1  2   61   55   402 0.985  27.85
 8.08 Intr - 118846 118729  118  1  1   36   78   124 0.994   6.24
 8.07 Intr - 119631 119545   87  0  0   80   92    63 0.977   6.07
 8.06 Intr - 119914 119831   84  1  0   -1   83   137 0.913   4.32
 8.05 Intr - 120545 120461   85  1  1   63  115    50 0.550   5.02
 8.04 Intr - 120852 120770   83  0  2   49   89    84 0.542   3.04
 8.03 Intr - 128301 128155  147  0  0   79   86    96 0.817   8.93
 8.02 Intr - 158812 158764   49  0  1  116   94    -3 0.001   1.78
 8.01 Intr - 194698 194577  122  1  2  105   63    80 0.144   6.49

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 147452 147583  132  1  0   87   49   108 0.863   4.89
S.002 Term + 188039 188196  158  0  2   48   38   175 0.938   6.60

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:45227149_45428261|GENSCAN_predicted_peptide_1|97_aa
MGKKQNRKTGNSKKQSASPPPKEYSSSPAKEQSWMENDFDELREEGFRRSNYSELKEEVR
TQGKEVKNLLKKLDKWLTRITNAEKSLKDLMELKTMA

>gi568815597f:45227149_45428261|GENSCAN_predicted_CDS_1|294_bp
atggggaaaaaacagaacagaaaaaccggaaactctaaaaagcagagcgcctctcctcct
ccaaaggaatatagctcctcaccagcaaaggaacaaagctggatggagaatgactttgac
gagttgagagaagaaggcttcagacgatcaaactactccgagctaaaggaggaagttcga
acccaaggcaaagaagttaaaaaccttttaaaaaaattagacaaatggctaactagaata
accaatgcagagaagtccttaaaggacctgatggagctgaaaaccatggcatga

>gi568815597f:45227149_45428261|GENSCAN_predicted_peptide_2|300_aa
MGDFYTPLSTSDRSTKQKVNKDIQELNSALHQADLIDIYRILHPKSTEYTFFSAPHHTYS
KIDHIVGSKALLSKCKRTEIITNCLSDHSAIKLELSIKKLTQILSTTWKLNNLLLNDYWV
HNEMKAEIKMFFETNKNKDTTYQNLWDTFKAVCRGKFIALNAHKRKQEKSKIDTLTSQLK
EPEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKISESRSWFFEKINKIDRLLARLIK
KKREKNQIDTIKNDKGDTTTNPTEIQTTIREYYKHLCANKLENLEEMDKFLDIYTLPRLN

>gi568815597f:45227149_45428261|GENSCAN_predicted_CDS_2|903_bp
atgggagacttttacaccccactgtcaacatcagacagatcaacgaaacagaaagttaac
aaggatatccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga
attctccaccccaaatcaacagaatatacattcttctcagcaccacaccacacctattcc
aaaattgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt
ataacaaactgtctctcagaccacagtgcaatcaaactagaactcagcattaagaaactc
actcaaatcctctcaactacatggaaactgaacaacctgctcctgaatgactactgggta
cataatgaaatgaaggcagaaataaagatgttctttgaaaccaacaagaacaaagacaca
acataccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcaggaaaaatctaaaatagacaccctaacatcacaattaaaa
gaaccagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaag
atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcagtgaatcc
aggagctggttttttgaaaagatcaacaaaattgatagactgctagcaagactaataaag
aagaaaagagagaagaatcaaatagacacaataaaaaatgacaaaggggataccaccacc
aatcccacagaaatacaaactaccatcagagaatactataaacacctctgcgcaaataaa
ctagaaaatctagaagaaatggataaattcctcgatatatacaccctcccaagactaaac
tag

>gi568815597f:45227149_45428261|GENSCAN_predicted_peptide_3|115_aa
MERSGCFQEPCKRENWQHLVTGHGYRVLAKQLQQFQDTFRLNNIPGKKRYFCLHHSHLGV
GIEAESMDELTQRVFRGRSTEDLGASVELEIQLNWKATEADTESVTAPAAEGLPV

>gi568815597f:45227149_45428261|GENSCAN_predicted_CDS_3|348_bp
atggaaagaagtggatgctttcaagagccatgtaagagagaaaattggcagcacttggtg
actggacacggctacagggtgttagcaaaacagctgcagcagttccaggacaccttcaga
cttaacaacattcccgggaaaaagagatacttctgtttacatcactctcacctaggggtg
ggaattgaagctgaaagcatggatgagctcacccagagagtgttcagagggagaagcaca
gaggacctaggcgcttcagttgagctggaaatccagctcaactggaaggctacagaggca
gacaccgaatcagttacggctcctgctgcagaagggcttcctgtctga

>gi568815597f:45227149_45428261|GENSCAN_predicted_peptide_4|53_aa
MRKGLTEVMGAVPEALWQYSPVTPLLMMNRMLLNPGVAGALNDFDDERLGMES

>gi568815597f:45227149_45428261|GENSCAN_predicted_CDS_4|162_bp
atgagaaagggcttgactgaagtaatgggggctgtccctgaagccttgtggcagtacagc
ccagtcacccctctgctcatgatgaacaggatgctgctgaaccctggggtcgctggggcc
ttaaatgactttgatgatgagagactgggcatggagagctag

>gi568815597f:45227149_45428261|GENSCAN_predicted_peptide_5|371_aa
MAAPALRLCHIAFHVPAGQPLARNLQRLFGFQPLASREVDGWRQLALRSGDAVFLVNEGA
GSGEPLYGLDPRHAVPSATNLCFDVADAGAATRELAALGCSVPVPPVRVRDAQGAATYAV
VSSPAGILSLTLLERAGYRGPFLPGFRPVSSAPGPGWVSRVDHLTLACTPGSSPTLLRWF
HDCLGFCHLPLSPGEDPELGLEMTAGFGLGGLRLTALQAQPGSIVPTLVLAESLPGATTR
QDQVEQFLARHKGPGLQHVGLYTPNIVEATEGVATAGGQFLAPPGAYYQQPGKERQIRAA
GHEPHLLARQGILLDGDKGKFLLQVFTKSLFTEDTFFLELIQRQGATGFGQGNIRALWQS
VQEQSARSQEA

>gi568815597f:45227149_45428261|GENSCAN_predicted_CDS_5|1116_bp
atggccgcgcccgcccttcgtttgtgccacatcgccttccacgtgcccgccgggcagccc
ctagcccggaacctgcagcgcctcttcggcttccagcccctggcttcgcgggaggtggac
ggctggcggcagctagccctgcgcagcggcgacgcggtctttttggtgaacgagggcgca
gggtctggagagccgctgtacggcctggatccgcgtcacgccgtgcccagcgccacaaac
ctgtgcttcgacgtggcggacgccggcgctgcaacccgggagctggcagcgctgggctgc
agcgtgcctgtccctcccgttcgcgtgcgggacgcgcagggtgccgccacttacgccgtg
gtcagctcgcctgccggcatcctcagcctgaccttgctggagcgcgctggctaccgcgga
cccttcctacccggcttcaggcccgtgtcctctgcgcctggccccgggtgggtcagccgc
gtggaccacctgaccttggcctgcacccccggcagctcccccacacttttgcgctggttc
cacgactgcctgggcttttgccacttgccgctgagcccaggtgaggatcccgagctgggc
ctcgaaatgacagcagggtttgggcttgggggactgaggcttacagccctgcaggcccag
ccgggcagcattgtccccactcttgttctggctgagtcccttccgggggcgacgacacga
caggaccaggtggagcagttcctggcccggcacaaggggccaggcctgcagcacgtgggg
ctgtatacgcctaacattgtggaggccactgagggggtggcaactgctggaggccagttc
ctggctccccctggggcatactaccagcagccaggaaaggagaggcagatccgagctgca
gggcacgagcctcatctgcttgctcgacaggggatcctgctagatggtgataaaggcaag
tttctgcttcaggtcttcaccaagtccctttttactgaggacactttcttcctggagctg
attcagaggcagggggccactggctttggtcagggcaacatcagagctctgtggcagtcc
gtacaggagcaatctgccaggagccaggaagcctaa

>gi568815597f:45227149_45428261|GENSCAN_predicted_peptide_6|405_aa
MRKPRAAVGSGHRKQAASQEGRQKHAKNNSQAKPSACDVWVSEVMLQQTQVATVINYYTG
WMQEVNQLWAGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQLLPGVGRYTAGAIAS
IAFGQATGVVDGNVARVLCRVRAIGADPSSTLVSQQLWGLAQQLVDPARPGDFNQAAMEL
GATVCTPQRPLCSQCPVESLCRARQRVEQEQLLASGSLSGSPDVEECAPNTGQCHLCLPP
SEPWDQTLGVVNFPRKASRKPPREESSATCVLEQPGALGAQILLVQRPNSGLLAGLWEFP
SVTWEPSEQLQRKALLQELQRWAGPLPATHLRHLGEVVHTFSHIKLTYQVYGLALEGQTP
VTTVPPGARWLTQEEFHTAAVSTAMKKALPLLSLLYFLVFPTCST

>gi568815597f:45227149_45428261|GENSCAN_predicted_CDS_6|1218_bp
atgaggaagccacgagcagccgtgggaagtggtcacaggaagcaggcagccagccaggaa
gggaggcagaagcatgctaagaacaacagtcaggccaagccttctgcctgtgatgtgtgg
gtctcagaggtcatgctgcagcagacccaggttgccactgtgatcaactactataccgga
tggatgcaggaggtgaatcaactctgggctggcctgggctactattctcgtggccggcgg
ctgcaggagggagctcggaaggtggtagaggagctagggggccacatgccacgtacagca
gagaccctgcagcagctcctgcctggcgtggggcgctacacagctggggccattgcctct
atcgcctttggccaggcaaccggtgtggtggatggcaacgtagcacgggtgctgtgccgt
gtccgagccattggtgctgatcccagcagcacccttgtttcccagcagctctggggtcta
gcccagcagctggtggacccagcccggccaggagatttcaaccaagcagccatggagcta
ggggccacagtgtgtaccccacagcgcccactgtgcagccagtgccctgtggagagcctg
tgccgggcacgccagagagtggagcaggaacagctcttagcctcagggagcctgtcgggc
agtcctgacgtggaggagtgtgctcccaacactggacagtgccacctgtgcctgcctccc
tcggagccctgggaccagaccctgggagtggtcaacttccccagaaaggccagccgcaag
ccccccagggaggagagctctgccacctgtgttctggaacagcctggggcccttggggcc
caaattctgctggtgcagaggcccaactcaggtctgctggcaggactgtgggagttcccg
tccgtgacctgggagccctcagagcagcttcagcgcaaggccctgctgcaggaactacag
cgttgggctgggcccctcccagccacgcacctccggcaccttggggaggttgtccacacc
ttctctcacatcaagctgacatatcaagtatatgggctggccttggaagggcagacccca
gtgaccaccgtaccaccaggtgctcgctggctgacgcaggaggaatttcacaccgcagct
gtttccaccgccatgaaaaaggcactacctttgttgtctttgttgtacttccttgtgttt
cctacatgttctacatga

>gi568815597f:45227149_45428261|GENSCAN_predicted_peptide_7|539_aa
MAADSDDGAVSAPAASDGGVSKSTTSGEELVVQVPVVDVQSNNFKEMWPSLLLAIKTANF
VAVDTVRVGKQGGQVVVKGLVLEACHNSPFTYPQELSGLGDRKSLLNQCIEERYKAVCHA
ARTRSILSLGLACFKRQPDKGEHSYLAQVFNLTLLCMEEYVIEPKSVQFLIQHGFNFNQQ
YAQGIPYHKGNDKGDESQSQSVRTLFLELIRARRPLVLHNGLIDLVFLYQNFYAHLPESL
GTFTADLCEMFPAGIYDTKYAAEFHARFVASYLEYAFRKCERENGKQRAAGSPHLTLEFC
NYPSSMRDHIDYRCCLPPATHRPHPTSICDNFSAYGWCPLGPQCPQSHDIDLIIDTDEAA
AEDKRRRRRRREKRKRALLNLPGTQTSGEAKDGPPKKQVCGDSIKPEETEQEVAADETRN
LPHSKQGNKNDLEMGIKAARPEIADRATSEVPGSQASPNPVPGDGLHRAGFDAFMTGYVM
AYVEVSQGPQPCSSGPWLPECHNKVYLSGKAVPLTVAKSQFSRSSKAHNQKMKLTWGSS

>gi568815597f:45227149_45428261|GENSCAN_predicted_CDS_7|1620_bp
atggccgccgacagtgacgatggcgcagtttcagctcccgcagcttccgacggtggtgtc
agcaaaagcacaacatctggggaggagctagtagtccaggttcccgtagtggatgtgcaa
agcaacaacttcaaggagatgtggccatccctcctgctagccataaagacagctaatttc
gtggctgtggacacggtgagagttgggaaacaaggagggcaggtggttgtgaaggggctg
gtgctagaggcctgtcacaactctccctttacctacccacaggagctgagtgggcttggg
gacaggaagagtttgctgaaccagtgcattgaggaacgttacaaggccgtgtgtcatgct
gccaggacccgttctatcctttccctgggcctcgcctgcttcaagcggcagccagacaag
ggtgaacattcctatctggctcaagtgttcaatctcactctgctgtgcatggaggagtat
gtcatagaaccaaagtctgtgcagttcctgatacagcatggcttcaacttcaaccagcag
tatgcccaaggcatcccctaccataagggcaatgacaagggtgatgagagccagagccag
tcagtacggaccctattcctggagctaatccgagcccgccggcccctggtgctacacaat
ggccttatagacttggtgttcctgtaccagaacttctatgcacacctccctgagagtctg
ggaaccttcaccgctgacctgtgtgagatgttcccagcaggcatttatgacaccaaatat
gctgctgagtttcatgcccgtttcgtggcctcctacttagaatatgccttccggaaatgt
gaacgggaaaatgggaagcagcgggcagctggcagcccacaccttaccctggagttctgc
aactatccttccagcatgagggaccatattgattaccgctgctgcctgcccccagcaacc
caccgtcctcatcccaccagcatctgtgacaacttctcggcttatggctggtgccccctg
ggaccacagtgtcctcagtctcacgatattgaccttatcattgacactgatgaggctgcg
gcagaggacaagcggcgacggcgacgacgtagggaaaaacggaagagggctttattgaac
ctaccggggacacagacctctggggaagctaaggatggtcctcccaagaagcaggtctgt
ggggatagcatcaagcctgaagaaaccgagcaggaggtggctgccgatgaaactaggaac
ctgcctcactccaagcaaggcaacaaaaatgacttagagatggggattaaggcagcaagg
cctgaaatagctgatagagctacctcagaagtgccagggagccaagccagtcctaaccca
gtgcctggggatggattgcaccgggctggttttgatgcctttatgacaggttatgtgatg
gcctatgtggaagtgagccagggaccgcagccctgcagctctggaccctggctccctgaa
tgccacaataaggtatatttgagtggcaaagctgtacccctcacagtggccaagagccag
ttctctcgttcctccaaagcccacaatcagaagatgaagctcacttggggcagtagctga

>gi568815597f:45227149_45428261|GENSCAN_predicted_peptide_8|497_aa
VRHRASGQVMALKMNTLSSNRANMLKEVQLMNRLSHPNILRFMGVCVHQGQLHALTEYIN
SGNLEQLLDSNLHLPWTVRVKLAYDIAVGLSYLHFKGIFHRDLTSKNCLIKRDENGYSAV
VADFGLAEKIPDVSMGSEKLAVVGSPFWMAPEVLRDEPYNEKADVFSYGIILCEIIARIQ
ADPDYLPRTENFGLDYDAFQHMVGDCPPDFLQLTFNCCNMDPKLRPSFVEIGKTLEEILS
RLQEEEQERDRKLQPTARGLLEKAPGVKRLSSLDDKIPHKSPCPRRTIWLSRSQSDIFSR
KPPRTVSVLDPYYRPRDGAARTPKVNPFSARQDLMGGKIKFFDLPSKSVISLVFDLDAPG
PGTMPLADWQEPLAPPIRRWRSLPGSPEFLHQEACPFVGREESLSDGPPPRLSSLKYRVK
EIPPFRASALPAAQAHEAMDCSILQEENGFGSRPQGTSPCPAGASEEMEVEERPAGSTPA
TFSTSGIGLQTQGKQDG

>gi568815597f:45227149_45428261|GENSCAN_predicted_CDS_8|1494_bp
gtacgacaccgagcttctggtcaggtgatggctcttaagatgaacacattgagcagtaac
cgggcaaacatgctgaaagaagtacagctcatgaatagactctcccatcccaacatcctt
aggttcatgggtgtatgtgttcatcaaggacaattgcatgcacttacagagtatatcaac
tccgggaacctggaacagttgctagacagtaacctgcatttgccttggactgtgagggta
aaactggcctatgacatagcagtgggcctcagctaccttcacttcaaaggcatttttcat
cgggacctcacatctaagaactgcctgataaagagggatgagaatggttactctgcagtg
gtagctgactttggcctggctgagaagatccccgatgtcagcatggggagtgagaagctg
gccgtggtgggttccccattctggatggcacctgaggttctccgagatgagccctataat
gaaaaggcagatgtgttctcttatggtatcatcctctgcgagatcatcgcccgcatccag
gccgatccggactatcttccccgcacagagaatttcgggctggactatgatgctttccag
cacatggtgggagactgtcccccagattttctgcaacttactttcaactgctgtaacatg
gatcccaaactgcgcccatcttttgtggagattgggaagaccctggaggaaattctgagc
cgcctacaggaagaagagcaggagagggataggaagctgcagcccacagccaggggactc
ttggagaaagcacctggggtgaagcgactaagctcactggatgacaagatcccccacaag
tcaccatgcccaagacgtaccatctggctgtctcgaagccagtcagatatcttttcccgt
aagcccccacgtacagtgagtgtcttggacccatactaccggccacgagatggtgctgcc
cgcacccccaaagtcaacccttttagtgctcgccaggacctcatggggggcaagatcaag
ttttttgacctgcccagcaagtctgtcatctctctggtatttgacctggatgcaccaggg
cccggaactatgcccctggctgactggcaggagcccctggccccacctattcgccggtgg
cgttccttgcctggttcgcctgagttcttgcatcaagaggcttgtccatttgtgggccgg
gaagaatcgctatctgatgggcccccaccacgcctaagtagtctcaagtacagagttaaa
gagatcccaccattccgggcatctgccctaccagctgctcaagcccatgaggctatggac
tgctccattctccaggaagaaaatggttttgggtccaggccccaggggaccagtccatgc
cctgcgggtgcttctgaggagatggaggtagaagaaaggccagcaggctcaactccagcc
accttctccacctcaggcataggcctgcaaacccagggaaagcaggatgggtga