GENSCAN 1.0    Date run: 3-Nov-116    Time: 22:11:12

Sequence gi568815596f:32178156_32405623 : 227468 bp : 41.60% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  14181  14261   81  0  0   87   83    30 0.057   0.23
 1.02 Intr +  15734  15828   95  2  2   91   95    28 0.106   2.49
 1.03 Intr +  19163  19237   75  0  0   61  116    55 0.834   4.17
 1.04 Term +  23400  23560  161  1  2   90   39   180 0.931  10.42
 1.05 PlyA +  23740  23745    6                               -0.45

 2.00 Prom +  24059  24098   40                               -8.35
 2.01 Init +  24705  25082  378  2  0   79   80   320 0.885  27.25
 2.02 Intr +  25399  25733  335  0  2   58   77   230 0.976  12.34
 2.03 Intr +  26435  26537  103  2  1   74  110    57 0.979   5.76
 2.04 Intr +  28731  28778   48  2  0   99   94    35 0.898   3.16
 2.05 Intr +  31338  31406   69  2  0  118   76    35 0.930   3.76
 2.06 Term +  42058  42558  501  0  0   79   44   275 0.991  15.69
 2.07 PlyA +  45645  45650    6                               -0.45

 3.09 PlyA -  45974  45969    6                                1.05
 3.08 Term -  46610  46318  293  0  2   85   36   164 0.438   5.42
 3.07 Intr -  57457  57246  212  0  2  109   82    79 0.579   7.03
 3.06 Intr -  62970  62847  124  1  1   77  101    63 0.655   5.22
 3.05 Intr -  73446  71452 1995  1  0  102   39  1630 0.760 145.87
 3.04 Intr -  74524  74264  261  2  0   81  106   164 0.311  14.04
 3.03 Intr -  80194  80011  184  1  1   15  -13   327 0.019  13.74
 3.02 Intr -  86708  86583  126  2  0  101   84    47 0.125   5.56
 3.01 Init -  87227  87225    3  2  0  113   22     0 0.108  -4.05
 3.00 Prom -  87949  87910   40                               -7.55

 4.00 Prom +  87955  87994   40                               -8.85
 4.01 Init +  88677  88842  166  2  1   51  -17   211 0.012   6.74
 4.02 Intr +  97964  98084  121  0  1   57   67    66 0.404   0.03
 4.03 Intr +  99581 100079  499  2  1  -11  100   259 0.416   9.26
 4.04 Intr + 106837 106975  139  0  1   -2  115   105 0.282   3.32
 4.05 Intr + 112328 112481  154  0  1   74   36   156 0.738   7.21
 4.06 Intr + 114022 114193  172  1  1   28  110    58 0.611   1.02
 4.07 Intr + 115161 115251   91  2  1   45   74    41 0.441  -2.95
 4.08 Term + 115347 115441   95  1  2   84   49   117 0.916   4.41
 4.09 PlyA + 117182 117187    6                                1.05

 5.00 Prom + 154105 154144   40                               -3.25
 5.01 Init + 179091 179331  241  2  1   91   89   466 0.993  43.08
 5.02 Intr + 180123 180146   24  1  0  112   93    13 0.208   1.28
 5.03 Intr + 199433 199614  182  0  2   90  107   190 0.999  19.77
 5.04 Intr + 201998 202135  138  1  0   82  116    61 0.994   8.04
 5.05 Intr + 210595 210788  194  0  2   75   80   132 0.524   8.47
 5.06 Intr + 223008 223230  223  0  1   28   95   101 0.248   2.01
 5.07 Intr + 223308 223472  165  2  0   51   73   108 0.192   4.84

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:32178156_32405623|GENSCAN_predicted_peptide_1|137_aa
XFERLEVLAVFASTVLAQLGALFILKERGRLLVGTFVALCFNLFTMLSIRNKPFAYVSED
NLSIFFLKLLVRAGFKSMLQILVEGGCGHCARREAAVASNAWETPLGVHYGTGSTLGGVR
EPEEGEAEELYGYHSFG
>gi568815596f:32178156_32405623|GENSCAN_predicted_CDS_1|414_bp
nngtttgaaagattagaagtcctggctgtatttgcctccacagtcttggcacagttggga
gctctctttatattaaaagaaaggggaagattattagttggtacttttgtggctctttgt
ttcaacctgttcacgatgctttctattcggaataaaccttttgcttatgtctcagaagat
aatttaagtattttcttcttaaagctgctagtacgagctggcttcaagagcatgttgcag
atcttagtcgaaggtggttgtggccactgtgcccggagggaggcagcagtggccagtaat
gcctgggaaactcctctgggggtacattatggaactggaagcacccttggaggagtccga
gagccagaagaaggagaggcagaagagctttatggatatcacagtttcggctaa
>gi568815596f:32178156_32405623|GENSCAN_predicted_peptide_2|477_aa
MTQKAAASVEHLAIRCHWSQRPAVTGDVLQVYSGSEGTAIIFCETQRSVTEIAMNPHIKQ
NAQCLHGDIAQSQREFTLKDFREGSFKVLVATNVAACGLDIPEVDLVIHGSPPQDVESIS
IVLDAQGFVTMTLESLEEIQDVSCAWKELNRKLSSNAVSQITRMCLLKGNVRVCFDVPTT
KSERLQAEWHDSDWIFSVPAKLPEIEEYYDRNTSSNSRQRSGWSSGQSGRSGGRSGSRNY
FAVDTASAIAIALMTFGTMYPMSVYSGKVLLQTTPPHVIGQLDKLIREVSTLDGVLEVRN
EHFWTLGFGSLAGSVHVRIRRDANEQMVLAHVTNRLYTLVSTLTVQIFKDDWIRPALLSG
PVAANVLNFSDHHVIPMPLLKGTDDLNPVTSTPAKPSSPPPEFSFNTPGKNVNPVILLNT
QTRPYGFGLNHGHTPYSSMLNQGLGVPGIGATQGLRTGFTNIPSRYGTNNRIGQPRP
>gi568815596f:32178156_32405623|GENSCAN_predicted_CDS_2|1434_bp
atgactcaaaaggctgcagcttctgtggaacatttggccatccggtgtcattggtctcag
aggccagcagttactggagatgtccttcaagtctacagtgggtctgaagggacggctatt
attttctgtgagacccagaggagtgtaactgaaatagccatgaatccacacataaaacag
aatgcccagtgtttacatggggacattgcacagtcacaaagagaatttacactaaaagac
ttcagagaaggtagttttaaagttttggtggcaaccaacgtggctgcctgtggtttggac
attcctgaagttgacctggtgattcatggttctcctcctcaggatgttgagtctatatcc
atcgttctggacgcacagggatttgtgaccatgactctggaaagcctagaggaaatacag
gatgtcagctgtgcttggaaagaacttaacagaaagctgagtagtaatgcagtgtctcag
attaccagaatgtgcctcctgaaaggaaatgtgcgtgtttgctttgatgttcctacaact
aagtcagaaaggttacaggcagagtggcatgattccgactggatattctcagtgccagcc
aaattacctgaaattgaagaatattatgatagaaacacatcttctaattccagacagagg
agtggttggtcaagtggccagtcgggccgatcaggtggtcgatctggcagccgtaattat
tttgccgtagacactgcctctgctatagctattgccttgatgacatttggcactatgtat
cccatgagtgtgtacagtgggaaagtcttactccagacaacaccaccccatgttattggt
cagttggacaaactcatcagagaggtatctaccttagatggagttttagaagtccgaaat
gaacatttttggaccctaggttttggctcattggctggatcagtgcatgtaagaattcga
cgagatgccaatgaacaaatggttcttgctcatgtgaccaacaggctgtacactctagtg
tctactctaactgttcaaattttcaaggatgactggattaggcctgccttattgtctggg
cctgttgcagccaatgtcctaaacttttcagatcatcacgtaatcccaatgcctctttta
aagggtactgatgatttgaacccagttacatcaactccagctaaacctagtagtccacct
ccagaattttcatttaacactcctgggaaaaatgtgaacccagttattcttctaaacaca
caaacaaggccttatggttttggtctcaatcatggacacacaccttacagcagcatgctt
aatcaaggacttggagttccaggaattggagcaactcaaggattgaggactggttttaca
aatataccaagtagatatggaactaataatagaattggacaaccaagaccatga
>gi568815596f:32178156_32405623|GENSCAN_predicted_peptide_3|1065_aa
MNKKVSGLQELEASLKRKANTKKLYFKNMSWSPKKRAIGLLSQRPLQADTQAAGRREEHI
GGRTYKQLDVQRTLKGECWRKSTQQTSARQQAIHQRNDSEFGLEVNFIKDNSRALIQRMG
MTVIKQITDDLFVWNVLNREEVNIICCEKVEQDAARGIIHMILKKGSESCNLFLKSLKEW
NYPLFQDLNGQSLFHQTSEGDLDDLAQDLKDLYHTPSFLNFYPLGEDIDIIFNLKSTFTE
PVLWRKDQHHHRVEQLTLNGLLQALQSPCIIEGESGKGKSTLLQRIAMLWGSGKCKALTK
FKFVFFLRLSRAQGGLFETLCDQLLDIPGTIRKQTFMAMLLKLRQRVLFLLDGYNEFKPQ
NCPEIEALIKENHRFKNMVIVTTTTECLRHIRQFGALTAEVGDMTEDSAQALIREVLIKE
LAEGLLLQIQKSRCLRNLMKTPLFVVITCAIQMGESEFHSHTQTTLFHTFYDLLIQKNKH
KHKGVAASDFIRSLDHCGDLALEGVFSHKFDFELQDVSSVNEDVLLTTGLLCKYTAQRFK
PKYKFFHKSFQEYTAGRRLSSLLTSHEPEEVTKGNGYLQKMVSISDITSTYSSLLRYTCG
SSVEATRAVMKHLAAVYQHGCLLGLSIAKRPLWRQESLQSVKNTTEQEILKAININSFVE
CGIHLYQESTSKSALSQEFEAFFQGKSLYINSGNIPDYLFDFFEHLPNCASALDFIKLDF
YGGAMASWEKAAEDTGGIHMEEAPETYIPSRAVSLFFNWKQEFRTLEVTLRDFSKLNKQD
IRYLGKIFSSATSLRLQIKRCAGVAGSLSLVLSTCKNIYSLMVEASPLTIEDERHITSVT
NLKTLSIHDLQNQRLPGGLTDSLGNLKNLTKLIMDNIKMNEEDAIKLGQISVLIFVSIFE
TIQSLNCSLFLPVDRMNVLEQLTALMLPWGCDVQGSLSSLLKHLEEVPQLVKLGLKNWRL
TDTEIRILGAFFGKNPLKNFQQLNLAGNRVSSDGWLAFMGVFENLKQLVFFDFSTKEFLP
DPALVRKLSQVLSKLTFLQEARLVGWQFDDDDLSVITGAFKLVTA
>gi568815596f:32178156_32405623|GENSCAN_predicted_CDS_3|3198_bp
atgaacaagaaggtatctggtctacaagaactcgaggcctcactgaaacggaaagcaaat
acaaagaaactttattttaaaaacatgtcttggtctcccaagaagagggcaattggattg
ctcagccagagacccttgcaggcagacacacaagcggctggacgtcgagaggaacacatc
ggcggaagaacatacaagcagctggacgtccagaggacgttgaagggagaatgctggcgg
aagagcacacaacagacatcggcacgccagcaggccatccaccagaggaacgactcggag
tttggcctggaggtgaatttcataaaggacaatagccgagcccttattcaaagaatggga
atgactgttataaagcaaatcacagatgacctatttgtatggaatgttctgaatcgcgaa
gaagtaaacatcatttgctgcgagaaggtggagcaggatgctgctagagggatcattcac
atgattttgaaaaagggttcagagtcctgtaacctctttcttaaatcccttaaggagtgg
aactatcctctatttcaggacttgaatggacaaagtctttttcatcagacatcagaagga
gacttggacgatttggctcaggatttaaaggacttgtaccataccccatcttttctgaac
ttttatccccttggtgaagatattgacattatttttaacttgaaaagcaccttcacagaa
cctgtcctgtggaggaaggaccaacaccatcaccgcgtggagcagctgaccctgaatggc
ctcctgcaggctcttcagagcccctgcatcattgaaggggaatctggcaaaggcaagtcc
actctgctgcagcgaattgccatgctctggggctccggaaagtgcaaggctctgaccaag
ttcaaattcgtcttcttcctccgtctcagcagggcccagggtggactttttgaaaccctc
tgtgatcaactcctggatatacctggcacaatcaggaagcagacattcatggccatgctg
ctgaagctgcggcagagggttcttttccttcttgatggctacaatgaattcaagccccag
aactgcccagaaatcgaagccctgataaaggaaaaccaccgcttcaagaacatggtcatc
gtcaccactaccactgagtgcctgaggcacatacggcagtttggtgccctgactgctgag
gtgggggatatgacagaagacagcgcccaggctctcatccgagaagtgctgatcaaggag
cttgctgaaggcttgttgctccaaattcagaaatccaggtgcttgaggaatctcatgaag
acccctctctttgtggtcatcacttgtgcaatccagatgggtgaaagtgagttccactct
cacacacaaacaacgctgttccataccttctatgatctgttgatacagaaaaacaaacac
aaacataaaggtgtggctgcaagtgacttcattcggagcctggaccactgtggagaccta
gctctggagggtgtgttctcccacaagtttgatttcgaactgcaggatgtgtccagcgtg
aatgaggatgtcctgctgacaactgggctcctctgtaaatatacagctcaaaggttcaag
ccaaagtataaattctttcacaagtcattccaggagtacacagcaggacgaagactcagc
agtttattgacgtctcatgagccagaggaggtgaccaaggggaatggttacttgcagaaa
atggtttccatttcggacattacatccacttatagcagcctgctccggtacacctgtggg
tcatctgtggaagccaccagggctgttatgaagcacctcgcagcagtgtatcaacacggc
tgccttctcggactttccatcgccaagaggcctctctggagacaggaatctttgcaaagt
gtgaaaaacaccactgagcaagaaattctgaaagccataaacatcaattcctttgtagag
tgtggcatccatttatatcaagagagtacatccaaatcagccctgagccaagaatttgaa
gctttctttcaaggtaaaagcttatatatcaactcagggaacatccccgattacttattt
gacttctttgaacatttgcccaattgtgcaagtgccctggacttcattaaactggacttt
tatgggggagctatggcttcatgggaaaaggctgcagaagacacaggtggaatccacatg
gaagaggccccagaaacctacattcccagcagggctgtatctttgttcttcaactggaag
caggaattcaggactctggaggtcacactccgggatttcagcaagttgaataagcaagat
atcagatatctggggaaaatattcagctctgccacaagcctcaggctgcaaataaagaga
tgtgctggtgtggctggaagcctcagtttggtcctcagcacctgtaagaacatttattct
ctcatggtggaagccagtcccctcaccatagaagatgagaggcacatcacatctgtaaca
aacctgaaaaccttgagtattcatgacctacagaatcaacggctgccgggtggtctgact
gacagcttgggtaacttgaagaaccttacaaagctcataatggataacataaagatgaat
gaagaagatgctataaaactaggtcagatttctgttctcatatttgtatcaatttttgag
accatccagtccctgaactgctctttgtttcttccagtcgacaggatgaacgtgctagaa
cagctcaccgcactgatgctgccctggggctgtgacgtgcaaggcagcctgagcagcctg
ttgaaacatttggaggaggtcccacaactcgtcaagcttgggttgaaaaactggagactc
acagatacagagattagaattttaggtgcattttttggaaagaaccctctgaaaaacttc
cagcagttgaatttggcgggaaatcgtgtgagcagtgatggatggcttgccttcatgggt
gtatttgagaatcttaagcaattagtgttttttgactttagtactaaagaatttctacct
gatccagcattagtcagaaaacttagccaagtgttatccaagttaacttttctgcaagaa
gctaggcttgttgggtggcaatttgatgatgatgatctcagtgttattacaggtgctttt
aaactagtaactgcttaa
>gi568815596f:32178156_32405623|GENSCAN_predicted_peptide_4|478_aa
MVPYYREPEGDLKLTGEVSKTMNAEIRVIRLRAKRCQQPPAAGRGINGFSPKAPRTQTLY
GCRVGGRSGLSLPTRPYFRLSHGEKPWTTPGFPRQRNKSTQGGKRGLRFPRITSELPVLG
KLPESTTGRPNQNLPPGANGRAHYKEPIPAPLSLLWSCQGAGVVVLGWSPPRRLWWGSLG
AAQRPAVPVSGLARSLHVETRRPHRRASVRVAAAAASGSGPSRSLFYRGRLGVAARCSLR
ARPRPMPPLTGTSPLSPQQTRKKESPGYSEEYGLEGDKNDWGEGSLYTIIIMQTGVDGLH
ELATGGRNDLSGSIASPDVKLNLGGDFIKESTATTFLRQRGYGWLLEVEDDDPEDNKPLL
EELDIDLKDIYYKIRCVLMPMPSLGFNRQVVRDNPDFWGPLAVVLFFSMISLYGQFRHLF
NKAHLAPPLTHLTLSGHSTCFREHRFGDTATIRFLNLFPTFPPFLFHKTAIVIVARSQ
>gi568815596f:32178156_32405623|GENSCAN_predicted_CDS_4|1437_bp
atggtcccttattacagagagccagagggagatttgaaactgacaggagaagtcagtaag
accatgaatgcagagattcgagtaatacggctacgagccaaaagatgccagcagccacct
gcagctggaagaggcataaatggattctcccctaaagctcccaggacacagaccctttac
gggtgtcgggttggaggacggtcaggtctttcccttcctacgaggccatatttcagacta
tcacatggggagaaaccttggacaacacctggctttcctaggcagaggaacaaaagcacc
caaggaggaaaaagaggcctgagatttccacgaattacaagtgaacttcctgtcctgggt
aaacttccggaatcaacaacaggaagacccaatcagaatcttcctccgggagccaatggg
agagcacattacaaagagccaatcccagcgccgctgtcactgttatggtcctgtcagggt
gccggcgtcgtggtgcttgggtggtcgccaccaagaagactttggtggggtagtctcggg
gcagctcagcggcccgctgtgcccgtttctggcctcgctcgcagcttgcacgtcgagact
cgtaggccgcaccgtagggcgagcgtgcgggtcgccgccgcggccgcctcggggtctggg
cccagccgcagcctcttctaccgcggccggttgggagtcgccgcgagatgcagcctccgg
gcccgcccccggcctatgcccccactaacggggacttcacctttgtctcctcagcagacg
cggaagaaagagtctccggggtacagtgaagagtatggattggaaggagacaagaatgac
tggggggagggcagtttgtacaccattataatcatgcagacaggagttgatggtttgcat
gagcttgctactggtggtagaaatgatctcagtggttcaatagcatccccagatgtcaaa
ttaaatcttggtggagattttatcaaagaatctacagctactacatttctgagacaaaga
ggttatggctggcttctggaagttgaagatgatgatcctgaagataacaagccactcttg
gaagaattggacattgatctaaaggatatttactacaaaatccgatgtgttttgatgcca
atgccatcacttggttttaatagacaagtggtgagagacaatcctgacttttggggtcct
ctggctgttgttcttttcttttccatgatatcattatatggacagtttaggcatctgttt
aacaaagcacatcttgcaccgcccttaacccatttaaccctgagtggacacagcacatgt
ttcagagagcacaggtttggggacacggcaaccatccgatttctcaatcttttccccacc
tttcccccctttctgttccacaaaaccgccattgtcatcgtggcccgttctcaatga
>gi568815596f:32178156_32405623|GENSCAN_predicted_peptide_5|389_aa
MAAAAAAASGPGCSSAAGAGAAGVSEWLVLRDGCMHCDADGLHSLSYHPALNAILAVTSR
GTIKVIDGTSGATLQASALSASRKLGFEAKPGGQVKCQYISAVDKVIFVDDYAVGCRKDL
NGILLLDTALQTPVSKQDDVVQLELPVTEAQQLLSACLEKVDISSTEGYDLFITQLKDGL
KNTSHETAANHKVAKWATVTFHLPHHVLKSIASAIVNELKKINQNVAALPVASSVMDRLS
YLLPSARPELGVGPGRSVDRSEHERHSPNCPFVKGEHTQNVPLSVTLATSPAQFPCTDGT
DRISCFGSGSCPHFLAAATKRGKICIWDVSKLMKVHLKFEINAYDPAIVQQLILSGDPSS
GVDSRRPTLAWLEDSSSCSDIPKLEGDRY
>gi568815596f:32178156_32405623|GENSCAN_predicted_CDS_5|1167_bp
atggcggctgcggctgcggcggcctcgggccccggctgctcctcggcggcgggggcgggg
gcggccggggtctcagagtggctggtgctgcgggacggctgcatgcactgcgacgccgac
gggctgcacagcctgtcctaccaccctgcgctcaacgccatcctggccgtcactagccgc
gggaccatcaaagtcatcgacggcacctcgggggccacactgcaggcctccgcgctcagt
gcgagccgaaaattaggatttgaagctaaaccaggtggacaggtgaaatgtcagtatatc
tctgctgtggataaagttatatttgtggatgattatgcagtagggtgtaggaaggacctt
aatggaatcttgttgttagacactgctctgcaaactccagtttcaaagcaggatgatgtg
gttcagcttgaattacccgttacagaggcacagcagctcttatcagcatgtttagaaaag
gtagatatttctagtacagagggttatgatttgttcatcacacagctcaaagatggttta
aaaaatacatctcatgagactgcagcaaaccacaaagttgctaagtgggccacagttaca
tttcatcttcctcatcatgtgttgaagtccattgccagtgccattgtaaatgaactcaag
aaaataaatcaaaatgttgctgccttacctgtggcgtcctcagtgatggacagattgtct
tacctcttacctagtgcacgtccagaactcggagtggggccaggccgttctgtagacagg
tctgaacacgaaagacattccccaaactgcccatttgtgaaaggtgagcacacacagaat
gtgccattgtcagtcactcttgcaacaagtcctgcacagtttccttgtacggatggaact
gacagaatatcttgctttgggtcggggagctgccctcattttctagctgctgcaactaaa
cgaggaaagatctgcatatgggatgtttccaaacttatgaaggtgcacttaaagtttgaa
attaatgcctatgatccagcaattgtacaacagcttattctatcaggagacccaagctca
ggagttgattcaaggagaccaactttggcgtggctggaggactcctctagttgctcagat
ataccaaaattggaaggagataggtat