GENSCAN 1.0 Date run: 6-Nov-116 Time: 12:08:51 Sequence gi568815596r:32124476_32352679 : 228204 bp : 41.52% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2483 2547 65 0 2 75 95 40 0.283 0.92 1.02 Intr + 12088 12163 76 0 1 69 82 64 0.856 2.07 1.03 Intr + 12402 12493 92 1 2 92 78 46 0.935 2.79 1.04 Intr + 12634 12713 80 0 2 53 91 88 0.929 3.03 1.05 Intr + 20462 20532 71 2 2 110 92 83 0.915 8.81 1.06 Intr + 22743 22783 41 1 2 92 65 51 0.754 0.22 1.07 Term + 41203 41688 486 0 0 58 38 276 0.686 13.31 1.08 PlyA + 41717 41722 6 1.05 2.00 Prom + 43977 44016 40 -3.65 2.01 Init + 44992 45027 36 0 0 125 41 10 0.830 0.06 2.02 Intr + 46812 46898 87 2 0 58 110 94 0.938 7.85 2.03 Intr + 69337 69508 172 0 1 41 95 64 0.077 0.99 2.04 Intr + 72843 72917 75 1 0 61 116 55 0.823 4.17 2.05 Term + 77080 77240 161 2 2 90 39 180 0.930 10.42 2.06 PlyA + 77420 77425 6 -0.45 3.00 Prom + 77739 77778 40 -8.35 3.01 Init + 78385 78762 378 0 0 79 80 320 0.885 27.25 3.02 Intr + 79079 79413 335 1 2 58 77 230 0.976 12.34 3.03 Intr + 80115 80217 103 0 1 74 110 57 0.979 5.76 3.04 Intr + 82411 82458 48 0 0 99 94 35 0.898 3.16 3.05 Intr + 85018 85086 69 0 0 118 76 35 0.930 3.76 3.06 Term + 95738 96238 501 1 0 79 44 275 0.991 15.69 3.07 PlyA + 99325 99330 6 -0.45 4.09 PlyA - 99654 99649 6 1.05 4.08 Term - 100290 99998 293 1 2 85 36 164 0.438 5.42 4.07 Intr - 111137 110926 212 1 2 109 82 79 0.579 7.03 4.06 Intr - 116650 116527 124 2 1 77 101 63 0.655 5.22 4.05 Intr - 127126 125132 1995 2 0 102 39 1630 0.760 145.87 4.04 Intr - 128204 127944 261 0 0 81 106 164 0.311 14.04 4.03 Intr - 133874 133691 184 2 1 15 -13 327 0.019 13.74 4.02 Intr - 140388 140263 126 0 0 101 84 47 0.125 5.56 4.01 Init - 140907 140905 3 0 0 113 22 0 0.108 -4.05 4.00 Prom - 141629 141590 40 -7.55 5.00 Prom + 141635 141674 40 -8.85 5.01 Init + 142357 142522 166 0 1 51 -17 211 0.012 6.74 5.02 Intr + 151644 151764 121 1 1 57 67 66 0.404 0.03 5.03 Intr + 153261 153759 499 0 1 -11 100 259 0.416 9.26 5.04 Intr + 160517 160655 139 1 1 -2 115 105 0.282 3.32 5.05 Intr + 166008 166161 154 1 1 74 36 156 0.738 7.21 5.06 Intr + 167702 167873 172 2 1 28 110 58 0.611 1.02 5.07 Intr + 168841 168931 91 0 1 45 74 41 0.441 -2.95 5.08 Term + 169027 169121 95 2 2 84 49 117 0.916 4.41 5.09 PlyA + 170862 170867 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:32124476_32352679|GENSCAN_predicted_peptide_1|303_aa XLRAPARGLLLFGPPGNGKTMLVGEGEKLVRALFAVARELQPSIIFIDEVDSLLCERREG EHDASRRLKTEFLIEFDGVQSAGDDRVLVMGATNRPQELDEAVLRMTDGYSGSDLTALAK DAALGPIRELKPEQVKNMSASEVPTNLSESSEGAALAAFRHGVQGQPGRGRKPEARARTS SHAFLTANRPLVPHPFRSGKTRAPRTASGGSCAAPYHGELAVGVRVSAVADSVYLIWSRS DLEIPQPPRGGVIGVEVGPLGRSGCLPKCSPSEFWSLSVPHLFRRRSRSRPRCRQSFQLL ASF >gi568815596r:32124476_32352679|GENSCAN_predicted_CDS_1|912_bp nggcttagagctcctgccagagggctgttactctttggtccacctgggaatgggaagaca atgctggtgggagaaggagagaaattggtgagggctctttttgctgtggctcgagaactt caaccttctataatttttatagatgaagttgatagccttttgtgtgaaagaagagaaggg gagcacgatgctagtagacgcctaaaaactgaatttctaatagaatttgatggtgtacag tctgctggagatgacagagtacttgtaatgggtgcaactaataggccacaagagcttgat gaggctgttctcagaatgactgatggatactcaggaagtgacctaacagctttggcaaaa gatgcagcactgggtcctatccgagaactaaaaccagaacaggtgaagaatatgtctgcc agtgaggtcccgaccaatctctcggagagctcagagggagccgctctagccgccttccgc cacggagtgcagggacagccgggaaggggacggaagccggaggccagagcacgcacttcc tcccacgctttcctcacagccaatcgcccactggtgccccaccccttccgttctgggaaa actcgagcacccagaacggcttccggcgggagctgtgcagctccttatcatggtgagttg gctgttggggtgagggtttcggctgtagctgattcggtttatcttatttggtcccgaagc gaccttgaaatccctcagccacccagaggaggggtcattggagttgaagtcgggcccctt ggcaggagcggctgtctcccaaaatgttccccttcagagttctggtccctttcggtccct cacctgttccgcaggcgctcacgatctaggccccgctgcaggcagagctttcagctcctg gcgagtttctga >gi568815596r:32124476_32352679|GENSCAN_predicted_peptide_2|176_aa MASQEKDIFIGWGTIHLFRKPQRSFFGKLLREFRLVAADRRSLYVLKLHTDNTKNKLKVN VIVVLFRGRLLVGTFVALCFNLFTMLSIRNKPFAYVSEDNLSIFFLKLLVRAGFKSMLQI LVEGGCGHCARREAAVASNAWETPLGVHYGTGSTLGGVREPEEGEAEELYGYHSFG >gi568815596r:32124476_32352679|GENSCAN_predicted_CDS_2|531_bp atggccagccaagaaaaagatatatttattggctgggggacaattcatctctttcgaaaa ccacaaagatccttttttggcaagttgttacgggaatttagacttgtagcagctgaccga aggtccctctacgtattgaaattacatactgataataccaagaacaaattaaaggtgaat gttatcgttgttctttttaggggaagattattagttggtacttttgtggctctttgtttc aacctgttcacgatgctttctattcggaataaaccttttgcttatgtctcagaagataat ttaagtattttcttcttaaagctgctagtacgagctggcttcaagagcatgttgcagatc ttagtcgaaggtggttgtggccactgtgcccggagggaggcagcagtggccagtaatgcc tgggaaactcctctgggggtacattatggaactggaagcacccttggaggagtccgagag ccagaagaaggagaggcagaagagctttatggatatcacagtttcggctaa >gi568815596r:32124476_32352679|GENSCAN_predicted_peptide_3|477_aa MTQKAAASVEHLAIRCHWSQRPAVTGDVLQVYSGSEGTAIIFCETQRSVTEIAMNPHIKQ NAQCLHGDIAQSQREFTLKDFREGSFKVLVATNVAACGLDIPEVDLVIHGSPPQDVESIS IVLDAQGFVTMTLESLEEIQDVSCAWKELNRKLSSNAVSQITRMCLLKGNVRVCFDVPTT KSERLQAEWHDSDWIFSVPAKLPEIEEYYDRNTSSNSRQRSGWSSGQSGRSGGRSGSRNY FAVDTASAIAIALMTFGTMYPMSVYSGKVLLQTTPPHVIGQLDKLIREVSTLDGVLEVRN EHFWTLGFGSLAGSVHVRIRRDANEQMVLAHVTNRLYTLVSTLTVQIFKDDWIRPALLSG PVAANVLNFSDHHVIPMPLLKGTDDLNPVTSTPAKPSSPPPEFSFNTPGKNVNPVILLNT QTRPYGFGLNHGHTPYSSMLNQGLGVPGIGATQGLRTGFTNIPSRYGTNNRIGQPRP >gi568815596r:32124476_32352679|GENSCAN_predicted_CDS_3|1434_bp atgactcaaaaggctgcagcttctgtggaacatttggccatccggtgtcattggtctcag aggccagcagttactggagatgtccttcaagtctacagtgggtctgaagggacggctatt attttctgtgagacccagaggagtgtaactgaaatagccatgaatccacacataaaacag aatgcccagtgtttacatggggacattgcacagtcacaaagagaatttacactaaaagac ttcagagaaggtagttttaaagttttggtggcaaccaacgtggctgcctgtggtttggac attcctgaagttgacctggtgattcatggttctcctcctcaggatgttgagtctatatcc atcgttctggacgcacagggatttgtgaccatgactctggaaagcctagaggaaatacag gatgtcagctgtgcttggaaagaacttaacagaaagctgagtagtaatgcagtgtctcag attaccagaatgtgcctcctgaaaggaaatgtgcgtgtttgctttgatgttcctacaact aagtcagaaaggttacaggcagagtggcatgattccgactggatattctcagtgccagcc aaattacctgaaattgaagaatattatgatagaaacacatcttctaattccagacagagg agtggttggtcaagtggccagtcgggccgatcaggtggtcgatctggcagccgtaattat tttgccgtagacactgcctctgctatagctattgccttgatgacatttggcactatgtat cccatgagtgtgtacagtgggaaagtcttactccagacaacaccaccccatgttattggt cagttggacaaactcatcagagaggtatctaccttagatggagttttagaagtccgaaat gaacatttttggaccctaggttttggctcattggctggatcagtgcatgtaagaattcga cgagatgccaatgaacaaatggttcttgctcatgtgaccaacaggctgtacactctagtg tctactctaactgttcaaattttcaaggatgactggattaggcctgccttattgtctggg cctgttgcagccaatgtcctaaacttttcagatcatcacgtaatcccaatgcctctttta aagggtactgatgatttgaacccagttacatcaactccagctaaacctagtagtccacct ccagaattttcatttaacactcctgggaaaaatgtgaacccagttattcttctaaacaca caaacaaggccttatggttttggtctcaatcatggacacacaccttacagcagcatgctt aatcaaggacttggagttccaggaattggagcaactcaaggattgaggactggttttaca aatataccaagtagatatggaactaataatagaattggacaaccaagaccatga >gi568815596r:32124476_32352679|GENSCAN_predicted_peptide_4|1065_aa MNKKVSGLQELEASLKRKANTKKLYFKNMSWSPKKRAIGLLSQRPLQADTQAAGRREEHI GGRTYKQLDVQRTLKGECWRKSTQQTSARQQAIHQRNDSEFGLEVNFIKDNSRALIQRMG MTVIKQITDDLFVWNVLNREEVNIICCEKVEQDAARGIIHMILKKGSESCNLFLKSLKEW NYPLFQDLNGQSLFHQTSEGDLDDLAQDLKDLYHTPSFLNFYPLGEDIDIIFNLKSTFTE PVLWRKDQHHHRVEQLTLNGLLQALQSPCIIEGESGKGKSTLLQRIAMLWGSGKCKALTK FKFVFFLRLSRAQGGLFETLCDQLLDIPGTIRKQTFMAMLLKLRQRVLFLLDGYNEFKPQ NCPEIEALIKENHRFKNMVIVTTTTECLRHIRQFGALTAEVGDMTEDSAQALIREVLIKE LAEGLLLQIQKSRCLRNLMKTPLFVVITCAIQMGESEFHSHTQTTLFHTFYDLLIQKNKH KHKGVAASDFIRSLDHCGDLALEGVFSHKFDFELQDVSSVNEDVLLTTGLLCKYTAQRFK PKYKFFHKSFQEYTAGRRLSSLLTSHEPEEVTKGNGYLQKMVSISDITSTYSSLLRYTCG SSVEATRAVMKHLAAVYQHGCLLGLSIAKRPLWRQESLQSVKNTTEQEILKAININSFVE CGIHLYQESTSKSALSQEFEAFFQGKSLYINSGNIPDYLFDFFEHLPNCASALDFIKLDF YGGAMASWEKAAEDTGGIHMEEAPETYIPSRAVSLFFNWKQEFRTLEVTLRDFSKLNKQD IRYLGKIFSSATSLRLQIKRCAGVAGSLSLVLSTCKNIYSLMVEASPLTIEDERHITSVT NLKTLSIHDLQNQRLPGGLTDSLGNLKNLTKLIMDNIKMNEEDAIKLGQISVLIFVSIFE TIQSLNCSLFLPVDRMNVLEQLTALMLPWGCDVQGSLSSLLKHLEEVPQLVKLGLKNWRL TDTEIRILGAFFGKNPLKNFQQLNLAGNRVSSDGWLAFMGVFENLKQLVFFDFSTKEFLP DPALVRKLSQVLSKLTFLQEARLVGWQFDDDDLSVITGAFKLVTA >gi568815596r:32124476_32352679|GENSCAN_predicted_CDS_4|3198_bp atgaacaagaaggtatctggtctacaagaactcgaggcctcactgaaacggaaagcaaat acaaagaaactttattttaaaaacatgtcttggtctcccaagaagagggcaattggattg ctcagccagagacccttgcaggcagacacacaagcggctggacgtcgagaggaacacatc ggcggaagaacatacaagcagctggacgtccagaggacgttgaagggagaatgctggcgg aagagcacacaacagacatcggcacgccagcaggccatccaccagaggaacgactcggag tttggcctggaggtgaatttcataaaggacaatagccgagcccttattcaaagaatggga atgactgttataaagcaaatcacagatgacctatttgtatggaatgttctgaatcgcgaa gaagtaaacatcatttgctgcgagaaggtggagcaggatgctgctagagggatcattcac atgattttgaaaaagggttcagagtcctgtaacctctttcttaaatcccttaaggagtgg aactatcctctatttcaggacttgaatggacaaagtctttttcatcagacatcagaagga gacttggacgatttggctcaggatttaaaggacttgtaccataccccatcttttctgaac ttttatccccttggtgaagatattgacattatttttaacttgaaaagcaccttcacagaa cctgtcctgtggaggaaggaccaacaccatcaccgcgtggagcagctgaccctgaatggc ctcctgcaggctcttcagagcccctgcatcattgaaggggaatctggcaaaggcaagtcc actctgctgcagcgaattgccatgctctggggctccggaaagtgcaaggctctgaccaag ttcaaattcgtcttcttcctccgtctcagcagggcccagggtggactttttgaaaccctc tgtgatcaactcctggatatacctggcacaatcaggaagcagacattcatggccatgctg ctgaagctgcggcagagggttcttttccttcttgatggctacaatgaattcaagccccag aactgcccagaaatcgaagccctgataaaggaaaaccaccgcttcaagaacatggtcatc gtcaccactaccactgagtgcctgaggcacatacggcagtttggtgccctgactgctgag gtgggggatatgacagaagacagcgcccaggctctcatccgagaagtgctgatcaaggag cttgctgaaggcttgttgctccaaattcagaaatccaggtgcttgaggaatctcatgaag acccctctctttgtggtcatcacttgtgcaatccagatgggtgaaagtgagttccactct cacacacaaacaacgctgttccataccttctatgatctgttgatacagaaaaacaaacac aaacataaaggtgtggctgcaagtgacttcattcggagcctggaccactgtggagaccta gctctggagggtgtgttctcccacaagtttgatttcgaactgcaggatgtgtccagcgtg aatgaggatgtcctgctgacaactgggctcctctgtaaatatacagctcaaaggttcaag ccaaagtataaattctttcacaagtcattccaggagtacacagcaggacgaagactcagc agtttattgacgtctcatgagccagaggaggtgaccaaggggaatggttacttgcagaaa atggtttccatttcggacattacatccacttatagcagcctgctccggtacacctgtggg tcatctgtggaagccaccagggctgttatgaagcacctcgcagcagtgtatcaacacggc tgccttctcggactttccatcgccaagaggcctctctggagacaggaatctttgcaaagt gtgaaaaacaccactgagcaagaaattctgaaagccataaacatcaattcctttgtagag tgtggcatccatttatatcaagagagtacatccaaatcagccctgagccaagaatttgaa gctttctttcaaggtaaaagcttatatatcaactcagggaacatccccgattacttattt gacttctttgaacatttgcccaattgtgcaagtgccctggacttcattaaactggacttt tatgggggagctatggcttcatgggaaaaggctgcagaagacacaggtggaatccacatg gaagaggccccagaaacctacattcccagcagggctgtatctttgttcttcaactggaag caggaattcaggactctggaggtcacactccgggatttcagcaagttgaataagcaagat atcagatatctggggaaaatattcagctctgccacaagcctcaggctgcaaataaagaga tgtgctggtgtggctggaagcctcagtttggtcctcagcacctgtaagaacatttattct ctcatggtggaagccagtcccctcaccatagaagatgagaggcacatcacatctgtaaca aacctgaaaaccttgagtattcatgacctacagaatcaacggctgccgggtggtctgact gacagcttgggtaacttgaagaaccttacaaagctcataatggataacataaagatgaat gaagaagatgctataaaactaggtcagatttctgttctcatatttgtatcaatttttgag accatccagtccctgaactgctctttgtttcttccagtcgacaggatgaacgtgctagaa cagctcaccgcactgatgctgccctggggctgtgacgtgcaaggcagcctgagcagcctg ttgaaacatttggaggaggtcccacaactcgtcaagcttgggttgaaaaactggagactc acagatacagagattagaattttaggtgcattttttggaaagaaccctctgaaaaacttc cagcagttgaatttggcgggaaatcgtgtgagcagtgatggatggcttgccttcatgggt gtatttgagaatcttaagcaattagtgttttttgactttagtactaaagaatttctacct gatccagcattagtcagaaaacttagccaagtgttatccaagttaacttttctgcaagaa gctaggcttgttgggtggcaatttgatgatgatgatctcagtgttattacaggtgctttt aaactagtaactgcttaa >gi568815596r:32124476_32352679|GENSCAN_predicted_peptide_5|478_aa MVPYYREPEGDLKLTGEVSKTMNAEIRVIRLRAKRCQQPPAAGRGINGFSPKAPRTQTLY GCRVGGRSGLSLPTRPYFRLSHGEKPWTTPGFPRQRNKSTQGGKRGLRFPRITSELPVLG KLPESTTGRPNQNLPPGANGRAHYKEPIPAPLSLLWSCQGAGVVVLGWSPPRRLWWGSLG AAQRPAVPVSGLARSLHVETRRPHRRASVRVAAAAASGSGPSRSLFYRGRLGVAARCSLR ARPRPMPPLTGTSPLSPQQTRKKESPGYSEEYGLEGDKNDWGEGSLYTIIIMQTGVDGLH ELATGGRNDLSGSIASPDVKLNLGGDFIKESTATTFLRQRGYGWLLEVEDDDPEDNKPLL EELDIDLKDIYYKIRCVLMPMPSLGFNRQVVRDNPDFWGPLAVVLFFSMISLYGQFRHLF NKAHLAPPLTHLTLSGHSTCFREHRFGDTATIRFLNLFPTFPPFLFHKTAIVIVARSQ >gi568815596r:32124476_32352679|GENSCAN_predicted_CDS_5|1437_bp atggtcccttattacagagagccagagggagatttgaaactgacaggagaagtcagtaag accatgaatgcagagattcgagtaatacggctacgagccaaaagatgccagcagccacct gcagctggaagaggcataaatggattctcccctaaagctcccaggacacagaccctttac gggtgtcgggttggaggacggtcaggtctttcccttcctacgaggccatatttcagacta tcacatggggagaaaccttggacaacacctggctttcctaggcagaggaacaaaagcacc caaggaggaaaaagaggcctgagatttccacgaattacaagtgaacttcctgtcctgggt aaacttccggaatcaacaacaggaagacccaatcagaatcttcctccgggagccaatggg agagcacattacaaagagccaatcccagcgccgctgtcactgttatggtcctgtcagggt gccggcgtcgtggtgcttgggtggtcgccaccaagaagactttggtggggtagtctcggg gcagctcagcggcccgctgtgcccgtttctggcctcgctcgcagcttgcacgtcgagact cgtaggccgcaccgtagggcgagcgtgcgggtcgccgccgcggccgcctcggggtctggg cccagccgcagcctcttctaccgcggccggttgggagtcgccgcgagatgcagcctccgg gcccgcccccggcctatgcccccactaacggggacttcacctttgtctcctcagcagacg cggaagaaagagtctccggggtacagtgaagagtatggattggaaggagacaagaatgac tggggggagggcagtttgtacaccattataatcatgcagacaggagttgatggtttgcat gagcttgctactggtggtagaaatgatctcagtggttcaatagcatccccagatgtcaaa ttaaatcttggtggagattttatcaaagaatctacagctactacatttctgagacaaaga ggttatggctggcttctggaagttgaagatgatgatcctgaagataacaagccactcttg gaagaattggacattgatctaaaggatatttactacaaaatccgatgtgttttgatgcca atgccatcacttggttttaatagacaagtggtgagagacaatcctgacttttggggtcct ctggctgttgttcttttcttttccatgatatcattatatggacagtttaggcatctgttt aacaaagcacatcttgcaccgcccttaacccatttaaccctgagtggacacagcacatgt ttcagagagcacaggtttggggacacggcaaccatccgatttctcaatcttttccccacc tttcccccctttctgttccacaaaaccgccattgtcatcgtggcccgttctcaatga