GENSCAN 1.0 Date run: 5-Nov-116 Time: 21:10:17 Sequence gi568815588f:75050529_75276251 : 225723 bp : 46.28% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2564 2622 59 1 2 109 72 32 0.517 4.49 1.02 Term + 3280 3301 22 1 1 133 43 3 0.504 -1.92 1.03 PlyA + 3927 3932 6 1.05 2.08 PlyA - 4153 4148 6 1.05 2.07 Term - 7472 7424 49 1 1 106 48 26 0.607 -2.82 2.06 Intr - 8020 7787 234 0 0 96 80 365 0.549 33.30 2.05 Intr - 8505 8320 186 2 0 16 81 104 0.402 1.30 2.04 Intr - 10309 10156 154 2 1 53 41 88 0.017 -0.27 2.03 Intr - 21421 21338 84 2 0 50 51 79 0.002 0.19 2.02 Intr - 31211 31098 114 0 0 104 92 6 0.393 3.02 2.01 Init - 36458 36311 148 2 1 38 32 151 0.482 4.75 2.00 Prom - 36755 36716 40 -3.66 3.10 PlyA - 37450 37445 6 1.05 3.09 Term - 39159 38668 492 0 0 -9 35 387 0.364 18.31 3.08 Intr - 44350 44190 161 2 2 85 23 179 0.567 10.81 3.07 Intr - 45267 45047 221 2 2 131 50 407 0.979 39.15 3.06 Intr - 47357 47184 174 1 0 99 66 110 0.978 8.95 3.05 Intr - 48630 48503 128 0 2 4 74 148 0.361 4.48 3.04 Intr - 53569 53366 204 1 0 98 75 113 0.002 10.40 3.03 Intr - 55367 55234 134 0 2 36 56 179 0.002 9.86 3.02 Intr - 57681 57464 218 2 2 107 41 387 0.995 34.05 3.01 Init - 58629 58481 149 0 2 62 66 106 0.744 5.52 3.00 Prom - 60473 60434 40 -9.75 4.00 Prom + 60593 60632 40 -11.14 4.01 Init + 61021 61161 141 0 0 86 90 181 0.491 16.24 4.02 Intr + 99986 100578 593 1 2 106 111 280 0.346 23.50 4.03 Intr + 114117 114212 96 0 0 93 82 14 0.049 0.42 4.04 Intr + 118013 118130 118 2 1 88 63 -1 0.119 -2.13 4.05 Intr + 125538 125688 151 2 1 67 101 114 0.709 10.34 4.06 Intr + 152590 152697 108 2 0 67 64 58 0.019 1.56 4.07 Intr + 160064 160188 125 0 2 15 47 127 0.523 1.60 4.08 Intr + 160321 160410 90 0 0 86 86 113 0.852 11.09 4.09 Intr + 161254 161298 45 0 0 84 65 43 0.323 0.21 4.10 Intr + 168535 168687 153 0 0 42 92 111 0.854 7.17 4.11 Intr + 168776 168828 53 1 2 31 98 34 0.917 -3.59 4.12 Intr + 170215 170442 228 1 0 105 75 193 0.903 16.68 4.13 Intr + 172778 172865 88 2 1 64 101 17 0.539 0.57 4.14 Intr + 179116 179173 58 0 1 69 119 41 0.560 3.76 4.15 Term + 180370 180461 92 2 2 103 36 47 0.480 -1.12 4.16 PlyA + 180901 180906 6 1.05 5.08 PlyA - 182006 182001 6 1.05 5.07 Term - 183697 183545 153 1 0 110 49 185 0.985 14.72 5.06 Intr - 184215 184082 134 1 2 65 61 403 0.999 35.66 5.05 Intr - 184464 184410 55 0 1 106 80 88 0.975 8.35 5.04 Intr - 184650 184532 119 1 2 77 99 228 0.999 22.98 5.03 Intr - 184844 184739 106 2 1 90 45 186 0.999 14.29 5.02 Intr - 185215 185088 128 2 2 92 53 125 0.963 9.80 5.01 Init - 185400 185307 94 0 1 98 80 222 0.999 20.94 5.00 Prom - 206423 206384 40 -5.36 6.03 PlyA - 208754 208749 6 1.05 6.02 Term - 212283 212170 114 0 0 58 46 112 0.473 2.57 6.01 Intr - 219373 219276 98 2 2 93 102 19 0.671 3.53 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:75050529_75276251|GENSCAN_predicted_peptide_1|26_aa MDCIVSEKESLLWPATKMLGSKQGVQ >gi568815588f:75050529_75276251|GENSCAN_predicted_CDS_1|81_bp atggactgcattgtgagtgagaaagaatcattgctgtggccggccaccaagatgttgggc tccaagcaaggtgtccagtaa >gi568815588f:75050529_75276251|GENSCAN_predicted_peptide_2|322_aa MLVKAQKGRLQDHRDGYGDHNSGILQWEEEIGFNSEYGMDKWDFIDKERAALVFERIRAQ ASCCNRDSRIQWLKQTAAYFSLMSLYRGLPVPVGVIFCRRAFDNQPLVSSAYVFLTSRLI DIDLFGEMIPAMVVCSSIRRHMMSDHLFLFIRGWQFMLSLWMQQLIRYGLGDVLFPLADG SLQTLRGPQAWNVRNLLGLPILGPKWPKPVGFSQIKEKVGTKGVKSKESCRKERKSLGSK MTSGEVKTSLKNAYSSAKRLSPKMEEEGEEEDYCTPGAFELERLFWKGSPQYTHVNEVWP KLYIGDECCRVPDCRQDPSHQL >gi568815588f:75050529_75276251|GENSCAN_predicted_CDS_2|969_bp atgcttgttaaagcacagaaaggaagacttcaggaccaccgagatgggtatggggaccac aacagtgggattttgcagtgggaagaagagattgggttcaactctgaatacggcatggac aagtgggactttatagacaaggagagggctgcattggtctttgaacgcataagggcacag gccagttgttgtaacagagactccagaatacagtggcttaaacagacagctgcttatttc tccctcatgtcactgtacagaggcctgccagtgcctgtcggcgtgatcttctgccggcgc gcttttgacaaccagccgcttgtgtcttctgcctatgtgttcctcacaagtcggctgata gatatagacctctttggtgagatgattcccgccatggtggtgtgctcttctattcgaagg catatgatgtctgaccatctcttcttattcatacgtggctggcagtttatgctcagtctc tggatgcagcaactcatcaggtatggccttggcgatgtactctttccactggctgacggc tccctgcagacactgagagggccacaggcctggaatgtcaggaatcttctgggccttcct attttgggtcccaaatggcccaaaccagtaggtttcagtcagataaaggaaaaagtagga accaaaggagtaaagtctaaagaaagctgcagaaaggagagaaaatcccttggctctaaa atgacatctggagaagtgaagacaagcctcaagaatgcctactcatctgccaagaggctg tcgccgaagatggaggaggaaggggaggaggaggactactgcacccctggagcctttgag ctggagcggctcttctggaagggcagtccccagtacacccacgtcaacgaggtctggccc aagctctacattggcgatgagtgctgcagggtcccagattgccgccaggaccccagccat caactctag >gi568815588f:75050529_75276251|GENSCAN_predicted_peptide_3|626_aa MAETSLPELGGEDKATPCPSILELEELLRAGKSSCSRVDEVWPNLFIGDAATANNRFELW KLGITHVLNAAHKGLYCQGGPDFYGSSVSYLGVPAHDLPDFDISAYFSSAADFIHRALNT PGGFFTRMSSSAKVLVHCVVGVSRSATLVLAYLMLHQRLSLRQAVITSWSLLPAMGLCHF ATLALILLVLLEALAQADTQKMVEAQRGVGPRACYSIWLLLAPTPPLSHCLQSPQAHILV PLKIQLRRVPDSFSQQMPETSYLTRVGPDIQCWPESWGMDSLQKQDLRRPKIHGAVQASP YQPPTLASLQRLLWVRQAATLNHIDEVWPSLFLGDAYAARDKSKLIQLGITHVVNAAAGK FQVDTGAKFYRGMSLEYYGIEADDNPFFDLSVYFLPVARYIRAALSVPQGRVLVHCAMGV SRSATLVLAFLMICENMTLVEAIQTVQAHRNICPNSGFLRQLQPLDCVSFELFADKVSKT AENFRALSTGEKGFGCKGSCFHRIIPGFMCQGGDFTCHNGTGGKSIYWEKFDDENFILKH TGPGILSMANAGPNTNGSQFFICTAKTKWLDGKHVAFGKVKEGMNIVEAMEGPGMARPAR RSPLLTVDNSNKFDLCFILTSRPFLL >gi568815588f:75050529_75276251|GENSCAN_predicted_CDS_3|1881_bp atggctgagacctctctcccagagctggggggagaggacaaagccacgccttgccccagc atcctggagctggaggagctcctgcgggcagggaagtcttcttgcagccgtgtggacgaa gtttggcccaaccttttcataggagatgcggccacggcaaacaaccgctttgagctgtgg aagctgggcatcacccacgtgctgaacgccgcccacaagggcctctactgtcagggcggc cctgacttctacggcagcagtgtgagctacctgggggtgccagcccacgacctccctgat tttgacatcagtgcctacttctcctctgcggctgacttcatccaccgtgccctcaacacg cctggggggtttttcaccaggatgtcttcctcagccaaggtcctggtgcactgtgtggtg ggcgtgagccgctctgccacgctggtcctggcctacctcatgctgcaccagcggctgtcc ctgcgccaggcggtgatcaccagctggtccttactccctgccatggggctctgccacttt gccaccctggcactgatcctgctggtgctgctggaggctctggcccaggcggacacacag aagatggtggaagcccagcgtggggtcggccctagagcctgctactccatctggctcctc ctggcgcctacaccccctctcagccactgtcttcagtctccacaggcccatattctggtg ccgctgaaaatccagctccgcagggtccctgactccttcagccagcagatgcctgaaaca agctacctgacccgggtggggcctgacatccagtgctggcctgagtcgtgggggatggac tcactgcagaagcaggacctccggaggcccaagatccatggggcagtccaggcatctccc taccagccgcccacattggcttcgctgcagcgcttgctgtgggtccgtcaggctgccaca ctgaaccatatcgatgaggtctggcccagcctcttcctgggagatgcgtacgcagcccgg gacaagagcaagctgatccagctgggaatcacccacgttgtgaatgccgctgcaggcaag ttccaggtggacacaggtgccaaattctaccgtggaatgtccctggagtactatggcatc gaggcggacgacaaccccttcttcgacctcagtgtctactttctgcctgttgctcgatac atccgagctgccctcagtgttccccaaggccgcgtgctggtacactgtgccatgggggta agccgctctgccacacttgtcctggccttcctcatgatctgtgagaacatgacgctggta gaggccatccagacggtgcaggcccaccgcaatatctgccctaactcaggcttcctccgg cagctccagcccttagactgtgtctccttcgagctgtttgcagacaaagtttcaaagaca gcagaaaactttcgtgctctgagcactggagagaaaggatttggttgtaagggttcctgc tttcacagaattattccagggtttatgtgtcagggtggtgacttcacatgccataatggc actggtggcaagtccatctactgggagaaatttgatgatgagaacttcatcctaaagcat acaggtcctggcatcttatccatggcaaatgctggacccaacacaaatggttcccagttt ttcatctgcactgccaagactaagtggttggatggcaagcatgtggcctttggcaaggtg aaagaaggcatgaatattgtggaggccatggagggtccaggaatggcaagaccagcaaga agatcaccattgctgactgtggacaactctaataagtttgacttgtgttttatcttaacc tccagaccattccttctgtaa >gi568815588f:75050529_75276251|GENSCAN_predicted_peptide_4|712_aa MPARSRHRPRLHSGSPPRAPPPPLEALHSGEAGRAPDSDGGSDADSEAAEEEMAGPNQLC IRRWTTKHVAVWLKDEGFFEYVDILCNKHRLDGITLLTLTEYDLRSPPLEIKVLGDIKRL MLSVRKLQKIHIDVLEEMGYNSDSPMGSMTPFISALQSTDWLCNGELSHDCDGPITDLNS DQYQYMNGKNKHSVRRLDPEYWKTILSCIYVFIVFGFTSFIMVIVHERVPDMQTYPPLPD IFLDSVPRIPWAFAMTEVCGMILCYIWLLVLLLHKHRSILLRRLCSLMGTVFLLRCFTMF VTSLSVPGQHLQCTGKIYGSVWEKLHRAFAIWSGFGMTLTGVHTCGDYMFSGHTVVLTML NFFVTEFHRSSDGEELLYSRNYTQGASCGYREGWDLGEVLGGSCPDSCLIQGPAASAFRI PTDGRRQPTCRTGALSVHVAALEQLLELQPDRERAKRLQQLAERWRRPPSGHHQGVGFPL SEDIKGRIREFSTSGSSNTDTGKVTGTLETKYKWCEYGLTFTEKWNTDNTLGTEIAIEDQ ICQGLKLTFDTTFSPNTGKKSGKIKSSYKRECINLGCDVDFDFAGPAIHGSAVFGYEGWL AGYQMTFDSAKSKLTRNNFAVGYRTGDFQLHTNVNKNLKIHTVHYSVTYIDVLGCLWHFN RFNAKVNNSSLIGVGYTQTLRPGVKLTLSALVDGKSINAGGHKVGLALELEA >gi568815588f:75050529_75276251|GENSCAN_predicted_CDS_4|2139_bp atgcctgcgcgcagtcgccaccgcccccgcctccactccggctccccgccccgggctccg cccccgccgcttgaggcgcttcactccggcgaggcggggagggccccggactccgacggc ggctcggacgccgactcggaggcagcggaggaggaaatggcaggtcctaatcaactctgc attcgccgctggactaccaagcatgtagctgtgtggctgaaggatgaaggcttttttgaa tatgtggacattttatgcaataagcaccgacttgatggaatcacattgctaacattgact gaatatgatctccggtctcctcctctggaaatcaaagtcttaggggacattaaaaggtta atgctctcagtccgaaaattgcagaaaatacatattgatgttttagaagagatgggctac aacagtgacagtcccatgggttccatgacccctttcatcagtgctcttcagagtacagac tggctctgtaatggggagctttcccatgactgtgacggacccataactgacttgaattct gatcagtaccagtacatgaatggtaaaaacaaacattctgttcgaagattggacccagaa tactggaagactatactgagttgtatatatgtttttatagtatttggatttacatctttc attatggttatagtccatgagcgagtgcctgacatgcagacctatccaccactcccagat atattcttagacagcgttcctagaatcccatgggcctttgccatgacggaagtatgtggc atgattctgtgctatatttggctcctggttcttcttcttcacaagcacaggtcaatactt ctgcgaaggctctgtagtctgatgggaactgtattcttgcttcgctgctttaccatgttt gtgacctccctctccgtgccaggacaacacctgcagtgtactggaaagatatatggcagt gtatgggagaaattacatcgagcctttgccatttggagtggctttggtatgaccctgact ggcgttcacacatgtggagattacatgtttagtggccacacagtcgtcctaactatgctg aatttctttgtcaccgaatttcacaggtcttcagatggagaggaactgctgtactcaagg aactacacccaaggagcctcatgtggctaccgtgaaggctgggatctcggggaggttttg ggagggagctgcccggatagctgcctgatccaaggcccagcagcttctgccttccggatc cccaccgacggacggaggcagcccacctgcaggaccggggcgctgtcggtccacgtggcc gctctggagcagctgctggagctgcagcccgaccgcgagcgtgccaagcggcttcagcag ctagcggagcggtggcggcggcccccctcaggacaccaccagggtgttggttttcccctc agcgaagatattaaaggacgtatcagggaattttcaacgtccggttcatctaatacagac actggtaaagttactgggaccttggagaccaaatacaagtggtgtgagtatggtctgact ttcacagaaaagtggaacactgataacactctgggaacagaaatcgcaattgaagaccag atttgtcaaggtttgaaactgacatttgatactaccttctcaccaaacacaggaaagaaa agtggtaaaatcaagtcttcttacaagagggagtgtataaaccttggttgtgatgttgac tttgattttgctggacctgcaatccatggttcagctgtctttggttatgagggctggctt gctggctaccagatgacctttgacagtgccaaatcaaagctgacaaggaataactttgca gtgggctacaggactggggacttccagctacacactaatgtcaataaaaacctcaagatt catacagtacactattctgttacttacattgatgttcttggatgtctgtggcattttaac agatttaatgcaaaagtcaacaactctagcttaattggagtaggctatactcagactctg aggcctggtgtgaagcttacactctctgctctggtagatgggaagagcattaatgctgga ggccacaaggttgggctcgccctggagttggaggcttaa >gi568815588f:75050529_75276251|GENSCAN_predicted_peptide_5|262_aa MTQPVPRLSVPAALALGSAALGAAFATGLFLGRRCPPWRGRREQCLLPPEDSRLWQYLLS RSMREHPALRSLRLLTLEQPQGDSMMTCEQAQLLANLARLIQAKKALDLGTFTGYSALAL ALALPADGRVVTCEVDAQPPELGRPLWRQAEAEHKIDLRLKPALETLDELLAAGEAGTFD VAVVDADKENCSAYYERCLQLLRPGGILAVLRVLWRGKVLQPPKGDVAAECVRNLNERIR RDVRVYISLLPLGDGLTLAFKI >gi568815588f:75050529_75276251|GENSCAN_predicted_CDS_5|789_bp atgacccagccggtgccccggctctccgtgcccgccgcgctggccctgggctcagccgca ctgggcgccgccttcgccactggcctcttcctggggaggcggtgccccccatggcgaggc cggcgagagcagtgcctgcttccccccgaggacagccgcctgtggcagtatcttctgagc cgctccatgcgggagcacccggcgctgcgaagcctgaggctgctgaccctggagcagccg cagggggattctatgatgacctgcgagcaggcccagctcttggccaacctggcgcggctc atccaggccaagaaggcgctggacctgggcaccttcacgggctactccgccctggccctg gccctggcgctgcccgcggacgggcgcgtggtgacctgcgaggtggacgcgcagcccccg gagctgggacggcccctgtggaggcaggccgaggcggagcacaagatcgacctccggctg aagcccgccttggagaccctggacgagctgctggcggcgggcgaggccggcaccttcgac gtggccgtggtggatgcggacaaggagaactgctccgcctactacgagcgctgcctgcag ctgctgcgacccggaggcatcctcgccgtcctcagagtcctgtggcgcgggaaggtgctg caacctccgaaaggggacgtggcggccgagtgtgtgcgaaacctaaacgaacgcatccgg cgggacgtcagggtctacatcagcctcctgcccctgggcgatggactcaccttggccttc aagatctag >gi568815588f:75050529_75276251|GENSCAN_predicted_peptide_6|70_aa XTATAELSRTYCYPTRDFISQPPLQLGEGVAIYLFHSPFGQEFPGAPAQGLRGYATIRAA PSKGDSLIHA >gi568815588f:75050529_75276251|GENSCAN_predicted_CDS_6|213_bp ngaacagcaactgctgaactcagcaggacttactgctatccaactagagacttcatttca cagcctcccttgcagctgggggagggggtggccatctaccttttccactcaccttttggc caggaattccccggagcccctgcccagggcctgcgaggttacgccaccatccgagctgct ccctcaaaaggggacagcctaattcacgcatga