GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:51:00 Sequence gi568815587r:82882168_83097316 : 215149 bp : 40.53% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1549 1634 86 0 2 75 111 48 0.730 6.14 1.02 Term + 7169 7445 277 2 1 7 44 183 0.132 -0.25 1.03 PlyA + 7645 7650 6 1.05 2.04 PlyA - 8635 8630 6 1.05 2.03 Term - 19514 18857 658 1 1 25 52 318 0.983 14.33 2.02 Intr - 19660 19541 120 0 0 26 21 163 0.062 2.09 2.01 Init - 36398 36292 107 2 2 89 39 131 0.342 8.04 2.00 Prom - 38958 38919 40 -8.65 3.00 Prom + 39009 39048 40 -6.25 3.01 Init + 43508 43545 38 1 2 65 80 17 0.238 -1.87 3.02 Intr + 46610 46771 162 2 0 92 110 47 0.352 5.47 3.03 Intr + 47990 48107 118 2 1 64 43 53 0.353 -2.05 3.04 Term + 49565 52168 2604 1 0 100 42 1118 0.930 92.76 3.05 PlyA + 52470 52475 6 1.05 4.03 PlyA - 52817 52812 6 1.05 4.02 Term - 60656 60415 242 0 2 66 39 149 0.437 3.00 4.01 Init - 62851 62641 211 1 1 68 47 178 0.615 10.70 4.00 Prom - 68691 68652 40 -3.95 5.00 Prom + 71333 71372 40 -6.05 5.01 Init + 74047 74248 202 0 1 74 50 126 0.597 5.04 5.02 Term + 75531 75937 407 1 2 46 48 327 0.761 18.86 5.03 PlyA + 76089 76094 6 1.05 6.06 PlyA - 76494 76489 6 1.05 6.05 Term - 82156 82044 113 2 2 62 43 74 0.514 -1.96 6.04 Intr - 83662 83525 138 2 0 44 92 147 0.139 10.21 6.03 Intr - 86431 86337 95 0 2 79 83 45 0.116 1.79 6.02 Intr - 88486 88315 172 2 1 38 89 138 0.124 6.98 6.01 Init - 88773 88698 76 0 1 88 68 103 0.137 7.68 6.00 Prom - 89594 89555 40 -6.95 7.06 PlyA - 90989 90984 6 1.05 7.05 Term - 100248 99998 251 1 2 63 43 193 0.783 7.18 7.04 Intr - 102403 102281 123 2 0 4 43 149 0.671 1.64 7.03 Intr - 105603 105420 184 0 1 115 106 254 0.990 28.34 7.02 Intr - 111955 111872 84 1 0 99 116 57 0.641 8.70 7.01 Init - 115149 115057 93 0 0 30 110 55 0.188 2.33 7.00 Prom - 122944 122905 40 -3.25 8.00 Prom + 131675 131714 40 -3.75 8.01 Init + 133270 133360 91 0 1 72 73 77 0.869 5.30 8.02 Intr + 134687 134835 149 0 2 84 101 17 0.384 1.63 8.03 Term + 139130 139207 78 1 0 62 41 130 0.549 2.48 8.04 PlyA + 140308 140313 6 1.05 9.00 Prom + 145103 145142 40 -5.25 9.01 Init + 166717 166779 63 0 0 101 58 63 0.325 5.82 9.02 Term + 182536 182646 111 0 0 55 38 109 0.117 0.18 9.03 PlyA + 183538 183543 6 1.05 10.03 PlyA - 184187 184182 6 1.05 10.02 Term - 190200 190003 198 0 0 50 46 191 0.407 7.52 10.01 Intr - 200182 200086 97 0 1 89 86 47 0.635 3.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 18235 18068 168 1 0 85 116 200 0.869 20.08 S.002 Init - 19794 19541 254 0 2 69 21 166 0.902 4.86 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:82882168_83097316|GENSCAN_predicted_peptide_1|120_aa MRLGAAAGLSDAQMSVTQGCLMKSQKGGCLGDRARLCLKKKTNKIKKRKKKKEKEEGWRK EGEEEKERRGRGSEKKEKDDSSWRRQRQKQQRKLQKQQQKTQQQEQQKQKKKNSSWRRQC >gi568815587r:82882168_83097316|GENSCAN_predicted_CDS_1|363_bp atgagacttggtgctgctgcaggcctctcagatgcacagatgagtgtgacccaaggttgc ttaatgaaatcacagaagggtggttgcctgggtgacagagcaagactctgtctcaaaaaa aaaaccaacaaaataaaaaagaggaagaagaagaaggagaaggaggagggatggaggaag gagggggaggaagagaaagaaagaagaggaagaggaagcgagaagaaggagaaggatgac agctcctggaggaggcagaggcagaagcagcagaggaagctgcagaagcagcagcagaag acgcagcagcaggagcagcagaagcagaagaaaaaaaacagctcctggaggaggcagtgt tag >gi568815587r:82882168_83097316|GENSCAN_predicted_peptide_2|294_aa MACTAKARLSKKNKPGGIALPDFKLYYKAIVTKTPCSPGGEDQERAQATHSDSDRPNSVA RRQPSNRISRVLFRRKANQKKAPQRVELSEEILQASGNDHFRQTAGRVFLPDRWISSTVR KIGFGASLSLLGVGFPGKCGVSRSALGSGGVTERGNGSWMKRVEGALEARPGAVLGASTT HGRREEGGGGRRQEGGGARSGFSGAGEERGRYLKEGTGTWRDLGAMWTRGDLCNGWVVRA TQFWGDPRTEYHREWLGSRWRWVQGAQSLDLDPAVCDDVLRSLLSDLVQVPAPL >gi568815587r:82882168_83097316|GENSCAN_predicted_CDS_2|885_bp atggcctgcacagccaaagcaagactaagcaaaaagaacaaacctggaggcatcgcatta cctgatttcaaactgtactataaggccatcgtcaccaaaacaccatgctcacctggggga gaagaccaggaacgggcccaggcgacgcacagtgactcggaccggccgaactcagtagca cgccggcagccttccaatcgaatctcccgcgtccttttccgccggaaagccaatcagaag aaggcgccgcagagggtggagcttagcgaggaaatcctacaagcatctggaaacgaccat tttcgtcaaactgcgggtagagtctttctccccgacaggtggatcagtagtacagtccgt aaaattggttttggagcttcgctgagcctcctgggcgtagggttcccgggaaagtgtgga gtgagccgcagcgccctggggtctggcggggtgactgaaagagggaatggaagctggatg aagagggttgagggagctttggaagccagacccggcgctgtcctaggggctagcactacc cacgggcggagggaggaagggggcggtggcaggaggcaggaggggggtggagcgcggagt gggttttcgggtgccggagaagaaagggggcgttatttgaaagaagggaccgggacctgg cgggatttaggggcgatgtggacccggggcgacctgtgtaatggatgggttgtgcgagca acacagttttggggtgacccaagaacggagtatcatagggagtggttaggctccagatgg aggtgggtgcaaggagcacagtctttggacctggacccagcggtctgcgacgacgtgctc cgctcactgctgagtgatctagtgcaagttcccgcacctctctga >gi568815587r:82882168_83097316|GENSCAN_predicted_peptide_3|973_aa MDDVLNNSTMVARSNCPKCGSTGESGNANYRYKLSLKVAESNKLFVITVFGSCLDTFFGL TATGLHRYIQDPNKIPETLDNDTTQNLLTKAVETCFVGQSFIFGVTNFENQPGQGSDASN FLQQCSDHKRKAKALVACQIVLPDPGIAGFTVIDYFHQLLQTFNFRKLQCDSQAPNNHLL ALDHSNSDLSSIYTSDSTSDFFKSCSKDTFSKFWQPSLEFTCIVSQLTDNDDFSASEQSK AFGTLQQNRKSISIAEATGSSSCHDPIQDSWSLVSYMDKKSTAEKLGKELGLQAKELSAV HSSHHEIGVNDSNLFSLEMREPLESSNTKSFHSAVEIKNRSQHELPCFQHHGIDTPTSLQ KRSACCPPSLLRLEETASSSQDGDPQIWDDLPFSESLNKFLAVLESEIAVTQADVSSRKH HVDNDIDKFHADHSRLSVTPQRTTGALHTPPIALRSSQVIVKANCSKDDFLFNCKGNLSP SVEKESQPDNKVEAVSVNHNGRDMSEYFLPNPYLSALSSSSKDLETIVTLKKTIRISPHR ESDHSSLNNKYLNGCGEISVSEMNEKLTTLCYRKYNDVSDLCKLENKQYCRWSKNQDDSF TICRKLTYPLETLCNSPNRSTNTLKEMPWGHINNNVTQSYSIGYEGSYDASADLFDDIAK EMDIATEITKKSQDILLKWGTSLAESHPSESDFSLRSLSEDFIQPSQKLSLQSLSDSRHS RTCSPTPHFQSDSEYNFENSQDFVPCSQSTPISGFHQTRIHGINRAFKKPVFYSDLDGNY EKIRIFPENDKQQASPSCPKNIKTPSQKIRSPIVSGVSQPDVFNHYPFAECHETDSDEWV PPTTQKIFPSDMLGFQGIGLGKCLAAYHFPDQQELPRKKLKHIRQGTNKGLIKKKLKNML AAVVTKKKTHKYNCKSSGWISKCPDIQVLAAPQLHPILGPDSCSEVKCCLPFSEKGPPSV CETRSAWSPELFS >gi568815587r:82882168_83097316|GENSCAN_predicted_CDS_3|2922_bp atggatgatgtgttaaataattcgactatggtagccaggtctaattgtccaaaatgtggc tctactggtgaatctggaaatgccaattacagatacaaactttccttaaaagttgcagaa tcaaacaaattgtttgttattactgtatttggaagttgcttagatacattttttggtctt actgccactggtttgcacaggtacattcaggatcctaataaaattccagaaacactggac aatgatacaactcagaatctattaactaaagcagttgaaacttgctttgttggacaaagc tttatttttggagtgacgaattttgaaaaccaacctggacaaggttcagatgccagtaac ttcttacagcaatgctctgaccacaaaagaaaagccaaagcactagtggcttgccagatt gttctaccagacccaggtattgcaggctttactgtcattgactacttccatcaacttttg cagacttttaatttcaggaaacttcagtgtgactctcaggcacctaacaatcacttactt gctttagatcactcaaatagtgatctcagcagcatatatacttctgacagcacttctgat tttttcaagtcctgcagcaaggatactttttcaaaattctggcagccatcacttgaattc acttgcattgtttcacaactaacagataatgatgatttttcagcttcagaacaaagtaag gcctttggtactcttcagcagaacagaaagtccatctccattgcagaggccactggttcc agtagctgccatgatcccattcaggattcatggagccttgtttcatatatggataaaaag agtacagcagaaaagttgggtaaagaacttggcttacaagctaaggagctgagtgcagtt cacagcagtcatcatgaaattggagttaatgactctaatttattctctttggaaatgcga gagccccttgagtcaagtaatacaaaatccttccacagtgcagtggaaattaaaaatagg tcccagcatgagctaccatgttttcagcatcatggtatagataccccaactagccttcag aagagatctgcatgttgtccaccttcgttactcagacttgaagagacagccagcagttcc caggatggtgaccctcaaatttgggatgatctgccattctctgaaagcctgaacaagttt ctggcagttcttgaaagtgagattgctgtaacccaggcagatgtcagtagtaggaaacat catgtagataatgacattgataaatttcatgcagaccacagcaggttatctgtgactccc cagagaactactggagccctgcatacaccacctatagctttaagatcatcacaagtaata gtcaaagcaaactgtagcaaagatgacttccttttcaactgtaaaggaaatctaagtcct agtgttgaaaaggagtcacaaccagataacaaagtagaggctgtctctgtaaatcataat ggaagagatatgtcagaatattttttaccgaatccttacctgtcagctctgtcttcatct tcaaaagatttagaaacaatagttactcttaagaagactatcagaatctcaccacacagg gagagtgaccattctagtctaaataacaaatatttgaatggatgtggagaaatatcagtt tcagaaatgaatgaaaagttgacaactctgtgttataggaagtataatgatgtctctgat ctttgcaaattagaaaataaacaatattgtaggtggtccaagaaccaagatgacagtttt acaatttgcaggaaacttacatatcctttagaaactctttgcaatagtccaaatagaagt acaaatacattgaaagaaatgccttggggacatatcaataacaacgtaacacagagctat tctattggttatgaaggtagctatgatgcctctgctgatctctttgatgatattgctaaa gaaatggacattgcaactgagattaccaaaaaatcacaggatattttgttaaaatgggga acatctttggcagaaagtcacccttcagagtctgatttttcactgagatcactttctgaa gacttcatccagccttcacaaaaattatccttgcaaagcctatctgactctaggcattca agaacatgctctccaacacctcattttcaatcagattcagaatataattttgaaaatagt caagactttgttccatgttcacagtcaactccaatttcagggttccaccaaacaagaatt catgggataaacagagctttcaaaaaacctgtattttattcagatcttgatggtaactat gaaaaaataaggattttccctgaaaatgacaaacagcaagccagcccaagctgtccaaaa aatataaaaacacctagccagaaaatcagaagccctattgtatctggtgtttcacaacca gacgttttcaatcactacccttttgctgagtgccatgaaactgatagtgatgaatgggtc cctcctaccacacaaaaaatatttccttcagatatgcttggattccaaggcataggtcta gggaaatgccttgctgcctatcatttccctgatcaacaagagttaccaagaaagaaactg aaacatattagacaaggaaccaataaaggtttaattaagaagaaattaaagaatatgctt gcagcagttgttacgaaaaagaaaactcataaatataactgtaaaagttcaggctggatt tccaaatgtccagacattcaagtcttagcagcacctcagctgcaccctattcttggacct gattcttgttcagaagtcaaatgttgccttccattttcagaaaaaggcccaccttcagtg tgtgaaactcgaagtgcttggtcacctgaattgttttcataa >gi568815587r:82882168_83097316|GENSCAN_predicted_peptide_4|150_aa MWKRLWNWVTGRGWNSLEGSEEDRKMWESLEPPRDLLNGFAQNADNDIDNEIQAEVVSDG DEELVGNWSRGRKKKEKLQPFGDPRPSSSLSQGCDILFGALQFLASPSFWAPPHSPMSAV EVACGMTGPATALQRAGACAGSWSCPSCCS >gi568815587r:82882168_83097316|GENSCAN_predicted_CDS_4|453_bp atgtggaagcgactttggaactgggtaacaggcagaggttggaacagtttggaaggctca gaagaagacaggaaaatgtgggaaagtttggaacctcctagagacttgttgaatggcttt gcccaaaatgctgataatgatatagacaatgaaatccaggctgaggtggtctcagatgga gatgaggaacttgttggcaattggagtagagggagaaaaaagaaagaaaagctgcagccc tttggggatcccagacctagtagctccttaagccagggctgtgacatcctctttggggct ctgcagttcctggcatctccaagcttctgggccccaccacattccccaatgtcagctgtg gaagtagcttgtgggatgactggtccagccacagccttgcagagagctggtgcttgtgct ggttcctggagctgcccatcctgctgcagttag >gi568815587r:82882168_83097316|GENSCAN_predicted_peptide_5|202_aa MVGAPPPTKLKHLWATSDCCASSENFKPVDLSLLGSMGMRPAKPDYLDPWLQPPFQGSEW FHLTGVPAAQLLKLAHEYRPVTKQEKKQELLAHADRNAASNGDIPTKSSPVLQAGVNTVT NSMVNKKSQLVLTAQDMDPIKLVVFLPALCCKMGVPYCVIKEKARPGSLVHRRTCIIVTF TQVKWKKQGALAKLWELQDQLQ >gi568815587r:82882168_83097316|GENSCAN_predicted_CDS_5|609_bp atggtgggcgcccctcccccaaccaagctcaagcatctctgggcgacctcagactgctgt gctagcagcgagaatttcaagccagtggatcttagcttgctgggctccatgggcatgcga cctgccaagccagactacttggatccctggcttcagccccctttccagggaagtgaatgg ttccatctcactggtgttccagctgctcagcttcttaagctggcccacgagtacagacca gtgacaaagcaagagaagaagcaggaattgttggcccatgctgacaggaatgctgccagc aatggagacatccccactaagagttcacctgtccttcaagcaggggttaatactgtcacc aactcaatggtgaacaagaagtctcagctggtgctgactgcacaagatatggatcccatc aagctggttgtcttcctgccagccctgtgttgtaaaatgggggtcccttactgtgttatc aaagagaaggcaaggccgggaagtttagtccacaggaggacctgcatcattgtcaccttc acacaggttaaatggaaaaaacaaggagctttagctaaactttgggagcttcaggatcaa ttacaatga >gi568815587r:82882168_83097316|GENSCAN_predicted_peptide_6|197_aa MAGCRSRALPRGEAAKARRKIERSAAQGAGSGLGQPREGLPWCSGGLKGSSSAARMDGEA EEAPRASDAARAASMLSPLNMILIDYKGAKAEEMKPSRETTGKLQVRKGDLYHRNLQREG TKSEQREDPEAELKEEKAGNPAWSTVHWTLFLAHNGSRGMVLEVLATAIRQEKERKGIQI GREEVKLSLFADDMILI >gi568815587r:82882168_83097316|GENSCAN_predicted_CDS_6|594_bp atggcgggctgccgctcccgagccctaccccgcggggaggcagctaaggcccggcgaaaa atcgagcgcagcgccgcccagggagccggctccggcctcggccagcccagagaagggctc ccatggtgcagcggcgggctgaagggctcctcaagcgcggccagaatggacggcgaggcc gaggaggcaccgagagcgagcgacgctgcgcgggctgccagcatgctgtcacctctcaat atgatcttaatagattataaaggggcaaaggcagaagaaatgaaaccatctagggagact actggtaaactgcaggtgagaaaaggtgacttgtaccataggaatcttcagagggaaggc accaagagtgaacagagggaagacccagaagctgagctgaaggaggagaaagctgggaac cctgcatggtctactgtgcactggaccttgttcctggcccacaatggctccagaggaatg gtattggaagtcctggccacggcaatcaggcaagagaaagaaagaaagggcatccaaata ggaagagaggaagtcaaactatccctgtttgcagatgacatgattctaatctag >gi568815587r:82882168_83097316|GENSCAN_predicted_peptide_7|244_aa MSMEDYDFLFKIVLIGNAGVGKTCLVRRFTQGLFPPGQGATIGVDFMIKTVEINGEKVKL QIWDTAGQERFRSITQSYYRSANALILTYDITCEESFRCLPEWLREIEQYASNKVITVLV DYRRPRVASCSLRRTQDRKYSLTADQHPATLFSGSNPSSSPRNKIDLAERREVSQQRAEE FSEAQDMYYLETSAKESDNVEKLFLDLACRLISEARQNTLVNNVSSPLPGEGKSISYLTC CNFN >gi568815587r:82882168_83097316|GENSCAN_predicted_CDS_7|735_bp atgagtatggaagattatgatttcctgttcaaaattgttttaattggcaacgctggtgtg gggaagacgtgcctcgtccgaagattcactcagggtcttttccccccaggtcaaggagcc acaattggagttgattttatgattaagacagtggagattaatggtgaaaaagtaaagcta cagatctgggacacagcaggtcaagagagatttcggtccattacccagagttactaccga agcgccaatgccttgatcctcacctatgacattacctgtgaggaatccttccgttgcctt cctgagtggctgcgggagatagaacaatatgccagcaacaaggtcatcactgtgttagtg gactaccgtagaccacgcgtggcatcgtgttcacttagaaggacacaggacaggaaatac agcttgacagcagatcagcatccggcaaccctcttcagtggaagtaatcccagtagcagt ccacgcaacaagattgacctggctgaaaggagagaggtttcccagcagcgagctgaagaa ttctcagaagctcaggacatgtattatctggagacctcagccaaggaatctgataatgtg gagaaactcttccttgacttagcatgccgactcatcagtgaagccagacagaacacactt gtgaacaatgtatcctcacccttacctggagaagggaaaagcatcagctatttgacttgt tgtaatttcaactaa >gi568815587r:82882168_83097316|GENSCAN_predicted_peptide_8|105_aa MAPLSEVGKIGKTEIGVELNQESSFGHVRKYFYYCIFTQMSFMCAHLAGVLLAAGPSMVA LSCSERVGGEDGDMEKSQCNVMSEEERRAVALQEAQTWELPDPGL >gi568815587r:82882168_83097316|GENSCAN_predicted_CDS_8|318_bp atggcaccactttctgaagttgggaagattgggaaaactgagattggagtggagctgaat caagagtcctcttttggccatgttaggaaatacttctattattgcattttcactcaaatg tcatttatgtgtgcacatctggctggggtcctactagctgcaggaccctcaatggttgcc ctttcatgttcagaaagagttggaggggaggatggggacatggaaaaatcacagtgcaat gttatgagtgaagaggagagacgagctgtggccctccaggaagcccagacctgggagctc cctgatccagggctgtga >gi568815587r:82882168_83097316|GENSCAN_predicted_peptide_9|57_aa MALSGCTPPFLDALPFSLLEHLCYDIPEDIQERQLIIYQYLQRVRLNKWNHYHIQEN >gi568815587r:82882168_83097316|GENSCAN_predicted_CDS_9|174_bp atggccctgagtggctgtactccacccttcttagatgctctgcccttctccctgctggag catctttgctatgacattcctgaagacattcaggaacgccagctgatcatataccaatat ctacaacgtgtcaggttaaacaaatggaatcattatcacatccaagaaaactaa >gi568815587r:82882168_83097316|GENSCAN_predicted_peptide_10|98_aa XHWTPKFFSFQTQTLALSFPQACSQPTVGPCDRLKNGEGSVGPHATPSGAETEAHWASGC RFQSVPPYISPQHGLHPLVPRFVPAAIRPDRPPPHRPH >gi568815587r:82882168_83097316|GENSCAN_predicted_CDS_10|297_bp nnacactggacccccaagttcttcagttttcagactcagactctggctctctcttttcct caagcttgcagccagcctactgtgggaccttgtgatcgtctaaaaaatggagaaggaagc gtagggccccatgcaaccccttcaggagctgagaccgaggctcactgggcctctggatgc cgcttccagtcggtcccgccctacatcagtccccagcatggcctccaccctctggttccc aggttcgtgcccgcggctatccgaccagatcgtcctccacctcaccgtccccactga