GENSCAN 1.0	Date run: 4-Nov-116	Time: 20:33:21

Sequence gi568815588r:74994679_75204076 : 209398 bp : 45.13% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +  11193  11232   40                               0.04
 1.01 Init +  18042  18103   62  2  2   73   59    61 0.488   2.42
 1.02 Intr +  20996  21072   77  2  2   96   34    41 0.114  -1.34
 1.03 Intr +  25904  26135  232  0  1   83  109   128 0.247  11.23
 1.04 Intr +  26448  26607  160  0  1   64   80    62 0.971   2.99
 1.05 Intr +  27203  27553  351  1  0   93  100   487 0.999  45.92
 1.06 Intr +  30280  30571  292  0  1   18  100   218 0.416  12.71
 1.07 Term +  33811  36368 2558  2  2   49   42  2596 0.290 236.49
 1.08 PlyA +  37591  37596    6                               1.05

 2.10 PlyA -  37880  37875    6                               1.05
 2.09 Term -  42180  42017  164  1  2   -1   49   141 0.565  -0.60
 2.08 Intr -  43399  43187  213  2  0  118   89   448 0.670  46.59
 2.07 Intr -  49339  49119  221  0  2   56   61   529 0.878  44.95
 2.06 Intr -  63870  63637  234  2  0   96   80   365 0.563  33.30
 2.05 Intr -  64355  64170  186  1  0   16   81   104 0.401   1.30
 2.04 Intr -  66159  66006  154  1  1   53   41    88 0.016  -0.27
 2.03 Intr -  77271  77188   84  1  0   50   51    79 0.002   0.19
 2.02 Intr -  87061  86948  114  2  0  104   92     6 0.393   3.02
 2.01 Init -  92308  92161  148  1  1   38   32   151 0.482   4.75
 2.00 Prom -  92605  92566   40                              -3.66

 3.10 PlyA -  93300  93295    6                               1.05
 3.09 Term -  95009  94518  492  2  0   -9   35   387 0.364  18.31
 3.08 Intr - 100200 100040  161  1  2   85   23   179 0.567  10.81
 3.07 Intr - 101117 100897  221  1  2  131   50   407 0.979  39.15
 3.06 Intr - 103207 103034  174  0  0   99   66   110 0.978   8.95
 3.05 Intr - 104480 104353  128  2  2    4   74   148 0.361   4.48
 3.04 Intr - 109419 109216  204  0  0   98   75   113 0.002  10.40
 3.03 Intr - 111217 111084  134  2  2   36   56   179 0.002   9.86
 3.02 Intr - 113531 113314  218  1  2  107   41   387 0.995  34.05
 3.01 Init - 114479 114331  149  2  2   62   66   106 0.744   5.52
 3.00 Prom - 116323 116284   40                              -9.75

 4.00 Prom + 116443 116482   40                             -11.14
 4.01 Init + 116871 117011  141  2  0   86   90   181 0.491  16.24
 4.02 Intr + 155836 156428  593  0  2  106  111   280 0.346  23.50
 4.03 Intr + 169967 170062   96  2  0   93   82    14 0.049   0.42
 4.04 Intr + 173863 173980  118  1  1   88   63    -1 0.119  -2.13
 4.05 Intr + 181388 181538  151  1  1   67  101   114 0.710  10.34
 4.06 Term + 199845 199909   65  1  2  107   38    29 0.010  -2.15
 4.07 PlyA + 200684 200689    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  55389  55331   59  0  2   78   50    89 0.875   4.88

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:74994679_75204076|GENSCAN_predicted_peptide_1|1243_aa
MQNDQPRKGVHTYPQNHGTIWLANPVPSTDTDLSAGFAELNRCLMKSYLLSRREGQAGSP
EKPLSDLGRLSYLAYWKSVILEYLYHHHERHISIKAISRATGMCPHDIATTLQHLHMIDK
RDGRFVIIRREKLILSHMEKLKTCSRANELDPDSLRWTPILISNAAVSEEEREAEKEAER
LMEQASCWEKEEQEILSTRANSRQSPAKVQSKNKYLHSPESRPVTGERGQLLELSKESSE
EEEEEEDEEEEEEEEEEEEDEEEEEEEEEEEEEENIQSSPPRLTKPQSVAIKRKRPFVLK
KKRGRKRRRINSSVTTETISETTEVLNEPFDNSDEERPMPQLEPTCEIEVEEDGRKPVLR
KAFQHQPGKKRQTEEEEGKDNHCFKNADPCRNNMNDDSSNLKEGSKDNPEPLKCKQVWPK
GTKRGLSKWRQNKERKTGFKLNLYTPPETPMEPDEQVTVEEQKETSEGKTSPSPIRIEEE
VKETGEALLPQEENRREETCAPVSPNTSPGEKPEDDLIKPEEEEEEEEEEEEEEEEEEGE
EEEGGGNVEKDPDGAKSQEKEEPEISTEKEDSARLDDHEEEEEEDEEPSHNEDHDADDED
DSHMESAEVEKEELPRESFKEVLENQETFLDLNVQPGHSNPEVLMDCGVDLTASCNSEPK
ELAGDPEAVPESDEEPPPGEQAQKQDQKNSKEVDTEFKEGNPATMEIDSETVQAVQSLTQ
ESSEQDDTFQDCAETQEACRSLQNYTRADQSPQIATTLDDCQQSDHSSPVSSVHSHPGQS
VRSVNSPSVPALENSYAQISPDQSAISVPSLQNMETSPMMDVPSVSDHSQQVVDSGFSDL
GSIESTTENYENPSSYDSTMGGSICGNGSSQNSCSYSNLTSSSLTQSSCAVTQQMSNISG
SCSMLQQTSISSPPTCSVKSPQGCVVERPPSSSQQLAQCSMAANFTPPMQLAEIPETSNA
NIGLYERMGQSDFGAGHYPQPSATFSLAKLQQLTNTLIDHSLPYSHSAAVTSYANSASLS
TPLSNTGLVQLSQSPHSVPGGPQAQATMTPPPNLTPPPMNLPPPLLQRNMAASNIGISHS
QRLQTQIASKGHISMRTKSASLSPAAATHQSQIYGRSQTVAMQGPARTLTMQRGMNMSVN
LMPAPAYNVNSVNMNMNTLNAMNGYSMSQPMMNSGYHSNHGYMNQTPQYPMQMQMGMMGT
QPYAQQPMQTPPHGNMMYTAPGHHGYMNTGMSKQSLNGSYMRR

>gi568815588r:74994679_75204076|GENSCAN_predicted_CDS_1|3732_bp
atgcagaatgaccaaccccgcaagggagtccacacctacccacagaaccatggaaccatc
tggcttgcaaacccagtgccgtcgactgatacagacctttcagcaggatttgctgagcta
aacaggtgtctgatgaaaagctatttgctttctagaagagaaggccaagcagggtctcct
gaaaagcctctctccgatctgggccgtctctcctacctggcatattggaagagcgtcatc
ttggagtatctctaccaccaccatgagaggcacatcagcatcaaggcaattagcagagcg
acgggcatgtgcccacatgacattgccaccactctgcagcacctccacatgatcgacaag
agagatggcagatttgtcatcattagacgggaaaagttgatattgagccacatggaaaag
ctgaaaacctgttccagagccaatgaacttgatccagacagtctgaggtggaccccaatt
ttaatttctaatgctgcagtgtctgaagaagagcgagaagctgagaaagaggctgagcgg
ctaatggaacaagctagctgctgggagaaggaggaacaagaaatcctgtcaactagagct
aacagtaggcaatcacctgcaaaagtacaatcgaaaaataaatatttgcattccccggag
agccggccagtcacaggggagcgagggcagctgctggagctgtctaaagagagcagtgaa
gaagaagaggaggaggaggacgaggaggaggaagaagaggaggaagaagaggaagaggat
gaagaggaggaagaagaggaagaagaagaagaagaagaagaaaatattcaaagctctccc
ccaagattgacgaaaccacagtcagttgccataaagagaaagaggccttttgtactaaag
aagaaaaggggtcgtaaacgcaggaggatcaacagcagtgtaacaacagagaccatttca
gagacgacagaagtactgaatgagccctttgacaactcagatgaagagaggccaatgcca
cagctggagcctacctgtgagattgaagtggaggaagatggcaggaagccagtcctgaga
aaagcattccagcatcagcctgggaagaaaagacaaacagaggaagaggaaggaaaagac
aatcattgcttcaagaatgctgacccttgtagaaacaatatgaatgatgattcaagtaac
ttgaaagaaggcagtaaagacaatcccgaacctctaaagtgcaaacaagtgtggccaaaa
ggaacaaagcgcggtctatctaagtggaggcaaaacaaagagaggaagaccggatttaaa
ctgaatttgtacaccccgccagaaacacccatggagcctgacgagcaggtaacagtggaa
gaacagaaggagacttcagaaggaaaaaccagccccagtcccatcaggattgaggaggag
gtcaaggaaactggggaagccctgttgcctcaagaggaaaacagaagggaagaaacatgt
gcccctgtaagtccaaacacatcaccaggtgaaaaaccagaagatgatctcatcaaacct
gaggaagaggaagaggaggaggaggaggaagaggaagaagaggaagaagaggaaggggaa
gaagaagaaggaggaggaaatgtagaaaaagatccagatggtgctaaaagccaagaaaaa
gaggaaccagaaatctccacggaaaaagaagactctgcacgtttggatgatcacgaagag
gaggaggaagaggatgaagagccatcccacaacgaggaccatgatgccgatgacgaggat
gacagccacatggagtctgccgaagtggagaaggaagagctgcccagagaaagcttcaaa
gaagtactggaaaaccaggagacttttttagaccttaatgtgcagcctggtcactcgaac
ccagaggtcttaatggactgtggcgtcgacctgacagcttcttgtaacagtgagcccaag
gagcttgctggggaccctgaagctgtacccgaatctgacgaggagccacccccaggagaa
caggcacagaagcaggaccaaaagaacagcaaggaagtcgatacagagttcaaagaggga
aacccagcaaccatggaaatcgactctgagactgtccaggccgttcagtctttgacccag
gagagcagcgaacaggacgacacctttcaggattgtgccgagactcaagaggcctgtaga
agcctacagaactacacccgtgcagaccaaagtccacagattgccaccacgctcgacgat
tgccaacagtcggaccacagtagcccagtttcatccgtccactcccatcctggccagtcc
gtacgttctgtcaacagcccaagtgtccctgctctggaaaacagctacgcccaaatcagc
ccagatcaaagtgccatctcagtgccatctctgcagaacatggaaaccagtcccatgatg
gatgtcccatcagtttcagatcattcacagcaagtcgtagacagtggatttagtgacctg
ggcagtatcgagagcacaactgagaactacgaaaacccaagcagctacgattctactatg
ggaggcagcatctgtggaaacggctcttcacagaacagctgctcctatagcaacctcacc
tccagcagtctgacacagagcagctgtgctgtcacccagcagatgtccaacatcagcggg
agctgcagcatgctgcagcaaaccagcatcagctcccctccgacctgcagcgtcaagtct
cctcaaggctgtgtggtggagaggcctccgagcagcagccagcagctggctcagtgcagc
atggctgctaacttcaccccacccatgcagctggctgaaatccccgagacgagcaacgcc
aacattggcttatacgagcgaatgggtcagagtgattttggggctgggcattacccgcag
ccgtcagccaccttcagccttgccaaactgcagcagttaactaatacacttattgatcat
tcattgccttacagccattccgctgctgtgacttcctatgcaaacagtgcctctttgtcc
acaccattaagtaacacagggcttgttcaactttctcagtctccacactccgtccctggg
ggaccccaagcacaagctaccatgaccccaccccccaacctgactcctcctccaatgaat
ctgccgccgcctcttttgcaacggaacatggctgcatcaaatattggcatctctcacagc
caaagactgcaaacccagattgccagcaagggccacatctccatgagaaccaagtcagcg
tctctgtcaccagccgctgccacccatcagtcacaaatctatgggcgctcccagactgta
gccatgcagggtcctgcacggactttaacgatgcaaagaggcatgaacatgagtgtgaac
ctgatgccagcgccagcctacaatgtcaactctgtgaacatgaacatgaacactctcaac
gccatgaatgggtacagcatgtcccagccaatgatgaacagtggctaccacagcaatcat
ggctatatgaatcaaacgccccaataccctatgcagatgcagatgggcatgatgggcacc
cagccatatgcccagcagccaatgcagaccccaccccacggtaacatgatgtacacggcc
cccggacatcacggctacatgaacacaggcatgtccaaacagtctctcaatggctcctac
atgagaaggtag

>gi568815588r:74994679_75204076|GENSCAN_predicted_peptide_2|505_aa
MLVKAQKGRLQDHRDGYGDHNSGILQWEEEIGFNSEYGMDKWDFIDKERAALVFERIRAQ
ASCCNRDSRIQWLKQTAAYFSLMSLYRGLPVPVGVIFCRRAFDNQPLVSSAYVFLTSRLI
DIDLFGEMIPAMVVCSSIRRHMMSDHLFLFIRGWQFMLSLWMQQLIRYGLGDVLFPLADG
SLQTLRGPQAWNVRNLLGLPILGPKWPKPVGFSQIKEKVGTKGVKSKESCRKERKSLGSK
MTSGEVKTSLKNAYSSAKRLSPKMEEEGEEEDYCTPGAFELERLFWKGSPQYTHVNEVWP
KLYIGDEATALDRYRLQKAGFTHVLNAAHGRWNVDTGPDYYRDMDIQYHGVEADDLPTFD
LSVFFYPAAAFIDRALSDDHSKILVHCVMGRSRSATLVLAYLMIHKDMTLVDAIQQVAKN
RCVLPNRGFLKQLRELDKQLVQQRRRSQRQDGENGSRHEAGSDSFQGQESQTRQKPKRVA
AVGDAGRQGPGMEMAWRNQNVIKAF

>gi568815588r:74994679_75204076|GENSCAN_predicted_CDS_2|1518_bp
atgcttgttaaagcacagaaaggaagacttcaggaccaccgagatgggtatggggaccac
aacagtgggattttgcagtgggaagaagagattgggttcaactctgaatacggcatggac
aagtgggactttatagacaaggagagggctgcattggtctttgaacgcataagggcacag
gccagttgttgtaacagagactccagaatacagtggcttaaacagacagctgcttatttc
tccctcatgtcactgtacagaggcctgccagtgcctgtcggcgtgatcttctgccggcgc
gcttttgacaaccagccgcttgtgtcttctgcctatgtgttcctcacaagtcggctgata
gatatagacctctttggtgagatgattcccgccatggtggtgtgctcttctattcgaagg
catatgatgtctgaccatctcttcttattcatacgtggctggcagtttatgctcagtctc
tggatgcagcaactcatcaggtatggccttggcgatgtactctttccactggctgacggc
tccctgcagacactgagagggccacaggcctggaatgtcaggaatcttctgggccttcct
attttgggtcccaaatggcccaaaccagtaggtttcagtcagataaaggaaaaagtagga
accaaaggagtaaagtctaaagaaagctgcagaaaggagagaaaatcccttggctctaaa
atgacatctggagaagtgaagacaagcctcaagaatgcctactcatctgccaagaggctg
tcgccgaagatggaggaggaaggggaggaggaggactactgcacccctggagcctttgag
ctggagcggctcttctggaagggcagtccccagtacacccacgtcaacgaggtctggccc
aagctctacattggcgatgaggcgacggcgctggaccgctataggctgcagaaggcgggg
ttcacgcacgtgctgaacgcggcccacggccgctggaacgtggacactgggcccgactac
taccgcgacatggacatccagtaccacggcgtggaggccgacgacctgcccaccttcgac
ctcagtgtcttcttctacccggcggcagccttcatcgacagagcgctaagcgacgaccac
agtaagatcctggttcactgcgtcatgggccgcagccggtcagccaccctggtcctggcc
tacctgatgatccacaaggacatgaccctggtggacgccatccagcaagtggccaagaac
cgctgcgtcctcccgaaccggggctttttgaagcagctccgggagctggacaagcagctg
gtgcagcagaggcgacggtcccagcgccaggacggagaaaacggcagcagacatgaagcg
ggtagtgacagcttccagggccaggaatcccaaaccagacagaaaccaaagagagttgct
gctgttggagatgccggccgtcaaggaccaggcatggaaatggcctggagaaaccagaat
gtgataaaggccttctag

>gi568815588r:74994679_75204076|GENSCAN_predicted_peptide_3|626_aa
MAETSLPELGGEDKATPCPSILELEELLRAGKSSCSRVDEVWPNLFIGDAATANNRFELW
KLGITHVLNAAHKGLYCQGGPDFYGSSVSYLGVPAHDLPDFDISAYFSSAADFIHRALNT
PGGFFTRMSSSAKVLVHCVVGVSRSATLVLAYLMLHQRLSLRQAVITSWSLLPAMGLCHF
ATLALILLVLLEALAQADTQKMVEAQRGVGPRACYSIWLLLAPTPPLSHCLQSPQAHILV
PLKIQLRRVPDSFSQQMPETSYLTRVGPDIQCWPESWGMDSLQKQDLRRPKIHGAVQASP
YQPPTLASLQRLLWVRQAATLNHIDEVWPSLFLGDAYAARDKSKLIQLGITHVVNAAAGK
FQVDTGAKFYRGMSLEYYGIEADDNPFFDLSVYFLPVARYIRAALSVPQGRVLVHCAMGV
SRSATLVLAFLMICENMTLVEAIQTVQAHRNICPNSGFLRQLQPLDCVSFELFADKVSKT
AENFRALSTGEKGFGCKGSCFHRIIPGFMCQGGDFTCHNGTGGKSIYWEKFDDENFILKH
TGPGILSMANAGPNTNGSQFFICTAKTKWLDGKHVAFGKVKEGMNIVEAMEGPGMARPAR
RSPLLTVDNSNKFDLCFILTSRPFLL

>gi568815588r:74994679_75204076|GENSCAN_predicted_CDS_3|1881_bp
atggctgagacctctctcccagagctggggggagaggacaaagccacgccttgccccagc
atcctggagctggaggagctcctgcgggcagggaagtcttcttgcagccgtgtggacgaa
gtttggcccaaccttttcataggagatgcggccacggcaaacaaccgctttgagctgtgg
aagctgggcatcacccacgtgctgaacgccgcccacaagggcctctactgtcagggcggc
cctgacttctacggcagcagtgtgagctacctgggggtgccagcccacgacctccctgat
tttgacatcagtgcctacttctcctctgcggctgacttcatccaccgtgccctcaacacg
cctggggggtttttcaccaggatgtcttcctcagccaaggtcctggtgcactgtgtggtg
ggcgtgagccgctctgccacgctggtcctggcctacctcatgctgcaccagcggctgtcc
ctgcgccaggcggtgatcaccagctggtccttactccctgccatggggctctgccacttt
gccaccctggcactgatcctgctggtgctgctggaggctctggcccaggcggacacacag
aagatggtggaagcccagcgtggggtcggccctagagcctgctactccatctggctcctc
ctggcgcctacaccccctctcagccactgtcttcagtctccacaggcccatattctggtg
ccgctgaaaatccagctccgcagggtccctgactccttcagccagcagatgcctgaaaca
agctacctgacccgggtggggcctgacatccagtgctggcctgagtcgtgggggatggac
tcactgcagaagcaggacctccggaggcccaagatccatggggcagtccaggcatctccc
taccagccgcccacattggcttcgctgcagcgcttgctgtgggtccgtcaggctgccaca
ctgaaccatatcgatgaggtctggcccagcctcttcctgggagatgcgtacgcagcccgg
gacaagagcaagctgatccagctgggaatcacccacgttgtgaatgccgctgcaggcaag
ttccaggtggacacaggtgccaaattctaccgtggaatgtccctggagtactatggcatc
gaggcggacgacaaccccttcttcgacctcagtgtctactttctgcctgttgctcgatac
atccgagctgccctcagtgttccccaaggccgcgtgctggtacactgtgccatgggggta
agccgctctgccacacttgtcctggccttcctcatgatctgtgagaacatgacgctggta
gaggccatccagacggtgcaggcccaccgcaatatctgccctaactcaggcttcctccgg
cagctccagcccttagactgtgtctccttcgagctgtttgcagacaaagtttcaaagaca
gcagaaaactttcgtgctctgagcactggagagaaaggatttggttgtaagggttcctgc
tttcacagaattattccagggtttatgtgtcagggtggtgacttcacatgccataatggc
actggtggcaagtccatctactgggagaaatttgatgatgagaacttcatcctaaagcat
acaggtcctggcatcttatccatggcaaatgctggacccaacacaaatggttcccagttt
ttcatctgcactgccaagactaagtggttggatggcaagcatgtggcctttggcaaggtg
aaagaaggcatgaatattgtggaggccatggagggtccaggaatggcaagaccagcaaga
agatcaccattgctgactgtggacaactctaataagtttgacttgtgttttatcttaacc
tccagaccattccttctgtaa

>gi568815588r:74994679_75204076|GENSCAN_predicted_peptide_4|387_aa
MPARSRHRPRLHSGSPPRAPPPPLEALHSGEAGRAPDSDGGSDADSEAAEEEMAGPNQLC
IRRWTTKHVAVWLKDEGFFEYVDILCNKHRLDGITLLTLTEYDLRSPPLEIKVLGDIKRL
MLSVRKLQKIHIDVLEEMGYNSDSPMGSMTPFISALQSTDWLCNGELSHDCDGPITDLNS
DQYQYMNGKNKHSVRRLDPEYWKTILSCIYVFIVFGFTSFIMVIVHERVPDMQTYPPLPD
IFLDSVPRIPWAFAMTEVCGMILCYIWLLVLLLHKHRSILLRRLCSLMGTVFLLRCFTMF
VTSLSVPGQHLQCTGKIYGSVWEKLHRAFAIWSGFGMTLTGVHTCGDYMFSGHTVVLTML
NFFVTEYCEIVDCYLDPSTVSDTLLVP

>gi568815588r:74994679_75204076|GENSCAN_predicted_CDS_4|1164_bp
atgcctgcgcgcagtcgccaccgcccccgcctccactccggctccccgccccgggctccg
cccccgccgcttgaggcgcttcactccggcgaggcggggagggccccggactccgacggc
ggctcggacgccgactcggaggcagcggaggaggaaatggcaggtcctaatcaactctgc
attcgccgctggactaccaagcatgtagctgtgtggctgaaggatgaaggcttttttgaa
tatgtggacattttatgcaataagcaccgacttgatggaatcacattgctaacattgact
gaatatgatctccggtctcctcctctggaaatcaaagtcttaggggacattaaaaggtta
atgctctcagtccgaaaattgcagaaaatacatattgatgttttagaagagatgggctac
aacagtgacagtcccatgggttccatgacccctttcatcagtgctcttcagagtacagac
tggctctgtaatggggagctttcccatgactgtgacggacccataactgacttgaattct
gatcagtaccagtacatgaatggtaaaaacaaacattctgttcgaagattggacccagaa
tactggaagactatactgagttgtatatatgtttttatagtatttggatttacatctttc
attatggttatagtccatgagcgagtgcctgacatgcagacctatccaccactcccagat
atattcttagacagcgttcctagaatcccatgggcctttgccatgacggaagtatgtggc
atgattctgtgctatatttggctcctggttcttcttcttcacaagcacaggtcaatactt
ctgcgaaggctctgtagtctgatgggaactgtattcttgcttcgctgctttaccatgttt
gtgacctccctctccgtgccaggacaacacctgcagtgtactggaaagatatatggcagt
gtatgggagaaattacatcgagcctttgccatttggagtggctttggtatgaccctgact
ggcgttcacacatgtggagattacatgtttagtggccacacagtcgtcctaactatgctg
aatttctttgtcaccgaatactgtgagattgttgactgttacttagaccccagcacagtg
tctgacacattactagttccttaa