GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:40:28 Sequence gi568815581f:74492047_74692775 : 200729 bp : 47.42% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 12620 12904 285 1 0 95 65 126 0.464 8.54 1.02 Intr + 20176 20379 204 0 0 7 75 104 0.068 0.40 1.03 Term + 25021 25170 150 0 0 80 43 62 0.292 -1.19 1.04 PlyA + 25957 25962 6 1.05 2.16 PlyA - 27019 27014 6 1.05 2.15 Term - 30854 30692 163 1 1 125 44 177 0.996 14.41 2.14 Intr - 31605 31533 73 1 1 78 105 63 0.977 5.46 2.13 Intr - 34031 33702 330 0 0 79 110 241 0.939 21.00 2.12 Intr - 49690 49557 134 0 2 104 76 120 0.509 12.69 2.11 Intr - 50941 50815 127 2 1 76 82 78 0.986 5.74 2.10 Intr - 52901 52563 339 0 0 48 88 336 0.976 25.05 2.09 Intr - 57497 57367 131 1 2 107 76 143 0.815 15.34 2.08 Intr - 70851 70788 64 1 1 84 60 8 0.061 -4.62 2.07 Intr - 72328 71873 456 2 0 91 94 345 0.685 28.59 2.06 Intr - 77715 77596 120 1 0 96 56 33 0.585 1.37 2.05 Intr - 78247 78085 163 1 1 -2 98 80 0.033 -0.45 2.04 Intr - 88067 87983 85 1 1 100 32 118 0.238 7.22 2.03 Intr - 88571 88454 118 0 1 45 98 55 0.314 1.72 2.02 Intr - 96803 96465 339 0 0 101 110 248 0.992 23.75 2.01 Init - 100156 100117 40 1 1 73 109 69 0.987 5.92 2.00 Prom - 104394 104355 40 -2.56 3.00 Prom + 108469 108508 40 -3.36 3.01 Init + 110082 110181 100 2 1 85 52 70 0.832 3.53 3.02 Intr + 111804 112095 292 1 1 83 99 113 0.907 8.09 3.03 Intr + 114375 114459 85 0 1 62 63 114 0.558 6.12 3.04 Term + 117699 117752 54 2 0 114 34 23 0.348 -2.94 3.05 PlyA + 117846 117851 6 1.05 4.05 PlyA - 117859 117854 6 1.05 4.04 Term - 120727 120607 121 0 1 110 43 76 0.813 3.25 4.03 Intr - 121987 121879 109 2 1 72 98 48 0.784 3.54 4.02 Intr - 125524 125072 453 2 0 76 110 448 0.512 39.03 4.01 Init - 131575 131536 40 1 1 69 109 59 0.834 4.46 4.00 Prom - 131725 131686 40 -7.16 5.00 Prom + 132016 132055 40 -5.96 5.01 Init + 133994 134127 134 1 2 78 67 42 0.431 0.71 5.02 Intr + 134786 134909 124 2 1 42 53 66 0.397 -0.91 5.03 Intr + 137132 137271 140 1 2 14 28 323 0.961 18.16 5.04 Intr + 145768 145891 124 1 1 85 12 46 0.018 -2.71 5.05 Term + 161531 161662 132 1 0 68 42 139 0.739 5.39 5.06 PlyA + 163576 163581 6 1.05 6.00 Prom + 169315 169354 40 -3.26 6.01 Init + 178825 179176 352 0 1 66 27 181 0.263 7.12 6.02 Intr + 179236 179612 377 2 2 29 92 321 0.704 21.33 6.03 Intr + 191825 191891 67 1 1 69 98 58 0.062 3.38 6.04 Term + 193633 193649 17 2 2 131 49 1 0.339 -1.10 6.05 PlyA + 196575 196580 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 39304 39265 40 1 1 71 109 81 0.935 6.87 S.002 Init - 53736 53676 61 0 1 60 109 111 0.949 9.81 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:74492047_74692775|GENSCAN_predicted_peptide_1|212_aa QGELIALTQALTLAKGLRVNIYTDSKYAFHILHHHAVTWAERNFLTTQESSIINVSLIKP LLKAALLSKEAGVIRCKGHQKASDPIAQGNSYADKCVDSILTIRCSVSKMLGKKHPEEHK SEIQPLPGLLQGSRPMHEAQPESTCLREQTSKAYIGSFHLARKDEDSSEAMTTEQEGTCI NQFQGHHQGQTNKPKAPNQQLTVKQERKQWLP >gi568815581f:74492047_74692775|GENSCAN_predicted_CDS_1|639_bp caaggcgaactcattgccttaactcaggccctcactcttgcaaagggactacgcgtcaat atttatactgactctaaatatgccttccatatcctgcaccaccatgctgttacatgggca gaaagaaatttcctcactacacaagagtcctccattattaatgtgtccttaataaaacct cttcttaaagctgctttactttcaaaggaagctggagtcattcgctgcaagggccatcaa aaggcatcagatcccattgctcagggcaactcttatgctgataagtgtgtggactccatc ctcacaatcagatgctcagtgtccaaaatgctgggtaagaaacatcctgaagaacacaaa tctgagatacagcctcttccaggcctgctccaaggctccagaccaatgcatgaagcacag cctgagtccacgtgcctgagggaacagacctccaaagcctacattgggagcttccacctg gcaaggaaggatgaagacagttctgaggccatgaccacagagcaagaaggaacttgtatc aaccaattccaaggacaccaccaaggccaaacaaacaaaccaaaagccccaaatcagcaa ttaacagtgaaacaagagaggaagcagtggctgccatga >gi568815581f:74492047_74692775|GENSCAN_predicted_peptide_2|893_aa MWLSPSLLLLILPGYSIAAKITGPTTVNGSEQGSLTVQCAYGSGWETYLKWRCQGADWNY CNILVKTNGSEQEVKKNRVSIRDNQKNHVFTVTMENLKRDDADSYWCGTERPGIDLGVKV QVTINPESNREHKEPQGQVLVHRKRVRARSASKAFKNMMSDLIPERSPLKSTHFLFLFLL ELPLLLSMLGTVLWVSETEGQGPAGEGTAAPDAGLDMEKAWMSQMWVISSRNLSSPGGLQ GPEYICFKADWAPQPGAQAMLTHRLRIDERETSFCGYSMLGLGVLLWQSSGNTGRAEDLT LKVPLVQTLPGEDGETDLCFPGCLTVSGPSTVMGAVGESLSVQCRYEEKYKTFNKYWCRQ PCLPIWHEMVETGGSEGVVRSDQVIITDHPGDLTFTVTLENLTADDAGKYRCGIATILQE DGLSGFLPDPFFQVQVLVSSASSTENSVKTPASPTRPSQCQGSLFSSPYVLLLVLELPLL LSMLGAVVWVNGPQRSSGSRQSWPEGYFPLSHPMTVAGPVGGSLSVQCRYEKEHRTLNKF WCRPPQILRCDKIVETKGSAGKRNGRVSIRDSPANLSFTVTLENLTEEDAGTYWCGVDTP WLRDFHDPIVEVEVSVFPAGTTTASSPQSSMGTSGPPTKLPVHTWPSVTRKDSPEPSPHP GSLFSNVRFLLLVLLELPLLLSMLGAVLWVNRPQRSSRSRQNWPKGCFSIQGPESVRAPE QGSLTVQCHYKQGWETYIKWWCRGVRWDTCKILIETRGSEQGEKSDRVSIKDNQKDRTFT VTMEGLRRDDADVYWCGIERRGPDLGTQVKVIVDPEGAASTTASSPTNSNMAVFIGSHKR NHYMLLVFVKVPILLILVTAILWLKGSQRVPEEPGEQPIYMNFSEPLTKDMAT >gi568815581f:74492047_74692775|GENSCAN_predicted_CDS_2|2682_bp atgtggctgtccccatctctgctgcttctcatcctcccaggttactccattgccgctaaa atcactggtccaacaacagtgaatggctcggagcagggctcattgactgtgcagtgtgct tatggctcaggctgggagacctacttgaagtggcggtgtcaaggagctgattggaattac tgtaacatccttgttaaaacaaatggatcagagcaggaggtaaagaagaatcgagtttcc atcagggacaatcagaaaaaccacgtgttcaccgtgaccatggagaatctcaaaagagat gatgctgacagttattggtgtgggactgagagacctggaattgatcttggggtcaaagtt caagtgaccattaacccagaaagtaacagggaacataaggaaccccagggccaggtgctg gtccacaggaagagggtcagagcaaggtcagcctccaaggccttcaagaatatgatgtca gatctgatacctgaaaggtccccgctcaagagcacccacttcctgttcctgttcctcctg gagctgcctctgctcctgagcatgctggggaccgtcctctgggtttcagagacagaagga caagggcctgctggggaagggacagcggctccagatgctggccttgacatggaaaaagcc tggatgtcccagatgtgggtcatctcttcaaggaatttgtcatctcctgggggactccag ggcccagagtacatctgcttcaaagcggactgggctccgcagcctggtgcccaagcaatg ctgactcaccgcttgcgtatagacgagcgagaaacaagcttctgtggttacagcatgctg ggattaggagttttgctatggcaatcctcaggcaacactgggcgtgcagaggacctgaca ctcaaggttccactggtccagactcttccaggtgaagacggggaaacggacttgtgtttt ccaggctgtctgactgtgagtggccccagcaccgtgatgggcgccgtgggggaatccctg agtgttcagtgtcggtatgaagagaaatacaagacgtttaacaaatactggtgcagacaa ccatgcttgccaatttggcatgaaatggtggagaccggagggtctgagggagtggtgagg agtgaccaagtgatcatcacggaccatcctggagacctcaccttcaccgtgaccttggag aacctcacggcagacgatgcaggaaaataccgatgtgggattgcaacaatactgcaggaa gatggcctgtctggtttcctgcccgatcccttcttccaggttcaagtgctggtctcatcg gcctccagtactgagaactctgtgaagacacctgcatctcccaccaggcccagccaatgc caaggctccctgttcagcagcccctacgtcctgctcctggtcctggagctgcccctgctc ctgagcatgctgggtgccgtcgtctgggtgaacggacctcagagaagctctggaagcagg cagagttggccagagggctattttcctctgagccaccccatgaccgtggcgggccccgtg gggggatccctgagtgtgcagtgtcgctatgagaaggaacacaggaccctcaacaaattc tggtgcagaccaccacagattctccgatgtgacaagattgtggagaccaaagggtcagca gggaaaaggaatggccgagtgtccatcagggacagtcctgcaaacctcagcttcacagtg accctggagaatctcacagaggaggacgcaggcacctactggtgtggggtggatacaccg tggctccgagactttcatgatcccattgtcgaggttgaggtgtccgtgttcccggccggg acgaccacagcctccagcccccagagctccatgggcacctcaggtcctcccacgaagctg cccgtgcacacctggcccagcgtgaccagaaaggacagccccgaacccagcccacaccct ggctccctgttcagcaatgtccgcttcctgctcctggtcctcttggagctgcccctgctc ctgagcatgctgggtgccgtcctctgggtgaacagacctcagagaagctctagaagcagg cagaattggcccaagggctgtttctccatccaaggcccagagtctgtgagagccccagag caggggtccctgacggttcaatgccactataagcaaggatgggagacctacattaagtgg tggtgccgaggggtgcgctgggatacatgcaagatcctcattgaaaccagagggtcggag caaggagagaagagtgaccgtgtgtccatcaaggacaatcagaaagaccgcacgttcact gtgaccatggaggggctcaggcgagatgacgcagatgtttactggtgtgggattgaaaga agaggacctgaccttgggactcaagtgaaagtgatcgttgacccagagggagcggcttcc acaacagcaagctcacctaccaacagcaatatggcagtgttcatcggctcccacaagagg aaccactacatgctcctggtatttgtgaaggtgcccatcttgctcatcttggtcactgcc atcctctggttgaaggggtctcagagggtccctgaggagccaggggaacagcctatctac atgaacttctccgaacctctgactaaagacatggccacttag >gi568815581f:74492047_74692775|GENSCAN_predicted_peptide_3|176_aa MKMLWRSDTDTYWCGIERTGTDLGVQVEVTIYPATPRKQSEPKTTDAAAPGTADTAGPGT MDTAVPRKAETAVPGKRNTAVPGTADPASPGTADTAEPMRTTTPTVLASWPSLTQSTNST QPTALTSPLTRILLCNIHFLLPISLKGLLLVGLLCTVLQSTVQNPDTNSNTPHRYC >gi568815581f:74492047_74692775|GENSCAN_predicted_CDS_3|531_bp atgaagatgctctggaggtccgacactgacacctactggtgtgggattgagaggacaggc actgaccttggggtccaagttgaagtgaccatttacccagcaactccaagaaaacaatct gaacctaagacaacagacgcagctgcacctgggacagcagacacagctgggcctgggaca atggacacagctgtacctagaaaagcagaaacagctgtgcctgggaaaaggaacacagct gtgcctggaacagcagacccagcttcacctgggacagcagacacagcagaacctatgaga acaaccactccaacagttctggcctcctggccttctctcacccagagcaccaacagcacc cagcccacggctcttaccagccccctcaccaggatcctgctctgcaacatccacttcctg ctcccgatctccctgaaggggctgctgctcgtgggcctgctctgcactgtgctgcaaagc actgtgcagaaccctgacacaaattctaacacaccgcatcgctattgttag >gi568815581f:74492047_74692775|GENSCAN_predicted_peptide_4|240_aa MWLLPALLLLCLSGNTQVSLAIFPLSFRQTAADGAIWMETWDSGLCFPGCLSLKGPGSVT GTAGDSLTVWCQYESMYKGYNKYWCRGQYDTSCESIVETKGEEKVERNGRVSIRDHPEAL AFTVTMQNLNEDDAGSYWCKIQTVWVLDSWSRDPSDLVRVYVSPAITTPRRTTHPATPPI FLVVNPGRNLSTGEVLTQNSGFRLSSPHFLLVVLLKLPLLLSMLGAVFWVNRPQWAPPGR >gi568815581f:74492047_74692775|GENSCAN_predicted_CDS_4|723_bp atgtggctgctcccagctctactccttctctgcctctcaggaaacacccaggtctccctg gcaatcttccccttaagcttcagacagacggctgctgatggggccatctggatggagact tgggactcgggcttgtgttttccaggctgtttgtctctgaagggccccggctctgtgact ggcactgcgggggactctctgacagtgtggtgtcagtatgagagcatgtacaagggatat aacaagtactggtgccgaggacagtacgacacgtcatgtgagagcattgtggagaccaag ggagaagagaaggtggagaggaatggccgcgtgtccatcagagaccacccggaggctctc gccttcactgtgaccatgcagaacctcaatgaagatgatgctggatcttactggtgcaaa attcagacagtgtgggtcctggattcatggtcacgcgatccctcggacctggttagggtg tatgtttccccagcaattacaaccccaaggaggaccacacatccagccacacctcccatc ttcctggtggtgaaccctgggcgaaacctcagcaccggggaggtgttgacccaaaattca gggttccggctcagcagccctcacttcctgctcgtggtccttctgaagctgcccctgctc ctgagcatgctgggtgctgtcttctgggtgaacaggcctcagtgggctcctcctggaaga tag >gi568815581f:74492047_74692775|GENSCAN_predicted_peptide_5|217_aa MSNIGTFHYNPIRKSKLRPPSKTELTQKLSSGSKLKSNPIKGNSSFHATTAEFVTGPTSS YAHCTGTNQYTKTTGFAAEKEFNDHRHERQSETLLKKKEKERKRKRKKKKKKKKKKKKKK KRRKKKDEEKKKCYQGSVTRSGAPDTVITEDLLFLAGSKNDETSVVQTEGCNLVMKCIYM HLYNKWNLWQNSDDVDDDDDEDKDKDEGNSRHLHITV >gi568815581f:74492047_74692775|GENSCAN_predicted_CDS_5|654_bp atgtctaatataggcacatttcattataatccaattagaaaatccaaactccgaccacct agtaagaccgagttgacacagaagctatcatctggttccaaactgaagtcaaatccaata aaaggaaattccagctttcatgctacaacagcagagtttgttacaggaccaacgagttca tatgcccattgcactggaacaaaccaatacaccaagacaacagggtttgcagcagagaaa gagtttaatgatcacaggcatgagcgacagagtgagactctgttgaagaagaaggagaag gagaggaagaggaagaggaagaagaagaagaagaagaagaagaagaagaagaagaagaag aagaggaggaagaagaaggacgaggagaagaagaagtgctaccaggggagcgtgacccgt tctggggctccagacactgttatcacagaagatttgctctttctggcaggaagcaaaaac gatgagactagtgttgtccaaactgagggttgcaacctggttatgaagtgcatttacatg cacttgtataataagtggaatttatggcagaatagtgacgatgttgatgatgatgatgac gaagacaaagacaaagatgaaggcaacagtcgacacctacatatcactgtctaa >gi568815581f:74492047_74692775|GENSCAN_predicted_peptide_6|270_aa MCAGENVIKALDLLPRKTQVNRILHISGTLGTPKTHLLSTSDLEALFISYILERIVQGQK QKRRQRRRTSPLQTCEESCAALSCVVTEMPVRSGSGTVSSWDRNALRTQPGAEPPESASW SPKRQGLERTGSQNSGPGRTTAESLFLVLKRSVLEERGRLPAWYRAAAAAGNCPVLKTDA ARPAELRPKPAAPSGARTELLEALTQSAGRAGAASSKPGMDLQRPDSYQGGAGPDFNDHV LHKDSICEGHGQQEVWPGRQIRQSEDSQGP >gi568815581f:74492047_74692775|GENSCAN_predicted_CDS_6|813_bp atgtgtgctggagagaatgtgatcaaagccctggaccttctccctaggaaaacgcaggta aacagaattttgcatatttcagggactttggggacacctaagacccatctactgtcgacc tcagacttagaagctctatttatctcatacatcctggagagaattgtccagggacaaaaa cagaagcgccgccagcgccgcaggacttcgcctctgcaaacgtgtgaggagagctgcgct gcgctgagctgcgtggtgacggagatgcccgttcgctcgggatcaggaacggtgagctcc tgggaccgcaatgccctgagaacgcagccgggtgctgagcctccggagtccgcatcctgg agtcctaagaggcagggattggagcggacaggatctcagaactctggtcccgggcgcaca acggcggagtcgctgttcctggtgctgaaacgctcagtcctggaagaacgtggccgcctg cccgcctggtaccgcgccgcggccgctgcggggaactgtccagtgctgaaaacggatgcg gcccggcccgcagagctcagacccaagcctgccgcacccagcggagctcgaaccgagctc ctggaagcgctgacgcagagcgcagggagagccggagcggcgagctccaagcctggcatg gacctgcagagacccgattcctaccagggaggagctggccctgacttcaacgaccacgtc ctgcataaggacagcatctgcgagggtcacgggcagcaggaagtgtggcctggaaggcag atcagacagtccgaagattcccaaggtccttag