GENSCAN 1.0 Date run: 2-Nov-116 Time: 22:30:41

Sequence gi568815595f:97987173_98188099 : 200927 bp : 36.68% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.11 PlyA -    177    172    6                               1.05
 1.10 Term -   2653   2432  222  1  0   26   47   162 0.521   1.83
 1.09 Intr -   5876   5680  197  0  2   49   87   224 0.017  16.61
 1.08 Intr -  20732  20592  141  0  0   46   92   113 0.959   6.90
 1.07 Intr -  21866  21784   83  1  2  105   87    37 0.994   3.66
 1.06 Intr -  25395  25172  224  0  2   97  110   111 0.433  10.20
 1.05 Intr -  30550  30483   68  2  2   72  115    53 0.283   4.11
 1.04 Intr -  34978  34874  105  2  0   46   74    64 0.126   0.07
 1.03 Intr -  38507  38395  113  1  2   -6  105   117 0.490   3.10
 1.02 Intr -  39009  38898  112  1  1   98   61    36 0.429   0.42
 1.01 Init -  39537  39408  130  0  1   56   38   301 0.354  22.26
 1.00 Prom -  42604  42565   40                              -3.75

 2.06 PlyA -  43134  43129    6                               1.05
 2.05 Term -  51443  51349   95  0  2  107   42    91 0.522   3.41
 2.04 Intr -  52540  52400  141  2  0   57   51    99 0.378   2.50
 2.03 Intr -  53800  53740   61  1  1  126   25    35 0.080  -1.61
 2.02 Intr -  77413  77333   81  1  0    6   87   101 0.007   0.72
 2.01 Init -  90627  90391  237  0  0   36   59   159 0.078   5.86
 2.00 Prom - 103414 103375   40                              -3.65

 3.03 PlyA - 103894 103889    6                               1.05
 3.02 Term - 104937 104514  424  2  1   55   48   128 0.592  -0.72
 3.01 Init - 108364 108141  224  1  2   88   72   201 0.774  16.58
 3.00 Prom - 117368 117329   40                              -3.95

 4.00 Prom + 119493 119532   40                              -4.65
 4.01 Init + 139675 139740   66  0  0   65  108     6 0.068   1.42
 4.02 Intr + 145508 145790  283  1  1   60   52   130 0.109   2.67
 4.03 Intr + 146136 146341  206  1  2  102    4   112 0.273   2.20
 4.04 Intr + 162196 162478  283  0  1   61   52   147 0.071   4.47
 4.05 Intr + 162824 163101  278  0  2  124   29    70 0.112   1.01
 4.06 Intr + 167739 167862  124  2  1   53   63    97 0.360   2.94
 4.07 Intr + 177145 177248  104  2  2   65   50    80 0.094   0.87
 4.08 Intr + 181510 181792  283  0  1   52   52   122 0.102   1.07
 4.09 Intr + 182138 182300  163  0  1  102   48    57 0.162   1.21
 4.10 Intr + 190354 190479  126  1  0   31   72   105 0.074   2.07
 4.11 Intr + 191175 192613 1439  0  2   44   60   281 0.066   9.19
 4.12 Intr + 199718 199828  111  0  0   69   68    73 0.018   2.83

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  46217  46079  139  1  1   59   47   147 0.922   4.15
S.002 Init -  47815  47691  125  1  2   74   99    28 0.820   2.19
S.003 Sngl -  90627  90256  372  0  0   36   38   201 0.830   5.77

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:97987173_98188099|GENSCAN_predicted_peptide_1|464_aa
MTRIHLVEPSGSCVIAPAAHDDDDDDDDDDDDDDDDSHLFLEGEHCLNLFTILRSFRFLA
VFVQRLFSPSDPKDHLGSLFSKQETRMKKDDSTKARPQKYEQLLHIEDNDFAMRPGFGVK
SFQYADTGYRVKTKAKVIKAACTVFACQRRGLYWSPVPVGIDVHVESIDSISETNMDFTM
TFYLRHYWKDERLSFPSTANKSMTFDHRLTRKIWVPDIFFVHSKRSFIHDTTMENIMLRV
HPDGNVLLSLRITVSAMCFMDFSRFPLDTQNCSLELESYAYNEDDLMLYWKHGNKSLNTE
EHMSLSQFFIEDFSASSGLAFYSSTGITTVLTMSTIITAVSASMPQVSYLKAVDVYLWVS
SLFVFLSVIEYAAVNYLTTVEERKQFKKTGKLKCTHCLGHWQNKNVQAYPWKDSISLLHF
REKTWTPMRTPNGGLKRDRAKRASFFSQTATVEWPKDDEKGLEC

>gi568815595f:97987173_98188099|GENSCAN_predicted_CDS_1|1395_bp
atgactaggatccacttagtagagccctcagggtcttgcgttattgctcctgctgcccac
gatgatgatgatgatgatgatgatgatgatgatgatgatgatgatgacagccaccttttt
ctggaaggagaacactgccttaatctgtttacaatcttgcggtcctttcgttttctcgct
gtcttcgtgcagagattgttcagccccagtgaccccaaggatcacctaggtagcttgttc
agcaaacaagaaactagaatgaagaaagatgacagtaccaaagcgcggcctcagaaatat
gagcaacttctccatatagaggacaacgatttcgcaatgagacctggatttggagtgaaa
tcatttcagtatgcagatactggctacagagtcaagacaaaagctaaagttataaaagct
gcttgcacagtctttgcttgtcaaaggagaggactatactggtctccagtgccagtaggt
atagatgtccatgttgaaagcattgacagcatttcagagactaacatggactttacaatg
actttttatctcaggcattactggaaagacgagaggctctcctttcctagcacagcaaac
aaaagcatgacatttgatcatagattgaccagaaagatctgggtgcctgatatctttttt
gtccactctaaaagatccttcatccatgatacaactatggagaatatcatgctgcgcgta
caccctgatggaaacgtcctcctaagtctcaggataacggtttcggccatgtgctttatg
gatttcagcaggtttcctcttgacactcaaaattgttctcttgaactggaaagctatgcc
tacaatgaggatgacctaatgctatactggaaacacggaaacaagtccttaaatactgaa
gaacatatgtccctttctcagttcttcattgaagacttcagtgcatctagtggattagct
ttctatagcagcacaggaatcaccacagtgctgaccatgtccacaatcatcactgctgtg
agcgcctccatgccccaggtgtcctacctcaaggctgtggatgtgtacctgtgggtcagc
tccctctttgtgttcctgtcagtcattgagtatgcagctgtgaactacctcaccacagtg
gaagagcggaaacaattcaagaagacaggaaagcttaaatgtactcactgtcttggtcac
tggcaaaacaagaatgttcaggcctatccctggaaggacagtatctctttacttcatttc
agagaaaagacctggacaccaatgcggacaccaaatggaggactcaaaagggacagagct
aaacgtgccagtttcttctcccagactgctacagtagagtggccaaaggatgatgagaaa
gggctggaatgttag

>gi568815595f:97987173_98188099|GENSCAN_predicted_peptide_2|204_aa
MNGEKLKAFPLRIGTRQGCPLSPLLFNVVLEVLARAITQEKEIKGIQISKQEVKPLLFAD
DTIIHQENLKDSSKKLLELGDEVDDQEHQEQGRLQPWMICESCEDKMESSLRNMGIFLNG
TRAQKVEQDSSGVLTRITADSVCQEGCDKRLKQGSWGGRWSTGYEKTMIYWLSGKIMATY
PKIGDTAILSKFSLYVAAKEIKTG

>gi568815595f:97987173_98188099|GENSCAN_predicted_CDS_2|615_bp
atgaatggggaaaagttgaaagcattccccctgagaattggaacaagacaaggatgccca
ctttcaccacttctattcaacgtagtactggaagtcctagccagagcaatcacacaagag
aaagaaataaagggcatccaaattagtaaacaagaagtcaaaccattgctgtttgctgat
gatacaatcatacaccaagaaaaccttaaagactcatccaaaaagctcctagaactgggt
gatgaggtagatgaccaagaacaccaggaacaggggcgcctgcagccctggatgatctgt
gagtcctgtgaggacaaaatggagtcttctctgagaaatatgggcatatttctcaatggt
accagggcccagaaagtagagcaagattcttcaggtgtcctcactcgtatcacagctgat
tctgtatgccaggaaggatgtgataagagactgaaacaagggtcgtgggggggacgatgg
tctactgggtatgaaaaaacgatgatctactggttatctggcaaaataatggctacatac
cccaagattggcgacacagccatcctttccaaattttctctttatgtcgctgccaaagaa
ataaaaactggctga

>gi568815595f:97987173_98188099|GENSCAN_predicted_peptide_3|215_aa
MGRNQTRKAENSKNQSASSPPKDRSSSPVREQNWMESVFDKLTEVGFRRSLITNFSELKK
HVLTHRKEAKILEKSVGSSGQGNQARERNKEYSIRKRGSQNVSVCRDMIVYLENPIVSGQ
NLLKLISNFSKVSGYKISVQKSQAFLYTNNRQTESQIMSQLPFTIATKRIKYLGIQLTRD
VKDLFKENYKPLLNEIKQDMYKWKNIPCSWIESIS

>gi568815595f:97987173_98188099|GENSCAN_predicted_CDS_3|648_bp
atggggagaaatcagaccagaaaagctgaaaattccaaaaaccagagtgcttcttctcct
ccaaaggaccgcagctcctcaccagtaagggaacaaaactggatggagagcgtgtttgac
aagttgacagaagtaggcttcagaaggtcactaataacaaacttctctgagctaaagaag
catgttctaacccatcgcaaggaagctaaaatccttgaaaaaagtgttggaagttctggc
cagggcaatcaggcaagagaaagaaataaagagtattcaattaggaaaagaggaagtcaa
aacgtctctgtttgcagagacatgattgtatatttagaaaaccccatcgtctcaggccaa
aatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcagtgtgcaa
aaatcacaagcattcctatacacaaacaatagacaaacagagagccaaatcatgagccaa
ctcccattcacaattgctacaaagagaataaaatacctaggaatccaacttacaagggat
gtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaacaggacatg
tacaaatggaagaacattccatgctcatggatagaatcaatatcgtga

>gi568815595f:97987173_98188099|GENSCAN_predicted_peptide_4|1156_aa
MGQAQNKYLNSKREKRRNNRSQRACCEDMEEENATLLTEFVLTGFLYQPQWKIPLFLAFL
VIYLITIMGNLGLIAVIWKDPHLHIPMYLLLGNLAFVDAWISSTVTPKMLNNFLAKSSIQ
VFSIVTILVSYTFVLFAILKKKSDKGVRKAFSTCGAHLFSVSLYYGPLLFIYVGPASPQA
DDQDMRTCSEDMEEENATLLTEFVLTGFLYQPQWKIPLFLAFLVIYLITIMGNLGLIAVI
WKDPHLHIPMYLLLGNLAFVDALLSSSVTLKMLINFLAKSSIQVFTIGTVLISYIFVLYT
ILKKKSVKGMRKAFSTCGAHLLSVSLYYGPLAFMYMGSASPQADDQDMMESLFYTVIVPL
LNPMIYSLRNKQIITRLEIMAPDHHATGGHGILTFLTAPTDNIPIVKSKIGVQERSCSTA
ATYFGQCLNMLYNHRYPGPRLLFCSQLIRTCSEDMEEENATLLTEFVLTGFLYQPQWKIP
LFLAFLVIYLITIMGNLGLIAVIWKDPHLHIPMYLLLGNLAFVDAWISSTVTPKMLNNFL
AKSSIQVFSIVTILISYTFVLFTVLEKKSDKGVRKAFSTCGAHLFSVCLYYGPLLLILNQ
EEVEFLNRPTGFEIVAIINSLLTKKSPGPDGFTAEFYQRPKSPSVYKQLQQSLRIQNHVQ
KSQAFLYTNNRQTESQIMSDLPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDT
NKWKNIPCSWVGRTNIMKMAILPKVIYRFNAIPNKLPMAFFTELEKTTLKFIWSQKRAHI
TKSVLSQKNKAGGSTLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTQPSEIMPHIYNHLI
FDKPDKNKKWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKVNSRWIKDLHVRHKTIKT
LEENLGNTIQDIGMGKDFMSKTPKATATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTKW
EKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCS
SSLAIREMQIKTTMRYHLTPVRMTIIKMSGNNRCWRRCGEIGTLLHCWLDCKLVQPLWKS
VWRFLRDLELEIPFDPAIPLLGIYPKDYKSCCYKDTCTRTDLAQEDSHHCKMAETKTQYC
HAVTGHVPKDMKQDGX

>gi568815595f:97987173_98188099|GENSCAN_predicted_CDS_4|3468_bp
atgggccaagcacagaacaaatatttaaattccaaaagagagaaaagaaggaataacagg
tctcaaagggcatgctgtgaggacatggaagaggaaaatgcaacattgctgacagagttt
gttctcacaggatttttatatcaaccacagtggaaaatacccctgttcctggcattcttg
gtaatatatctcatcaccatcatggggaatcttggtctgattgctgtcatctggaaagac
cctcaccttcatatcccaatgtacttactccttgggaatttagcttttgtggatgcttgg
atatcatccacagtgaccccaaagatgctgaataacttcttagctaagagttcaattcag
gtattcagcattgtgactattcttgtatcttatacatttgttctcttcgcaatcttaaaa
aagaaatctgataaaggtgtaaggaaagccttttccacctgtggagcccatctcttctct
gtctctttatactatggaccccttctcttcatttatgtgggccctgcatctccgcaagca
gatgatcaagatatgaggacatgcagtgaggacatggaagaggaaaatgcaacattgctg
acagagtttgttctcacaggatttttatatcaaccacagtggaaaatacccctgttcctg
gcattcttggtaatatatctcatcaccatcatggggaatcttggtctgattgctgtcatc
tggaaagaccctcatcttcatatcccaatgtacttactccttgggaatttagcttttgtg
gatgctttgttatcatcctcagtgactctgaagatgctgatcaacttcttagctaagagt
tcaattcaagtttttaccatagggactgttcttatatcttacatatttgtcctctataca
atcttgaaaaagaagtctgtcaaaggtatgagaaaagccttctccacctgtggagctcat
ctcttatctgtatctttatactatgggcccctcgccttcatgtatatgggctctgcatcc
ccacaggctgatgaccaagatatgatggagtctctattttacactgtcatagttccttta
ttaaatcccatgatctacagcctgagaaacaagcaaattatcacaagattagaaattatg
gctcctgatcatcatgcaactggaggccatggaattctaaccttcctaactgctcccaca
gataatattcctattgtgaaatctaagattggtgttcaagaaaggagctgtagtactgca
gccacctactttgggcagtgcctgaatatgttgtacaatcatcggtatccaggtccacgt
cttctcttctgctctcaattgattaggacatgcagtgaggacatggaagaggaaaatgca
acattgctgacagagtttgttctcacaggatttttatatcaaccacagtggaaaataccc
ctgttcttggcattcttggtaatatatctcatcaccatcatggggaatcttggtctgatt
gctgtcatctggaaagaccctcaccttcatatcccaatgtacttactccttgggaattta
gcttttgtggatgcttggatatcatccacagtgaccccaaagatgctgaataacttctta
gctaagagttcaattcaggtattcagcattgtgactattcttatatcttacacatttgtt
ctcttcacagtcttagaaaagaaatctgataagggtgtaaggaaagccttttccacctgt
ggagcccatctcttctctgtctgtttatactatggcccccttctcttaatactaaaccag
gaagaagttgaatttctgaatagaccaacaggctttgaaattgtggcaataattaatagc
ttgctaaccaaaaaaagtccaggaccagatggattcacagctgaattctaccagaggcca
aaatctccttcagtttataagcaacttcagcaaagtctcaggatacaaaatcatgtacaa
aaatcacaagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgac
ctcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggat
gtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggataca
aacaaatggaagaacattccatgctcatgggtaggaagaaccaatatcatgaaaatggcc
atactgcccaaggtaatttatagattcaatgccatccccaacaagctaccaatggctttc
ttcacagaattggaaaaaactactttaaagttcatatggagccaaaaaagagcccacatc
accaagtctgtcctaagccaaaagaacaaagctggaggcagcacgctacctgacttcaaa
ctatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatata
gatcaatggaacagaacacagccctcagaaataatgccgcatatctacaaccatctgatc
tttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaataaatggtgc
tgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacaccttat
acaaaagttaattcaagatggattaaagacttacatgttagacataaaaccataaaaacc
ctagaagaaaacctagggaataccattcaggacataggcatgggcaaggacttcatgtct
aaaacaccaaaagcaacggcaacaaaagccaaaattgacaaatgggatctaattaaacta
aagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaaaatgg
gagaaaatttttgcaacctactcatctgacaaagggctaatatccagaatctacaatgaa
ctcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcaaaggatatg
aacagacacttctcaaaagaagacatttatgcagcaaaaaaacacatgaaaaaatgctca
tcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcacacca
gttagaatgacgatcattaaaatgtcaggaaacaataggtgctggagaagatgtggagaa
ataggaacacttttacactgttggttggactgtaaactagttcaaccattgtggaagtca
gtgtggcgattccttagggatctagaactggaaataccatttgacccagccatcccatta
ctgggaatatacccaaaggattataaatcatgctgctataaagacacatgcacacgaact
gacttagcacaagaagacagccaccattgtaaaatggcggagactaaaacacagtattgc
catgcggttacaggtcatgttcccaaagacatgaaacaagacggagnn