GENSCAN 1.0 Date run: 4-Nov-116 Time: 21:19:08 Sequence gi568815587r:120012417_120238031 : 225615 bp : 49.63% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 321 316 6 1.05 1.07 Term - 566 550 17 0 2 110 52 -1 0.010 -3.10 1.06 Intr - 1255 1113 143 0 2 51 72 82 0.014 2.90 1.05 Intr - 1743 1540 204 2 0 66 17 115 0.007 0.52 1.04 Intr - 6051 5877 175 1 1 99 68 23 0.160 0.40 1.03 Intr - 10584 10456 129 1 0 23 84 92 0.597 2.97 1.02 Intr - 10942 10895 48 2 0 82 119 5 0.643 1.65 1.01 Init - 13750 13675 76 1 1 76 56 132 0.934 8.12 1.00 Prom - 21841 21802 40 -3.46 2.00 Prom + 40333 40372 40 -4.06 2.01 Init + 46207 46338 132 0 0 55 94 61 0.473 3.58 2.02 Intr + 50558 50645 88 1 1 109 86 41 0.973 5.54 2.03 Term + 53689 53843 155 2 2 93 52 90 0.897 3.98 2.04 PlyA + 55848 55853 6 1.05 3.03 PlyA - 56317 56312 6 1.05 3.02 Term - 74662 74589 74 2 2 126 34 47 0.385 1.17 3.01 Init - 78547 78496 52 1 1 75 109 21 0.503 4.23 3.00 Prom - 82381 82342 40 -1.96 4.13 PlyA - 82627 82622 6 1.05 4.12 Term - 100060 99998 63 1 0 128 49 150 0.999 12.99 4.11 Intr - 102998 102922 77 0 2 124 95 46 0.998 8.13 4.10 Intr - 105905 105807 99 0 0 98 105 110 0.999 13.78 4.09 Intr - 108249 108157 93 1 0 111 100 142 0.978 17.64 4.08 Intr - 110023 109895 129 2 0 99 59 19 0.621 0.77 4.07 Intr - 110639 110538 102 0 0 129 116 162 0.998 23.25 4.06 Intr - 111804 111751 54 1 0 99 85 32 0.828 2.95 4.05 Intr - 112016 111903 114 0 0 9 93 98 0.861 2.82 4.04 Intr - 113473 113275 199 1 1 92 90 198 0.999 19.32 4.03 Intr - 115153 114920 234 1 0 57 92 523 0.916 47.49 4.02 Intr - 116079 115984 96 0 0 122 105 200 0.996 25.31 4.01 Init - 125615 124812 804 2 0 64 107 1473 0.996 141.52 4.00 Prom - 136739 136700 40 -4.06 5.02 PlyA - 137418 137413 6 1.05 5.01 Sngl - 138660 137776 885 0 0 62 48 161 0.621 5.56 5.00 Prom - 143440 143401 40 -5.46 6.00 Prom + 145889 145928 40 -5.16 6.01 Init + 157397 157777 381 1 0 73 72 196 0.241 11.47 6.02 Intr + 158043 158174 132 2 0 62 77 39 0.350 1.04 6.03 Term + 163106 163285 180 1 0 91 47 96 0.489 3.41 6.04 PlyA + 163780 163785 6 1.05 7.07 PlyA - 165066 165061 6 1.05 7.06 Term - 169454 169291 164 0 2 74 42 83 0.179 0.40 7.05 Intr - 175230 175106 125 2 2 72 58 55 0.168 1.13 7.04 Intr - 176757 176708 50 0 2 91 87 43 0.345 1.98 7.03 Intr - 177586 177402 185 2 2 25 99 56 0.144 -0.09 7.02 Intr - 180926 180877 50 1 2 69 113 12 0.169 0.12 7.01 Init - 181200 181124 77 0 2 60 50 93 0.423 3.29 7.00 Prom - 186136 186097 40 -4.26 8.00 Prom + 189676 189715 40 -5.96 8.01 Init + 198864 199094 231 2 0 60 100 708 0.999 65.46 8.02 Intr + 213245 213379 135 1 0 105 111 204 0.993 25.16 8.03 Intr + 214400 214580 181 1 1 38 82 273 0.993 21.14 8.04 Term + 216452 216726 275 0 2 158 46 437 0.999 42.13 8.05 PlyA + 217500 217505 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:120012417_120238031|GENSCAN_predicted_peptide_1|263_aa MVGCKSRALPRGEAAKARREIERSAGIVSNCLLEVESQMDAVESCFPEVASSLSPIPNVG LGETVGKGALLENTLKDKEPCFLPGSSNIKWIYLLKQIPYQDKGKSWIQWLPLSQGSTLW CLSFPFISGILQALLIPSQVAELQVAPGRQIGSKRGHPLLAAPWSPSRVWGPFLSTLETW GTAGRPDATDGSAAAGTLLVGVCEAHSGCPGLRQQAKQTALSSGCCSPGSIRAPAAAQLG SRHSPGQEAPAGGLRGGLGPGLQ >gi568815587r:120012417_120238031|GENSCAN_predicted_CDS_1|792_bp atggtgggctgcaagtcccgagccctgccccgcggggaggcagctaaggcccggcgagaa atcgagcgcagcgctggcattgtttcaaattgcctcctggaggtggagtctcagatggat gccgtagagagctgcttcccagaagttgcatcttccctgtcgcccatcccaaatgtgggc ttgggagagacagtgggcaagggagctctcctggaaaatacgctgaaggacaaagaacct tgcttcctgccaggcagcagcaacatcaaatggatctatttgctcaagcaaatcccctat caagacaaaggcaagtcttggattcagtggttgcctctgtcacaaggaagtaccttgtgg tgcctcagtttccctttcatatcaggaattcttcaggccttgctcattccctcccaggtg gcagagctgcaggttgctccaggtaggcagattggctccaagcggggccacccgctgctc gcagcaccctggagcccatcccgagtgtggggtcccttcctctccactctggaaacctgg ggcacagctggccgccctgatgccaccgacggctctgctgctgcagggactctgcttgtg ggggtgtgcgaggcccactccggctgccctggcctccggcaacaggcaaaacagactgct ctgagcagtggctgctgctccccggggtccatccgagcaccggcagcagcacaacttggc agccggcacagccccgggcaggaagctcctgcagggggactccgcggaggcctggggcca ggactccagtga >gi568815587r:120012417_120238031|GENSCAN_predicted_peptide_2|124_aa MLQTWGKALTRQNTEVMAQGAGAGPPSEKASRRKTLVAVVDLAQGFCKNQMRNSYGPNAG GYPLLDTSVMKEGGPSRGIQGSSPPPMPDAVSGAQSTRIGVQRPQLWAQLLSWCYVTLVN SLDD >gi568815587r:120012417_120238031|GENSCAN_predicted_CDS_2|375_bp atgctgcagacctgggggaaagctctgaccagacaaaacacggaggtgatggcccaaggt gcaggagcaggtcctcccagtgagaaggcttccaggaggaagaccttggtagctgtggta gaccttgcccagggtttttgtaagaatcaaatgaggaacagttatggcccaaatgctggc ggttatcctctattagacacaagcgtcatgaaggaagggggaccttccagaggcatccag ggcagctctccacccccaatgccagatgcagtatcaggtgcacagagcaccagaatagga gtccagagacctcagctgtgggcccagctcctcagttggtgctatgtgaccttggtcaat tcactggatgactga >gi568815587r:120012417_120238031|GENSCAN_predicted_peptide_3|41_aa MMDQMQYEVLHVCRPVPGSGDLIEDAAEFPSLAYGGAKRFS >gi568815587r:120012417_120238031|GENSCAN_predicted_CDS_3|126_bp atgatggatcagatgcaatatgaggtgcttcatgtctgcaggcctgtcccaggctctgga gaccttatagaggatgctgctgaattccctagcttggcttatggaggagcaaaaagattc agttga >gi568815587r:120012417_120238031|GENSCAN_predicted_peptide_4|687_aa MEAADASRSNGSSPEARDARSPSGPSGSLENGTKADGKDAKTTNGHGGEAAEGKSLGSAL KPGEGRSALFAGNEWRRPIIQFVESGDDKNSNYFSMDSMEGKRSPYAGLQLGAAKKPPVT FAEKGELRKSIFSESRKPTVSIMEPGETRRNSYPRADTGLFSRSKSGSEEVLCDSCIGNK QKAVKSCLVCQASFCELHLKPHLEGAAFRDHQLLEPIRDFEARKCPVHGKTMELFCQTDQ TCICYLCMFQEHKNHSTVTVEEAKAEKETELSLQKEQLQLKIIEIEDEAEKWQKEKDRIK SFTTNEKAILEQNFRDLVRDLEKQKEEVRAALEQREQDAVDQVKVIMDALDERAKVLHED KQTREQLHSISDSVLFLQEFGALMSNYSLPPPLPTYHVLLEGEGLGQSLGNFKDDLLNVC MRHVEKMCKADLSRNFIERNHMENVSHALSSEKLRLAVAAEVGLRSVIIIVPGVQTRQGS PSGHNAGHGLCTGVALRPVRGGDHRYVNNYTNSFGGEWSAPDTMKRYSMYLTPKGPGQSQ PARCRPSTDQCLPVLALWWPLTFLGASNKPQQYPLHKGGVRTSYQPSSPGRFTKETTQKN FNNLYGTKGNYTSRVWEYSSSIQNSDNDLPVVQGSSSFSLKGYPSLMRSQSPKAQPQTWK SGKQTMLSHYRPFYVNKGNGIGSNEAP >gi568815587r:120012417_120238031|GENSCAN_predicted_CDS_4|2064_bp atggaagctgcagatgcctccaggagcaacgggtcgagcccagaagccagggatgcccgg agcccgtcgggccccagtggcagcctggagaatggcaccaaggctgacggcaaggatgcc aagaccaccaacgggcacggcggggaggcagctgagggcaagagcctgggcagcgccctg aagccaggggaaggtaggagcgccctgttcgcgggcaatgagtggcggcgacccatcatc cagtttgtcgagtccggggacgacaagaactccaactacttcagcatggactctatggaa ggcaagaggtcgccgtacgcagggctccagctgggggctgccaagaagccacccgttacc tttgccgaaaagggcgagctgcgcaagtccattttctcggagtcccggaagcccacggtg tccatcatggagcccggggagacccggcggaacagctacccccgggccgacacgggcctt ttttcacggtccaagtccggctccgaggaggtgctgtgcgactcctgcatcggcaacaag cagaaggcggtcaagtcctgcctggtgtgccaggcctccttctgcgagctgcatctcaag ccccacctggagggcgccgccttccgagaccaccagctgctcgagcccatccgggacttt gaggcccgcaagtgtcccgtgcatggcaagacgatggagctcttctgccagaccgaccag acctgcatctgctacctttgcatgttccaggagcacaagaatcatagcaccgtgacagtg gaggaggccaaggccgagaaggagacggagctgtcattgcaaaaggagcagctgcagctc aagatcattgagattgaggatgaagctgagaagtggcagaaggagaaggaccgcatcaag agcttcaccaccaatgagaaggccatcctggagcagaacttccgggacctggtgcgggac ctggagaagcaaaaggaggaagtgagggctgcgctggagcagcgggagcaggatgctgtg gaccaagtgaaggtgatcatggatgctctggatgagagagccaaggtgctgcatgaggac aagcagacccgggagcagctgcatagcatcagcgactctgtgttgtttctgcaggaattt ggtgcattgatgagcaattactctctccccccacccctgcccacctatcatgtcctgctg gagggggagggcctgggacagtcactaggcaacttcaaggacgacctgctcaatgtatgc atgcgccacgttgagaagatgtgcaaggcggacctgagccgtaacttcattgagaggaac cacatggagaacgtctcccatgccctctcatctgagaagctgcggctggcggtggctgca gaagtggggttgaggagtgtgataatcatcgtccctggggtacaaactcggcagggcagc ccctccggtcacaatgctggccacggcctctgtacaggtgtggccctgcgtcctgtacga ggtggtgaccatcgctatgtgaacaactacacgaacagcttcgggggtgagtggagtgca ccggacaccatgaagagatactccatgtacctgacacccaaaggccctggacagtcccag cctgcaaggtgccggcctagcacagaccagtgcctgcctgtccttgccctgtggtggcct ctcacattcttgggcgccagcaacaaaccacagcaatacccgttacataagggtggggtc cggacatcataccagccctcgtctcctggccgcttcaccaaggagaccacccagaagaat ttcaacaatctctatggcaccaaaggtaactacacctcccgggtctgggagtactcctcc agcattcagaactctgacaatgacctgcccgtcgtccaaggcagctcctccttctccctg aaaggctatccctccctcatgcggagccaaagccccaaggcccagccccagacttggaaa tctggcaagcagactatgctgtctcactaccggccattctacgtcaacaaaggcaacggg attgggtccaacgaagccccatga >gi568815587r:120012417_120238031|GENSCAN_predicted_peptide_5|294_aa MEEYLPSKWKAQKAGFAILVFDKTDFQPKKVKRDKEGHYIMVKGSMQQEELTILNIYAPN TGASSFIKQVLGDPQRDLDSHTIIVGDFNTPLSILDRLMRQKINKDIQDLNSALDQVDLI DIYRTLHPKSTEYTFFSAPHHTYSKIDHIIGSKTLLSKCKRTEIITNSLSDHSAIKLELR IEKFTQNHTTTWKLNNLPLNDYWVNNEIKAEINKFFETNENRDTMYQNLWDTAKAVFRGK FIALNAHRRKQERSKINTLTSQLKELKKQEQTNSKASRRQEITKIRAELKETET >gi568815587r:120012417_120238031|GENSCAN_predicted_CDS_5|885_bp atggaggaatatttaccaagcaaatggaaagcacaaaaagcagggtttgcaatcctagtc tttgataaaacagactttcagccaaaaaaggtcaaaagagacaaagaagggcattacata atggtaaagggatcaatgcaacaagaagagctgactatcctaaatatatatgcacccaat acaggagcatccagcttcataaagcaagttcttggagacccacaaagagacttagactcc cacacaataatagtgggagactttaacaccccactgtcaatattagacagattaatgaga cagaaaattaacaaggatattcaggacttgaactcagctctggaccaagtggacctaata gatatctacagaactcttcaccccaaatcaacagaatatacattcttctcagcaccacat cacacttattctaaaattgaccacataattggaagtaaaacactcctcagcaaatgcaaa agaacagaaatcataacaaacagtctctcagaccacagtgcaattaaattagaactcagg attgagaaattcactcaaaaccacacaactacatggaaattgaacaacctgcccctgaat gactactgggtaaataatgaaattaaggcagaaataaataagttctttgaaacaaatgag aacagagacacaatgtaccagaatctctgggacacagctaaagcagtgtttagagggaaa tttatagcactaaatgcccacagaagaaagcaagaaagatctaaaatcaacactttaaca tcgcaattgaaagaactaaagaagcaagagcaaacaaattcaaaagctagcagaaggcaa gaaataactaagatcagagcagaactgaaggagacagagacatga >gi568815587r:120012417_120238031|GENSCAN_predicted_peptide_6|230_aa MDLESTGVPCPGSCSLWKAAGAGMPNGLNELLGDSLSLVNAEPTLTVRPNPTANNRAALG GSGLPVPGTGSGRGGCRCGSSARSAKLLCLGGRGHWATCSRGCPQAGLPGFLTARLAGND AALWHSEPAPTAHRNLQPPSLPQLSLCGPHYPTFCRAQLPYPLYRLSGAMVKMWKLKGPK MGEEGTELKGRAGEQVQTEAKEPSLAYLLLWKQFRGKELGILEPPECQLG >gi568815587r:120012417_120238031|GENSCAN_predicted_CDS_6|693_bp atggatctggaatccacgggggtcccctgcccaggatcgtgcagcctgtggaaggcagca ggcgccgggatgccgaatggattgaatgagcttctcggggattccctgagcttggtgaac gcagagcccaccctgacggtcaggcctaaccccaccgccaataacagagcagccctaggt ggaagtgggctgcctgtgcctggcactgggagcggcagaggcggctgccgctgtggaagt agcgcccgcagtgccaagctcctgtgcctgggaggacgcgggcactgggcgacctgcagt cggggctgcccccaggcagggctccctggctttctcacagcgcggctcgcgggaaacgac gctgccctatggcacagtgaacctgccccaactgctcatcggaacctgcagcccccttcc ctgccccaactgtccctctgcggaccccattaccctactttctgtcgagcccaactgccc tatcctttatacaggctgtcgggggcaatggtgaagatgtggaagctgaaaggacccaag atgggggaagagggtacagaactcaagggaagggctggggagcaggttcagacagaggca aaagaacccagccttgcttacctgctcctctggaaacagtttcgtggcaaagaattagga attctggagccgcccgaatgccagctgggatga >gi568815587r:120012417_120238031|GENSCAN_predicted_peptide_7|216_aa MWQPLLAKEAEVPLVEEDLGIGQQQPGGLAPACLWGRSGGSQVKEILQHKWLAQGPLACM HHCPAHKPHGSPLANVESTFLNLYSRPAIIFFLPDWPSGPVPWQGVEWLAAQSMVILAAG LSPLTVVPLFRSPSLKPHPHSLSYYQEQTPKYFCVSSGKILGASCLYRLPVQHRRLSAQW LQMVPLYCVGKDQDFFVHSPGPGSEWALAVNYMSEC >gi568815587r:120012417_120238031|GENSCAN_predicted_CDS_7|651_bp atgtggcaacctttgctggccaaggaggcagaggtgcctctggtggaagaagaccttggg atcggccagcagcagcccggaggactggcccctgcctgcctctggggccgctctggtggt agtcaagtcaaggagattctgcagcataaatggctggcccagggaccactggcctgtatg catcactgccctgctcataaacctcatggctccccactggcaaatgttgagtccaccttc ctcaacctgtattcaaggcctgccatcatcttcttcctgccagactggccttcaggtcct gtgccttggcagggtgtggagtggcttgcagcacagtccatggtcatccttgctgctggg ctctcccctctgactgtggtccctctgtttcgaagccctagcctcaagcctcatccccac tctctttcctactaccaagagcagacaccaaaatatttctgcgtttcctctggaaaaatc ctgggagcatcctgcctgtaccgactcccagttcagcacaggcgactttcagcccagtgg ctgcagatggttcctctctattgtgtgggcaaggaccaggacttctttgtccatagccca ggacctggctcagagtgggctctggctgttaattacatgtccgaatgctga >gi568815587r:120012417_120238031|GENSCAN_predicted_peptide_8|273_aa MRLPGVPLARPALLLLLPLLAPLLGTGAPAELRVRVRLPDGQVTEESLQADSDADSISLE LRKPDGTLVSFTADFKKDVKVFRALILGELEKGQSQFQALCFVTQLQHNEIIPSEAMAKL RQKNPRAVRQAEEVRGLEHLHMDVAVNFSQGALLSPHLHNVCAEAVDAIYTRQEDVRFWL EQGVDSSVFEALPKASEQAELPRCRQVGDHGKPCVCRYGLSLAWYPCMLKYCHSRDRPTP YKCGIRSCQKSYSFDFYVPQRQLCLWDEDPYPG >gi568815587r:120012417_120238031|GENSCAN_predicted_CDS_8|822_bp atgcgccttcccggggtacccctggcgcgccctgcgctgctgctgctgctgccgctgctc gcgccgctgctgggaacgggtgcgccggccgagctgcgggtccgcgtgcggctgccggac ggccaggtgaccgaggagagcctgcaggcggacagcgacgcggacagcatcagcctcgag ctgcgcaagcccgacggcaccctcgtctccttcaccgccgacttcaagaaggatgtgaag gtcttccgggccctgatcctgggggagctggagaaggggcagagtcagttccaggccctc tgctttgtcacccagctgcagcacaatgagatcatccccagtgaggccatggccaagctc cggcagaaaaatccccgggcagtgcggcaggcggaggaggttcggggtctggagcatctg cacatggatgtcgctgtcaacttcagccagggggccctgctgagcccccatctccacaac gtgtgtgccgaggccgtggatgccatctacacccgccaggaggatgtccggttctggctg gagcaaggtgtggacagttctgtgttcgaggctctgcccaaggcctcagagcaggcggag ctgcctcgctgcaggcaggtgggggaccacgggaagccctgcgtctgccgctatggcctg agcctggcctggtacccctgcatgctcaagtactgccacagccgcgaccggcccacgccc tacaagtgtggcatccgcagctgccagaagagctacagcttcgacttctacgtgccccag aggcagctgtgtctctgggatgaggatccctacccaggctag