GENSCAN 1.0	Date run: 5-Nov-116	Time: 16:41:07

Sequence gi568815581r:66112136_66329379 : 217244 bp : 41.26% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -   1151   1146    6                               1.05
 1.03 Term -   3898   3809   90  1  0   58   55    97 0.081   0.14
 1.02 Intr -  17688  17611   78  0  0  123   50   110 0.894   9.53
 1.01 Init -  23054  23004   51  2  0   54   92    21 0.277   0.31
 1.00 Prom -  23318  23279   40                              -7.25

 2.00 Prom +  23536  23575   40                              -5.85
 2.01 Init +  24853  24909   57  0  0   76   94    31 0.031   3.86
 2.02 Intr +  42892  42960   69  0  0  118   74    47 0.038   4.76
 2.03 Term +  44580  45302  723  2  0  -12   42   406 0.791  18.49
 2.04 PlyA +  45530  45535    6                               1.05

 3.00 Prom +  45708  45747   40                              -4.95
 3.01 Sngl +  46970  47542  573  1  0   70   37   171 0.775   6.14
 3.02 PlyA +  47737  47742    6                               1.05

 4.00 Prom +  48352  48391   40                              -3.75
 4.01 Init +  52909  53084  176  0  2   38   44   114 0.157   0.87
 4.02 Term +  57739  57916  178  1  1   92   34   116 0.383   2.78
 4.03 PlyA +  58882  58887    6                               1.05

 5.00 Prom +  62812  62851   40                              -3.45
 5.01 Init +  80054  80136   83  1  2   95   43   110 0.016   5.69
 5.02 Intr +  85460  86050  591  2  0   -8   63   624 0.029  41.08
 5.03 Intr +  86188  86544  357  1  0  -11   32   339 0.259  11.85
 5.04 Term +  91874  92075  202  2  1   32   55   156 0.326   2.58
 5.05 PlyA +  93004  93009    6                               1.05

 6.00 Prom +  96002  96041   40                              -5.75
 6.01 Init +  96277  96417  141  0  0   66   41   180 0.899  11.28
 6.02 Intr +  97444  97523   80  0  2   86   63    23 0.804  -2.97
 6.03 Term +  98152  98356  205  1  1   55   38   143 0.809   1.86
 6.04 PlyA +  98528  98533    6                               1.05

 7.09 PlyA -  99447  99442    6                               1.05
 7.08 Term - 100053  99998   56  1  2   88   45    48 0.764  -2.76
 7.07 Intr - 102515 102318  198  0  0   82  110   128 0.928  12.80
 7.06 Intr - 104832 104653  180  1  0   22   86   138 0.950   5.92
 7.05 Intr - 108607 108419  189  2  0   30   98   195 0.679  13.34
 7.04 Intr - 111639 111563   77  2  2  105   99   112 0.773  12.24
 7.03 Intr - 113989 113893   97  2  1   58   86    69 0.994   1.85
 7.02 Intr - 116061 115885  177  1  0  101  113   142 0.996  16.97
 7.01 Init - 117244 117181   64  1  1   68   83    13 0.944  -0.12
 7.00 Prom - 119578 119539   40                              -6.45

 8.04 PlyA - 120877 120872    6                               1.05
 8.03 Term - 136532 136285  248  0  2   97   54   220 0.066  14.47
 8.02 Intr - 155876 155547  330  0  0   24   64   178 0.286   3.68
 8.01 Init - 157657 157537  121  1  1   68   74   103 0.784   7.30
 8.00 Prom - 165793 165754   40                              -6.05

 9.03 PlyA - 167602 167597    6                               1.05
 9.02 Term - 168890 168662  229  1  1   43   41   154 0.149   1.32
 9.01 Init - 190404 190145  260  0  2   52   21   416 0.288  25.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  71172  71059  114  1  0   67  110   119 0.887  11.50
S.002 Init -  73998  73989   10  0  1   86  111     0 0.857   2.85
S.003 Init -  81847  81800   48  1  0  102  110     7 0.823   5.30
S.004 Init - 146449 146327  123  1  0   56   91   118 0.941   9.12
S.005 Init + 190717 190889  173  0  2  107   82   209 0.994  21.16
S.006 Intr + 192365 192432   68  2  2  105   70    48 0.828   2.33

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:66112136_66329379|GENSCAN_predicted_peptide_1|72_aa
MPLHYLGFIGARIRGNIEVYSKKSPVSLDDSDIEARLNSWNLGKCGNLAIVLSVAFILGL
IETCFNLHFGAY

>gi568815581r:66112136_66329379|GENSCAN_predicted_CDS_1|219_bp
atgcctctccattacttgggcttcattggagcacgtattaggggtaacatagaagtttat
tcgaaaaaatctcctgtttctttggatgatagtgacattgaagctcgccttaatagttgg
aatcttgggaaatgtggaaacctggccattgtactcagtgtggccttcattttggggctg
attgagacctgctttaaccttcatttcggggcttattga

>gi568815581r:66112136_66329379|GENSCAN_predicted_peptide_2|282_aa
MAQRKEQNKSPETNPKEMKDDTTRRSSSDASTLILDFSASRTHENFVKHTQVSIAKLIKR
KKGYQRLKINLMKLSVKTRLEKKNEKEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLEN
TLQDIIQANFPNLARQTNIQIQEIQRALQRHSSRRATPRHIIVRFTKVAMKEKMLRAARE
KGHVTHKGKPIRLTSDLLAETLQARREWGPIFNILKEKNFQSRISYPAKLSFISEGEIKS
FTDEQMLRDFVTTRPALQELLKETLNMETKNQYQPLQKHTKL

>gi568815581r:66112136_66329379|GENSCAN_predicted_CDS_2|849_bp
atggcccaaagaaaggaacaaaataaatctccagaaaccaaccctaaagaaatgaaggat
gatacaacaagaaggtcctcatcagatgccagcaccttgattctggatttctcagcctcc
agaactcacgagaacttcgtgaagcatacgcaagtatcaatagccaaattgatcaagcgg
aagaaaggatatcagagattaaagatcaacttaatgaaattaagtgtgaagacaagatta
gagaaaaagaatgaaaaggaacaaagtctccaagaaatatgggactatgtgaaaagacca
aatttacgtctgattggtgtacctgaaagtgacggggagaatggaaccaagttggaaaac
actcttcaggatattatccaggcaaacttccccaacctggcaagacaaaccaatattcaa
attcaggaaatacagagagcactacaaagacactcctcaagaagagcaaccccaagacac
ataattgtcagattcaccaaggttgcaatgaaggaaaaaatgttaagggcagccagagag
aaaggtcacgttacccacaaagggaagcccatcagactaacatcggatctcttggcagaa
accctacaagccagaagagagtgggggccaatattcaacattcttaaagaaaagaatttt
caatccagaatttcatatccagccaaactaagcttcataagtgaaggagaaataaaatcc
tttacagacgagcaaatgttgagagattttgtcaccaccaggcctgccttacaagagctc
ctgaaggaaacactaaatatggaaacgaaaaaccagtaccagccactgcaaaaacatacc
aaattataa

>gi568815581r:66112136_66329379|GENSCAN_predicted_peptide_3|190_aa
MDKFLDTYTLPRLNQEEVKTLNRPITSSEIEAVINSLPTKKSPGPDGFTAEFYHRCKEEL
LPFLLKLFQTIEKEGLLPNSFYEARVILIPKPGRDTTKKENFRPISLMNIDAKIVNKILA
NRIQQHIKKFIHHNQVGFITGMQGWFNICKSINVIHHINRTNDKNHMIISIDAEKAFNKN
STPLHAKNSQ

>gi568815581r:66112136_66329379|GENSCAN_predicted_CDS_3|573_bp
atggataaattcctggacacatacaccctcccaagactaaaccaggaagaagtcaaaacc
ctgaatagaccaataacaagttctgaaattgaggcagtaattaatagcctaccaaccaaa
aaaagcccaggtccagatggattcacagccgaattctaccataggtgcaaagaggagctg
ttaccattccttctgaaactattccaaacaatagaaaaagagggactcctccctaactca
ttttacgaggccagagttatcctgataccaaaacctggcagagacacaaccaaaaaagaa
aatttcagaccaatatccctgatgaacatcgatgcaaaaatcgtcaataaaatactggca
aaccgaatccaacagcacatcaaaaagtttatccaccacaatcaagttggcttcatcact
gggatgcaaggctggttcaacatatgcaaatcaataaatgtaatccatcacataaataga
accaatgacaaaaaccacatgattatctcaatagatgcagaaaaggccttcaataaaaat
tcaacaccccttcatgctaaaaactctcaataa

>gi568815581r:66112136_66329379|GENSCAN_predicted_peptide_4|117_aa
MQVSTEERSPMWQSQFLCTSTQALVPIPTPHPHRDREQATSSPSGQSGFAMAKQKIRIWS
EVQAQLNWVFCLGPQKAAIMIGPDLGSHLEAQLGKDSLPDLCGCQQHSVPYGYLIHV

>gi568815581r:66112136_66329379|GENSCAN_predicted_CDS_4|354_bp
atgcaagtaagcacagaagaaaggtctcccatgtggcagagccagtttctatgcaccagc
acccaagccctagtccccattcctaccccacatccccaccgagacagagaacaggccacc
agctctccatctggtcagtcaggttttgcaatggccaagcagaagattcggatctggtca
gaagtccaggcacagctgaactgggtcttctgtttagggccacaaaaggctgcaatcatg
attggaccagacctgggttctcatctagaggctcaactagggaaggattcacttcccgac
ttgtgtggatgtcagcagcattccgttccttatggctatctgattcatgtatga

>gi568815581r:66112136_66329379|GENSCAN_predicted_peptide_5|410_aa
MATFPAAATTGAPGWGRRLPRGHFSTHREEEPGPDGAPGKSLPLPVFAGSREPGLGEVCL
DARAGSTEGGGPSPVLLSVVDHFNRIGKVGIQKCVIGLFWGSWQKKVLDVLNSFAVPFDE
DDKDNSMWFLDHDYLENMYGMFKKVNARERIVGWYHTGLKIHKNGITINELMKRYYPNSV
LIIIDVNPKDLGLPTEAYISVEEIQDDGTPTLKTFEHVTSEIGAESYLEKVATGKLRINH
QIIYELQDVFNLLPDVSLQEPIKAFYLRTNDQMVVVYLASLIHSVGLTDPFCGRPAQPHQ
QQDCQPGWRQERRAGEKREQKDRKADKGKDKDKEKSDVKKEQKNHCLKEKYRFRQDSLPD
DPRVLGLELTPPVARQYSPQDLSEIPRHADFRCDPAHFQLWWSQGETPST

>gi568815581r:66112136_66329379|GENSCAN_predicted_CDS_5|1233_bp
atggcgacgttcccggcggcggcgaccaccggcgccccaggctgggggcggagacttccg
cgcgggcatttctccacccacagggaagaagagcctgggcctgatggagccccagggaaa
tcactaccgctgccggtgtttgcaggtagcagggagccaggcctgggcgaggtgtgtctc
gatgccagagctggtagtacagaaggtggtgggccatccccggtgctgctcagcgtggtg
gatcatttcaaccgaatcggcaaggttggaatccagaaatgcgtcattggtctgttttgg
gggtcatggcaaaagaaagtacttgatgtattgaacagttttgcagttccttttgatgaa
gatgacaaagacaattctatgtggtttttagaccatgattatttggaaaacatgtatgga
atgtttaagaaagtcaatgccagagaaagaatagttggctggtaccacacaggcctgaaa
atacacaagaatggcattaccatcaatgaactcatgaaaagatactatcctaactcggta
ttgatcattattgacgtgaatccaaaggacctagggctgcccacagaagcatacatttca
gtggaagaaatccaagatgatggaactccaaccttgaaaacatttgaacatgtgaccagt
gaaattggagcagagagctacctggaaaaagtagccacaggcaagctgcgcatcaaccac
cagatcatctacgagctgcaggacgtcttcaacctgctgccagatgttagcctgcaggag
cctattaaagccttttacctgaggaccaatgaccagatggtggtagtgtacttggcctca
ctgatccattctgtgggcctcactgatccattctgtggtcgccctgcacaacctcatcaa
caacaagattgccaaccaggatggagacaagaaagaagggcaggagaaaaaagagagcaa
aaagatagaaaagctgacaaggggaaagataaagataaggaaaagagtgatgtaaagaaa
gagcagaaaaaccactgtcttaaagagaagtaccggttcaggcaggattcattaccggat
gatccaagagtcctcggacttgaattaacaccaccagttgccaggcagtactcaccacaa
gacttgagtgagatcccgcgccatgccgacttcaggtgtgacccagcacatttccagctg
tggtggtcacagggagagactccttctacttga

>gi568815581r:66112136_66329379|GENSCAN_predicted_peptide_6|141_aa
MKVFVTPPEEPLRPTEEPAKDEENTKWMVVEEEDEYQLQFEGQLQHQSQCSDYVWAREDI
LNAYCPPSMSRAVRLSKGNLELLNESLEGRGATLVSVKGGGAEKHLGEKIWSTSHGIGLS
SEKEAWVGKSYGECRLWRSDG

>gi568815581r:66112136_66329379|GENSCAN_predicted_CDS_6|426_bp
atgaaggtctttgtcacaccaccagaagagccactgagaccaacagaagagccagccaag
gatgaggagaatacgaaatggatggtggtggaggaagaagatgagtatcagttgcagttc
gaaggccaactgcagcatcagagccaatgcagtgactatgtgtgggcaagagaagacatt
ctaaatgcttactgccctccttccatgtctagagctgttaggttaagtaaagggaatttg
gagcttttgaatgaaagcctagaagggaggggagccactctggtttctgtaaaaggagga
ggagcagagaagcatcttggagaaaagatctggtccacatcacacggaatcgggttaagt
tctgagaaagaggcatgggtggggaaatcctatggagaatgcagattgtggagaagtgat
gggtag

>gi568815581r:66112136_66329379|GENSCAN_predicted_peptide_7|345_aa
MISPVLILFSSFLCHVAIAGRTCPKPDDLPFSTVVPLKTFYEPGEEITYSCKPGYVSRGG
MRKFICPLTGLWPINTLKCTPRVCPFAGILENGAVRYTTFEYPNTISFSCNTGFYLNGAD
SAKCTEEGKWSPELPVCAPIICPPPSIPTFATLRVYKPSAGNNSLYRDTAVFECLPQHAM
FGNDTITCTTHGNWTKLPECREVKCPFPSRPDNGFVNYPAKPTLYYKDKATFGCHDGYSL
DGPEEIECTKLGNWSAMPSCKASCKVPVKKATVVYQGERVKIQEKFKNGMLHGDKVSFFC
KNKEKKCSYTEDAQCIDGTIEVPKCFKEHSSLAFWKTDASDVKPC

>gi568815581r:66112136_66329379|GENSCAN_predicted_CDS_7|1038_bp
atgatttctccagtgctcatcttgttctcgagttttctctgccatgttgctattgcagga
cggacctgtcccaagccagatgatttaccattttccacagtggtcccgttaaaaacattc
tatgagccaggagaagagattacgtattcctgcaagccgggctatgtgtcccgaggaggg
atgagaaagtttatctgccctctcacaggactgtggcccatcaacactctgaaatgtaca
cccagagtatgtccttttgctggaatcttagaaaatggagccgtacgctatacgactttt
gaatatcccaacacgatcagtttttcttgtaacactgggttttatctgaatggcgctgat
tctgccaagtgcactgaggaaggaaaatggagcccggagcttcctgtctgtgctcccatc
atctgccctccaccatccatacctacgtttgcaacacttcgtgtttataagccatcagct
ggaaacaattccctctatcgggacacagcagtttttgaatgtttgccacaacatgcgatg
tttggaaatgatacaattacctgcacgacacatggaaattggactaaattaccagaatgc
agggaagtaaaatgcccattcccatcaagaccagacaatggatttgtgaactatcctgca
aaaccaacactttattacaaggataaagccacatttggctgccatgatggatattctctg
gatggcccggaagaaatagaatgtaccaaactgggaaactggtctgccatgccaagttgt
aaagcatcttgtaaagtacctgtgaaaaaagccactgtggtgtaccaaggagagagagta
aagattcaggaaaaatttaagaatggaatgctacatggtgataaagtttctttcttctgc
aaaaataaggaaaagaagtgtagctatacagaggatgctcagtgtatagatggcactatc
gaagtccccaaatgcttcaaggaacacagttctctggctttttggaaaactgatgcatcc
gatgtaaagccatgctaa

>gi568815581r:66112136_66329379|GENSCAN_predicted_peptide_8|232_aa
MTLKEHAAFKHLFNKAHLAPPLIHLTLSGHSTCFREHRVGVNEENLYGAIIVTPRAAKPD
IQRPCLGRDQSLSVASWNPRINLRDAGIMDEGLSSAPAPLFSHPSTNQARPCLASKIRQT
ENVQGDMAIDSWATFLRPQEPRDFLTKWRRIFSWRWRFAMLPRLYLYFPKRILLAEVNFQ
LASVAREMALSNDITSLIASLISQLIAISGALTVSGTGLRAGDAASKDRVSP

>gi568815581r:66112136_66329379|GENSCAN_predicted_CDS_8|699_bp
atgactcttaaggagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg
cccttaatccatttaaccctgagtggacacagcacatgtttcagagagcaccgggttggg
gtcaatgaagagaacctctatggggctatcattgtcacccccagagcagcaaagcctgac
atacagagaccatgtcttggcagggaccagagcttatccgtagcatcctggaacccaagg
atcaacttgagagatgctgggatcatggacgaaggccttagctcagcccctgccccactt
ttctcccatccaagtactaaccaggcccgaccctgcctagcttccaagatcagacagacg
gagaatgttcagggtgacatggccatagacagctgggccacttttctaaggccacaggag
cccagggactttctgaccaagtggaggaggattttttcgtggagatggaggtttgccatg
ttgcccaggctgtatctgtattttccaaagagaattttattagcggaagtgaacttccag
ttggcaagtgttgccagggaaatggctctgagcaacgacattacttccctcattgcttcc
cttatcagtcagttaatagctatttctggagcactgactgtgtcaggcactggtctacgt
gctggggatgcagccagtaaagacagagtcagcccttga

>gi568815581r:66112136_66329379|GENSCAN_predicted_peptide_9|162_aa
MPRARGREDERRGRAGRARAGVRRGCGGGGAERTGPSAEEAASALRCWEHWARGGGGGGG
GGGGGGGGGRGRAGPSGDAGEEAGSGGVTRGDASPMDQTGLTANLQAMVAASSVGSQDIG
VKTVPVRGPHLDPVLQAGGPLEKGLPSAPKGKENARCHHGCN

>gi568815581r:66112136_66329379|GENSCAN_predicted_CDS_9|489_bp
atgccccgagcgcgaggccgcgaggacgagcggcgggggcgcgcgggccgggcgcgcgcg
ggggtgcggcgagggtgcggcggcggcggcgctgagcggaccggaccgagtgcggaggag
gcagcgagtgccttgcggtgctgggaacactgggcaaggggaggcggcggcggcggcggc
ggcggcggcggcggcggcggcggcggccgggggagggccggcccgagtggggatgcgggg
gaggaggccgggagtggcggggtcacccggggggatgcttcgcccatggaccaaacaggc
ctaacagcaaacctgcaggcaatggttgctgcttcaagtgtgggaagccaggacattgga
gtgaagactgtcccagtcaggggaccccacctagaccctgtcctgcaagcaggagggcca
ttggaaaagggattaccttcagctccaaagggaaaggagaatgcccgatgccatcatggc
tgtaactag