GENSCAN 1.0 Date run: 3-Nov-116 Time: 12:46:13 Sequence gi568815581f:68415400_68630446 : 215047 bp : 43.33% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1570 1609 40 0 1 124 93 -13 0.146 0.60 1.02 Term + 4790 5064 275 0 2 90 42 227 0.996 13.93 1.03 PlyA + 5382 5387 6 1.05 2.10 PlyA - 5901 5896 6 1.05 2.09 Term - 10776 10655 122 1 2 115 36 136 0.822 9.74 2.08 Intr - 11790 11736 55 0 1 59 94 28 0.517 -0.95 2.07 Intr - 13537 13430 108 1 0 86 72 35 0.661 2.18 2.06 Intr - 14761 14597 165 1 0 50 73 97 0.775 4.56 2.05 Intr - 18176 18065 112 1 1 119 17 92 0.451 5.58 2.04 Intr - 19227 19157 71 0 2 108 66 107 0.904 8.48 2.03 Intr - 20313 20221 93 0 0 70 87 94 0.991 7.66 2.02 Intr - 21080 20983 98 0 2 116 119 42 0.987 9.83 2.01 Init - 27623 27614 10 2 1 48 100 5 0.552 -1.52 2.00 Prom - 28095 28056 40 -0.36 3.05 PlyA - 29411 29406 6 1.05 3.04 Term - 33083 32985 99 2 0 54 53 106 0.901 1.83 3.03 Intr - 35498 35329 170 0 2 115 108 180 0.999 22.37 3.02 Intr - 37593 37511 83 2 2 90 92 64 0.991 6.28 3.01 Init - 42022 41943 80 1 2 74 87 78 0.947 5.49 3.00 Prom - 43556 43517 40 -5.06 4.00 Prom + 45475 45514 40 -7.36 4.01 Init + 46372 46421 50 0 2 52 37 76 0.610 -0.68 4.02 Intr + 47172 47222 51 0 0 72 91 45 0.627 1.22 4.03 Intr + 48614 48717 104 2 2 88 67 102 0.911 7.92 4.04 Term + 50762 50847 86 0 2 94 44 41 0.266 -1.88 4.05 PlyA + 52777 52782 6 1.05 5.00 Prom + 64527 64566 40 -1.36 5.01 Init + 76444 76518 75 0 0 62 115 44 0.701 5.59 5.02 Intr + 97310 97579 270 1 0 90 25 93 0.004 0.94 5.03 Intr + 99995 100177 183 1 0 77 93 147 0.009 14.08 5.04 Intr + 107357 107527 171 1 0 63 63 153 0.284 10.44 5.05 Intr + 108326 108417 92 1 2 51 97 86 0.965 4.59 5.06 Intr + 108617 108678 62 2 2 110 116 36 0.999 6.98 5.07 Intr + 109513 109559 47 2 2 108 58 78 0.998 4.93 5.08 Intr + 110355 110513 159 2 0 71 95 74 0.988 6.58 5.09 Intr + 112441 112501 61 0 1 74 95 36 0.983 1.21 5.10 Intr + 113471 113592 122 0 2 75 88 147 0.995 13.61 5.11 Intr + 114521 114602 82 1 1 49 76 54 0.995 -0.49 5.12 Term + 114878 115050 173 0 2 99 44 150 0.985 9.69 5.13 PlyA + 115243 115248 6 1.05 6.12 PlyA - 115678 115673 6 1.05 6.11 Term - 122342 122078 265 1 1 86 49 145 0.875 5.38 6.10 Intr - 123997 123938 60 0 0 111 103 11 0.870 2.75 6.09 Intr - 124567 124486 82 2 1 110 84 134 0.999 13.90 6.08 Intr - 125559 125450 110 2 2 116 82 258 0.999 27.93 6.07 Intr - 126766 126586 181 2 1 109 64 200 0.999 18.63 6.06 Intr - 127410 127295 116 2 2 70 80 76 0.989 5.09 6.05 Intr - 128322 128230 93 2 0 88 91 115 0.999 10.88 6.04 Intr - 136552 136474 79 2 1 89 107 91 0.999 9.71 6.03 Intr - 139428 139378 51 1 0 110 89 31 0.950 4.28 6.02 Intr - 140344 140160 185 0 2 92 95 131 0.681 13.63 6.01 Init - 185267 184864 404 2 2 91 94 673 0.268 62.30 6.00 Prom - 203259 203220 40 -1.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 100001 100177 177 1 0 99 93 132 0.823 14.06 S.002 Term + 165424 165566 143 2 2 94 42 87 0.843 2.79 S.003 Init + 173844 173904 61 2 1 81 80 60 0.862 5.91 S.004 Init - 204209 204109 101 2 2 97 80 69 0.878 6.73 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:68415400_68630446|GENSCAN_predicted_peptide_1|104_aa VNQGFFSWFGSIAGGARACDGSTGPELQHKFPLIFNLEDDTAEAVPLERGGAEYQAVLPE VRKVLADVLQDIANDNISSADYTQDPSVTPCCNPYQIACRCQAA >gi568815581f:68415400_68630446|GENSCAN_predicted_CDS_1|315_bp gttaatcagggatttttttcttggtttggatccattgctggtggagccagggcgtgtgat gggagcacggggcctgagctgcagcataagtttcctctgattttcaacctggaagacgat accgcagaagctgtgcccctagaaagaggtggtgcggagtaccaggctgtgctgcccgag gtcagaaaggttcttgcagacgtcctccaagacattgccaacgacaacatctccagcgca gattacactcaggacccttcagtaactccctgctgtaatccctaccaaattgcctgccgc tgtcaagccgcataa >gi568815581f:68415400_68630446|GENSCAN_predicted_peptide_2|277_aa MEDGLCALSINHSNSYLAYPGSLTSGEIVLYDGNSLKTVCTIAAHEGTLAAITFNASGSK LASASEKGTVIRVFSVPDGQKLYEFRRGMKRYVTISSLVFSMDSQFLCASSNTETVHIFK LEQVTNRQSTRRAFDLEWLHGKDVYGCYQLPPYPGVRHDASGQGFCHCTLELLRTEEHLY PLNDPEVATAASCVIQWTPLYVQFGSSGWRRVCLNQNPQSYAATVARPSASSASTVPGYS EDGGALRGEVIPEHEFATGPVCLDDENEFPPVSIRNP >gi568815581f:68415400_68630446|GENSCAN_predicted_CDS_2|834_bp atggaagacggtctatgtgctctctctatcaaccattccaattcttacctggcctatcct ggaagcctgacttcaggggagattgtgctttatgatggaaactccctgaaaacagtctgc actattgctgcccatgagggaacactagctgccatcaccttcaatgcctcaggctccaaa ctagcaagtgcgtctgaaaaaggcacagtcatccgggtgttctctgtccctgatgggcaa aagctctatgagttccggagagggatgaaaaggtatgtgacaatcagctctctagtgttc agtatggattcacaattcctctgcgcctccagtaacaccgagacggtacacatcttcaag ctggaacaggtcaccaacaggcaatcgaccagaagagccttcgacctggagtggctacat gggaaagatgtttatggctgctaccaactacctccctacccaggtgtcagacatgatgca tcaggacagggcttttgccactgcacgcttgaacttctccggacagaggaacatctgtac cctctcaacgatccagaagttgccacggctgctagttgcgtcatccagtggacaccttta tatgtacaatttggatcctcaggatggaggagagtgtgtcttaatcaaaacccacagtct tatgcagcgaccgtagccagaccaagtgcatcttcagcctccacggtgccaggttattct gaggacggcggggcgctgcgaggagaagttattcctgaacatgagtttgcgacgggacca gtgtgtcttgatgatgagaatgagtttcctcctgtgagcattaggaacccgtaa >gi568815581f:68415400_68630446|GENSCAN_predicted_peptide_3|143_aa MEAEAADAPPGGVESALSCFSFNQDCTSLATGTKAGYKLFSLSSVEQLDQVHGSNEIPDV YIVERLFSSSLVVVVSHTKPRQMNVYHFKKGTEICNYSYSSNILSIRLNRQNTDAPGTVS NSWQSHEPRSNSSRNQDLTSEPR >gi568815581f:68415400_68630446|GENSCAN_predicted_CDS_3|432_bp atggaggccgaggccgcggacgctcccccgggcggggttgagtcggcgctcagctgcttc tctttcaaccaggactgcacatccctagcaactggaactaaagccgggtataagctgttt tctctgagttctgtggagcagctggatcaagtccacggaagcaatgaaatcccggacgtc tacatcgtggagcgcctcttctccagcagcctggtggtggtagtcagtcacacaaaacca cggcagatgaacgtgtatcacttcaagaaaggcacagagatctgtaattacagctactcc agcaacatcttgtccataaggctgaaccggcaaaacactgatgcaccaggaactgtcagc aattcatggcaaagccacgaaccaagatctaacagttctcggaaccaagatctaacgtcg gaaccaagatga >gi568815581f:68415400_68630446|GENSCAN_predicted_peptide_4|96_aa MCQYVEEAQGVSPMVPGNVYPHQAAAYAKVETVENTHYFMWEPEKMEISSDEQTGMFPTV INKASTRPEHTAPSKAFPINYWESPEMPRTCGDLPF >gi568815581f:68415400_68630446|GENSCAN_predicted_CDS_4|291_bp atgtgtcagtacgtggaggaagctcaaggagtcagccctatggtgccagggaatgtttat ccccatcaggcagctgcatatgccaaagtggagactgtcgagaacacccactacttcatg tgggaacctgagaaaatggaaatcagctcagatgagcagactggaatgttcccaacagtc ataaacaaggccagcacaaggccagaacatacagctccctccaaagcctttcctattaat tactgggagtccccggaaatgcctcgcacgtgcggggacctgcccttttag >gi568815581f:68415400_68630446|GENSCAN_predicted_peptide_5|498_aa MEVDKLLQKPPENLVLWHNRNKKHMASRPRLRVCILKAVWALALVRGSPAAPLALAAPTP RLLGPWALRGPPRPPSPPRAAASSLTLPHFLKDNGSILLGSAIVSSAGPGSSCKPRTMES GSTAASEEARSLRECELYVQKHNIQALLKDSIVQLCTARPERPMAFLREYFERLEKEEAK QIQNLQKAGTRTDSREDEISPPPPNPVVKGRRRRGAISAEVYTEEDAASYVRKVIPKDYK TMAALAKAIEKNVLFSHLDDNERSDIFDAMFSVSFIAGETVIQQGDEGDNFYVIDQGETD VYVNNEWATSVGEGGSFGELALIYGTPRAATVKAKTNVKLWGIDRDSYRRILMGSTLRKR KMYEEFLSKVSILESLDKWERLTVADALEPVQFEDGQKIVVQGEPGDEFFIILEGSAAVL QRRSENEEFVEVGRLGPSDYFGEIALLMNRPRAATVVARGPLKCVKLDRPRFERVLGPCS DILKRNIQQYNSFVSLSV >gi568815581f:68415400_68630446|GENSCAN_predicted_CDS_5|1497_bp atggaggtggacaaactgcttcaaaagccaccagaaaacttggtcctgtggcataacagg aacaaaaagcatatggcctcacgcccccggctccgagtctgcatcctcaaggccgtctgg gcgttggctttggtgcggggctccccagccgccccgctcgcgctcgctgcccccacccca cggctcctggggccctgggccctccgcggccctccccggccgccttcgcccccgcgcgcc gcggcctcttcgctcaccctcccccatttcctgaaggacaacggctctattttactgggg tccgccatcgtctcgtccgccggtcctggctcttcttgtaagccgagaaccatggagtct ggcagtaccgccgccagtgaggaggcacgcagccttcgagaatgtgagctctacgtccag aagcataacattcaagcgctgctcaaagattctattgtgcagttgtgcactgctcgacct gagagacccatggcattcctcagggaatactttgagaggttggagaaggaggaggcaaaa cagattcagaatctgcagaaagcaggcactcgtacagactcaagggaggatgagatttct cctcctccacccaacccagtggttaaaggtaggaggcgacgaggtgctatcagcgctgag gtctacacggaggaagatgcggcatcctatgttagaaaggttataccaaaagattacaag acaatggccgctttagccaaagccattgaaaagaatgtgctgttttcacatcttgatgat aatgagagaagtgatatttttgatgccatgttttcggtctcctttatcgcaggagagact gtgattcagcaaggtgatgaaggggataacttctatgtgattgatcaaggagagacggat gtctatgttaacaatgaatgggcaaccagtgttggggaaggagggagctttggagaactt gctttgatttatggaacaccgagagcagccactgtcaaagcaaagacaaatgtgaaattg tggggcatcgaccgagacagctatagaagaatcctcatgggaagcacactgagaaagcgg aagatgtatgaggaattccttagtaaagtctctattttagagtctctggacaagtgggaa cgtcttacggtagctgatgcattggaaccagtgcagtttgaagatgggcagaagattgtg gtgcagggagaaccaggggatgagttcttcattattttagaggggtcagctgctgtgcta caacgtcggtcagaaaatgaagagtttgttgaagtgggaagattggggccttctgattat tttggtgaaattgcactactgatgaatcgtcctcgtgctgccacagttgttgctcgtggc cccttgaagtgcgttaagctggaccgacctagatttgaacgtgttcttggcccatgctca gacatcctcaaacgaaacatccagcagtacaacagttttgtgtcactgtctgtctga >gi568815581f:68415400_68630446|GENSCAN_predicted_peptide_6|541_aa MPGLRRDRLLTLLLLGALLSADLYFHLWPQVQRQLRPRERPRGCPCTGRASSLARDSAAA ASDPGTIVHNFSRTEPRTEPAGGSHSGSSSKLQALFAHPLYNVPEEPPLLGAEDSLLASQ EALRYYRRKVARWNRRHKMYREQMNLTSLDPPLQLRLEASWVQFHLGINRHGLYSRSSPV VSKLLQDMRHFPTISADYSQDEKALLGACDCTQIVKPSGVHLKLVLRFSDFGKAMFKPMR QQRDEETPVDFFYFIDFQRHNAEIAAFHLDRILDFRRVPPTVGRIVNVTKEILEVTKNEI LQSVFFVSPASNVCFFAKCPYMCKTEYAVCGNPHLLEGSLSAFLPSLNLAPRLSVPNPWI RSYTLAGKEEWEVNPLYCDTVKQIYPYNNSQRLLNVIDMAIFDFLIGNMDRHHYEMFTKF GDDGFLIHLDNARGFGRHSHDEISILSPLSQCCMIKKKTLLHLQLLAQADYRLSDVMRES LLEDQLSPVLTEPHLLALDRRLQTILRTVEGCIVAHGQQSVIVDGPVEQLAPDSGQANLT S >gi568815581f:68415400_68630446|GENSCAN_predicted_CDS_6|1626_bp atgccggggctgcgccgggaccgcctactgactctgctgctgctgggcgcgctgctctcc gccgacctctacttccacctctggccccaagtacagcgccagctgcggcctcgggagcgc ccgcgggggtgcccgtgcaccggccgcgcctcctccctggcgcgggactcggccgcagct gcctcggaccccggcacgatcgtgcacaacttttcccgaaccgagccccggactgaaccg gctggcggcagccacagcgggtcgagctccaagttgcaggccctcttcgcccacccgctg tacaacgtcccggaggagccgcctctcctgggagccgaggactcgctcctggccagccag gaggcgctgcggtattaccggaggaaggtggcccgctggaacaggcgacacaagatgtac agagagcagatgaaccttacctccctggaccccccactgcagctccgactcgaggccagc tgggtccagttccacctgggtattaaccgccatgggctctactcccggtccagccctgtt gtcagcaaacttctgcaagacatgaggcactttcccaccatcagtgctgattacagtcaa gatgagaaagccttgctgggggcatgtgactgcacccagattgtgaaacccagtggggtc cacctcaagctggtgctgaggttctcggatttcgggaaggccatgttcaaacccatgaga cagcagcgagatgaggagacaccagtggacttcttctacttcattgactttcagagacac aatgctgagatcgcagctttccatctggacaggattctggacttccgacgggtgccgcca acagtggggaggatagtaaatgtcaccaaggaaatcctagaggtcaccaagaatgaaatc ctgcagagtgttttctttgtctctccagcgagcaacgtgtgcttcttcgccaagtgtcca tacatgtgcaagacggagtatgctgtctgtggcaacccacacctgctggagggttccctc tctgccttcctgccgtccctcaacctggcccccaggctgtctgtgcccaacccctggatc cgctcctacacactggcaggaaaagaggagtgggaggtcaatcccctttactgtgacaca gtgaaacagatctacccgtacaacaacagccagcggctcctcaatgtcatcgacatggcc atcttcgacttcttgatagggaatatggaccggcaccattatgagatgttcaccaagttc ggggatgatgggttccttattcaccttgacaacgccagagggttcggacgacactcccat gatgaaatctccatcctctcgcctctctcccagtgctgcatgataaaaaagaaaacactt ttgcacctgcagctgctggcccaagctgactacagactcagcgatgtgatgcgagaatca ctgctggaagaccagctcagccctgtcctcactgaaccccacctccttgccctggatcga aggctccaaaccatcctaaggacagtggaggggtgcatagtggcccatggacagcagagt gtcatagtcgacggcccagtggaacagttggccccagactctggccaggctaacttgaca agctaa