GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:08:38 Sequence gi568815597f:23861458_24061874 : 200417 bp : 45.63% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 1814 1677 138 1 0 97 110 102 0.943 12.88 1.02 Intr - 4168 4034 135 0 0 113 105 48 0.972 8.68 1.01 Init - 6829 6441 389 1 2 79 89 687 0.852 62.18 1.00 Prom - 9966 9927 40 -4.36 2.02 PlyA - 10936 10931 6 1.05 2.01 Sngl - 14160 13078 1083 0 0 70 45 876 0.997 78.58 2.00 Prom - 27934 27895 40 -2.96 3.04 PlyA - 30263 30258 6 1.05 3.03 Term - 40608 40014 595 2 1 25 51 635 0.985 47.50 3.02 Intr - 40995 40742 254 0 2 -3 -23 429 0.719 18.93 3.01 Init - 41292 41014 279 0 0 70 5 460 0.850 32.91 3.00 Prom - 43037 42998 40 -2.86 4.08 PlyA - 46846 46841 6 1.05 4.07 Term - 50692 50529 164 2 2 94 55 87 0.489 4.10 4.06 Intr - 88045 87969 77 0 2 58 98 63 0.269 3.46 4.05 Intr - 98473 98235 239 1 2 127 59 221 0.954 19.51 4.04 Intr - 106360 106162 199 0 1 -12 59 53 0.079 -8.25 4.03 Intr - 110555 110393 163 0 1 98 106 67 0.788 8.53 4.02 Intr - 113656 113517 140 0 2 32 89 135 0.716 8.11 4.01 Init - 118798 118734 65 1 2 99 96 119 0.999 14.42 4.00 Prom - 126386 126347 40 -5.06 5.03 PlyA - 126725 126720 6 1.05 5.02 Term - 139750 139745 6 1 0 100 48 0 0.175 -4.93 5.01 Init - 147062 146826 237 2 0 110 35 318 0.951 26.71 5.00 Prom - 184208 184169 40 -3.66 6.05 PlyA - 184381 184376 6 1.05 6.04 Term - 196170 195907 264 0 0 79 48 466 0.641 37.21 6.03 Intr - 197522 197467 56 0 2 84 79 66 0.869 4.00 6.02 Intr - 199625 199498 128 1 2 104 9 103 0.491 4.32 6.01 Intr - 199852 199816 37 2 1 107 100 38 0.886 4.22 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:23861458_24061874|GENSCAN_predicted_peptide_1|221_aa MRAPGMRSRPAGPALLLLLLFLGAAESVRRAQPPRRYTPDWPSLDSRPLPAWFDEAKFGV FIHWGVFSVPAWGSEWFWWHWQGEGRPQYQRFMRDNYPPGFSYADFGPQFTARFFHPEEW ADLFQAAGAKYVVLTTKHHEGFTNWPSPVSWNWNSKDVGPHRDLVGELGTALRKRNIRYG LYHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDLVNS >gi568815597f:23861458_24061874|GENSCAN_predicted_CDS_1|663_bp atgcgggctccggggatgaggtcgcggccggcgggtcccgcgctgttgctgctgctgctc ttcctcggagcggccgagtcggtgcgtcgggcccagcctccgcgccgctacaccccagac tggccgagcctggattctcggccgctgccggcctggttcgacgaagccaagttcggggtg ttcatccactggggcgtgttctcggtgcccgcctggggcagcgagtggttctggtggcac tggcagggcgaggggcggccgcagtaccagcgcttcatgcgcgacaactacccgcccggc ttcagctacgccgacttcggaccgcagttcactgcgcgcttcttccacccggaggagtgg gccgacctcttccaggccgcgggcgccaagtatgtagttttgacgacaaagcatcacgaa ggcttcacaaactggccgagtcctgtgtcttggaactggaactccaaagacgtggggcct catcgggatttggttggtgaattgggaacagctctccggaagaggaacatccgctatgga ctataccactcactcttagagtggttccatccactctatctacttgataagaaaaatggc ttcaaaacacagcattttgtcagtgcaaaaacaatgccagagctgtacgaccttgttaac agn >gi568815597f:23861458_24061874|GENSCAN_predicted_peptide_2|360_aa MEECWVTEIANGSKDGLDSNPMKDYMILSGPQKTAVAVLCTLLGLLSALENVAVLYLILS SHQLRRKPSYLFIGSLAGADFLASVVFACSFVNFHVFHGVDSKAVFLLKIGSVTMTFTAS VGSLLLTAIDRYLCLRYPPSYKALLTRGRALVTLGIMWVLSALVSYLPLMGWTCCPRPCS ELFPLIPNDYLLSWLLFIAFLFSGIIYTYGHVLWKAHQHVASLSGHQDRQVPGMARMRLD VRLAKTLGLVLAVLLICWFPVLALMAHSLATTLSDQVKKAFAFCSMLCLINSMVNPVIYA LRSGEIRSSAHHCLAHWKKCVRGLGSEAKEEAPRSSVTETEADGKITPWPDSRDLDLSDC >gi568815597f:23861458_24061874|GENSCAN_predicted_CDS_2|1083_bp atggaggaatgctgggtgacagagatagccaatggctccaaggatggcttggattccaac cctatgaaggattacatgatcctgagtggtccccagaagacagctgttgctgtgttgtgc actcttctgggcctgctaagtgccctggagaacgtggctgtgctctatctgatcctgtcc tcccaccaactccgccggaagccctcatacctgttcattggcagcttggctggggctgac ttcctggccagtgtggtctttgcatgcagctttgtgaatttccatgttttccatggtgtg gattccaaggctgtcttcctgctgaagattggcagcgtgactatgaccttcacagcctct gtgggtagcctcctgctgaccgccattgaccgatacctctgcctgcgctatccaccttcc tacaaagctctgctcacccgtggaagggcactggtgaccctgggcatcatgtgggtcctc tcagcactagtctcctacctgcccctcatgggatggacttgctgtcccaggccctgctct gagcttttcccactgatccccaatgactacctgctgagctggctcctgttcatcgccttc ctcttttccggaatcatctacacctatgggcatgttctctggaaggcccatcagcatgtg gccagcttgtctggccaccaggacaggcaggtgccaggaatggcccgaatgaggctggat gtgaggttggccaagaccctagggctagtgttggctgtgctcctcatctgttggttccca gtgctggccctcatggcccacagcctggccactacgctcagtgaccaggtcaagaaggcc tttgctttctgctccatgctgtgcctcatcaactccatggtcaaccctgtcatctatgct ctacggagtggagagatccgctcctctgcccatcactgcctggctcactggaagaagtgt gtgaggggccttgggtcagaggcaaaagaagaagccccgagatcctcagtcaccgagaca gaggctgatgggaaaatcactccgtggccagattccagagatctagacctctctgattgc tga >gi568815597f:23861458_24061874|GENSCAN_predicted_peptide_3|375_aa MARSPTLRERNALMFNNELVADVHFVVGPPGATRTVPAHKYVLAVCSSVFYAMFYWDLAE VKSEIHIPDVEPAAFLILLKYVYSDEIDLEADTAAKKYIVPALAKACVNFLGTSLKAKNA CVLLSQSRLFEEPDLTQRCWEVIDAQAEMALRSQGFCEMDRQTLEIIVTREALNTRRRGV CQRRCPVRHPDILTLEETHTIFLWYTATNKPRLDFPLIKRKGLTPQRCHLFQSSAYRSNQ CQYCGRCDSIQFAVDRRVFIAGLGLCGSSSGKAEYSVKIELKRLGVVLAQNLTKFMSDRS SNTFPVWFEHPVQVEQDTFYMASAVLDGSNLSYFGQEGMTQVQCRKVAFQFQCSSDSTNG TGVQGGQIPELIFYA >gi568815597f:23861458_24061874|GENSCAN_predicted_CDS_3|1128_bp atggcgcggtcccccacgctgcgcgagaggaacgcgctcatgttcaacaacgagctcgtg gccgacgtgcactttgtcgtgggacccccgggggcgaccaggacggtgcccgcccacaag tacgtcctggctgtctgcagctccgtcttctatgccatgttctactgggacctggcggaa gtcaaatctgaaattcacattccagacgtggagcccgcagcctttctgatcctattaaag tacgtgtacagtgatgagatcgatctggaagcggatacggctgctaagaagtacatcgtc ccagcattggcaaaagcctgcgtcaactttctggggacaagtctgaaagccaagaacgcc tgcgtcctgctgtcccagagccggctgtttgaggagcccgacctgacccagcgctgctgg gaggtcatcgacgcacaggccgagatggccctgcggtcccaaggcttctgtgagatggac cggcagacgttggagatcattgtcactcgggaggccctcaacaccaggaggcgaggagtt tgccaacggcgctgcccagtcagacatccagacatcctgactctggaggagacccacacc atcttcctgtggtacacggccaccaacaagccccgcctggacttccccctgatcaagagg aagggcctcaccccgcagaggtgccacctattccagtcttctgcctaccgcagcaaccaa tgccagtactgcgggcgctgcgacagcatccagtttgcagtggacagaagggtatttatt gcggggctaggcctgtgtgggtccagctctgggaaggctgagtacagcgtgaagattgag ctcaagaggcttggggtagttctggctcagaatctgaccaaattcatgtcggacagatcc agtaacaccttcccggtctggtttgaacacccggtccaggttgaacaagacaccttctac atggccagtgctgttctggacggcagcaacctcagctactttgggcaggaggggatgacg caagtgcagtgcagaaaggtggccttccagttccagtgctcctcggacagcaccaacggg actggggtccagggtgggcagatccctgagctcatcttctatgcctga >gi568815597f:23861458_24061874|GENSCAN_predicted_peptide_4|348_aa MSRYLRPPNTSLFVRNVADDTSLQNFPSPLTFSTFEDVRDAEDALHNLDRKWICGRQIEI QFAQGDRKTPNQMKAKEGRNVYSSSRYDDYDRYRRSRSRSYERRRSRSRSFDYNYRRSYS PRNKLLHPVQLTKSCIPQRRIKETKLTNILLLFKTKLQLEYPVQFCLLHFKKDLKAEKEP KKGSSSDQRGETRHGPTKPTNTDERAAGHPSFYNPTFSFCLYRQNGVGSDCRSRAGRLRG QSKHDSAGLANEKRAAACCWGWPGAKARSRRRSTVTLQGQRHLSLTELQPKEKGASSSEK HGPEKVTHRLVASQLSDAGTGRAAAQGSGMAVVESLIFTPSALAVDLR >gi568815597f:23861458_24061874|GENSCAN_predicted_CDS_4|1047_bp atgtcccgctacctgcgtccccccaacacgtctctgttcgtcaggaacgtggccgacgac accagtttgcaaaacttcccaagccccttaacatttagcacatttgaggatgttcgtgat gctgaagacgctttacataatttggacagaaagtggatttgtggacggcagattgaaata cagtttgcccagggggatcgaaagacaccaaatcagatgaaagccaaggaagggaggaat gtgtacagttcttcacgctatgatgattatgacagatacagacgttctagaagccgaagt tatgaaaggaggagatcaagaagtcggtcttttgattacaactatagaagatcgtatagt cctagaaataagctgctacatccagttcaactaacaaaatcctgtataccacagagaaga ataaaggaaaccaaactgacaaacatccttcttttatttaagaccaaactgcagctggaa tacccagtacagttctgcttactacacttcaagaaagatctgaaagcggaaaaagaacca aagaagggcagttcaagcgaccaaaggggagagacgaggcacggaccgacaaagcccacg aacaccgatgagcgagctgccggccaccccagcttctacaacccgaccttctccttctgc ctctaccgacaaaatggagtcggcagcgactgcagaagcagggccgggaggctccgcggc caatcaaaacacgacagcgcgggcctggccaatgagaagcgggccgccgcctgttgctgg ggctggccgggggcgaaggcgcggagccgccgccgctccactgtcactctccaaggccag cgccacctctcactcaccgagctccagccgaaggagaagggggcttcctcttctgagaaa catgggcctgagaaggtcactcacaggcttgttgcgagtcaactgtctgatgcgggcaca gggcgggcagcagcacagggctcgggaatggctgtggtggagagtttgatcttcacccca tctgctcttgctgtggatctgagatga >gi568815597f:23861458_24061874|GENSCAN_predicted_peptide_5|80_aa MALRYPMAMGLNKGHKVTKNVSNPRHSCHRRCLTKHTKFVQDMIREVCGFAPYECHTMEL QKVSKAMELLKTSKFIKKRI >gi568815597f:23861458_24061874|GENSCAN_predicted_CDS_5|243_bp atggctctgcgctaccctatggccatgggcctcaacaagggccacaaggtgaccaagaac gtgagcaatcccaggcacagctgccaccgcaggtgcctgaccaaacacaccaagtttgtg caggacatgatccgagaggtgtgtggctttgccccttacgagtgtcacaccatggagtta cagaaggtctccaaggccatggagttactgaagacctccaagtttatcaagaaaaggatt tga >gi568815597f:23861458_24061874|GENSCAN_predicted_peptide_6|161_aa XFEDAMAEHQRLKTLAIIEKSKSADISVVFVSEFGANFLEAQPSWWAAEGLIPQWDRAKV VRGLPDVATIMEDKTLCLTCIVSGDPTPEISWLKNDQPVTFLDRYRMEVRGTEVTITIEK VNSEDSGRYGVFVKNKYGSETGQVTISVFKHGDEPKELKSM >gi568815597f:23861458_24061874|GENSCAN_predicted_CDS_6|486_bp ncttttgaggatgcaatggctgaacaccagagactgaaaaccttggccatcatcgagaag agtaagtcggccgatatttctgttgtttttgtttctgaatttggggcaaacttcctggaa gctcaaccctcttggtgggctgctgaaggcctcattccacagtgggatcgtgccaaagtg gtgagaggtctgccggatgtggccactatcatggaagataagaccctgtgcttgacttgc atcgtctcaggagaccccacccctgaaatctcttggctgaagaatgaccagcctgtcacc ttccttgaccgataccgcatggaagtgagggggacagaggtcaccatcaccattgagaag gtcaacagtgaagacagcggccgctacggcgtcttcgtcaagaacaagtatggctccgag acgggccaggtcaccatcagtgtgttcaagcacggggacgagcccaaggagctgaagagc atgtga