GENSCAN 1.0 Date run: 5-Nov-116 Time: 19:45:08 Sequence gi568815581f:19433940_19677550 : 243611 bp : 45.71% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1066 1061 6 1.05 1.02 Term - 12645 12031 615 0 0 40 46 851 0.987 70.56 1.01 Init - 12913 12707 207 1 0 56 -19 221 0.568 7.04 1.00 Prom - 32194 32155 40 -2.46 2.04 PlyA - 32959 32954 6 1.05 2.03 Term - 33468 33401 68 1 2 85 40 70 0.665 -0.00 2.02 Intr - 37167 36884 284 2 2 65 89 238 0.384 18.66 2.01 Init - 49844 49795 50 2 2 102 21 31 0.179 -1.78 2.00 Prom - 50866 50827 40 -0.56 3.00 Prom + 58468 58507 40 -0.46 3.01 Sngl + 63315 63590 276 2 0 29 32 256 0.305 9.48 3.02 PlyA + 63647 63652 6 1.05 4.04 PlyA - 64292 64287 6 1.05 4.03 Term - 66189 66128 62 1 2 98 36 42 0.306 -2.03 4.02 Intr - 70161 70123 39 1 0 100 94 23 0.298 2.30 4.01 Init - 73449 73365 85 0 1 53 33 163 0.436 6.28 4.00 Prom - 77588 77549 40 -5.56 5.00 Prom + 79682 79721 40 -3.96 5.01 Init + 91753 91758 6 0 0 82 89 4 0.139 0.64 5.02 Intr + 99989 100372 384 1 0 63 80 275 0.602 19.25 5.03 Intr + 108454 108555 102 0 0 104 88 144 0.984 16.37 5.04 Intr + 110523 110736 214 2 1 76 59 131 0.900 7.29 5.05 Intr + 112470 112564 95 1 2 77 99 36 0.885 3.28 5.06 Intr + 114046 114194 149 0 2 79 110 152 0.958 15.53 5.07 Intr + 121654 121751 98 1 2 59 80 19 0.218 -2.15 5.08 Intr + 121857 121970 114 1 0 79 14 86 0.118 0.72 5.09 Intr + 122056 122123 68 2 2 42 103 133 0.187 8.82 5.10 Intr + 126249 126357 109 2 1 98 53 85 0.994 5.86 5.11 Intr + 126479 126554 76 0 1 95 117 46 0.992 6.77 5.12 Intr + 132851 132920 70 2 1 108 87 47 0.996 5.78 5.13 Intr + 133157 133289 133 1 1 87 94 78 0.982 8.52 5.14 Intr + 143388 143575 188 1 2 103 44 135 0.013 10.01 5.15 Intr + 146066 146173 108 1 0 19 89 115 0.001 5.18 5.16 Intr + 146295 146396 102 2 0 92 76 90 0.001 8.57 5.17 Intr + 152652 152753 102 2 0 78 94 49 0.012 4.87 5.18 Intr + 154520 154667 148 1 1 35 94 132 0.776 8.31 5.19 Intr + 160271 160384 114 0 0 122 72 91 0.979 11.32 5.20 Intr + 161919 161986 68 1 2 58 95 119 0.924 8.22 5.21 Intr + 164159 164213 55 1 1 44 97 61 0.403 1.15 5.22 Intr + 168372 168571 200 1 2 70 44 101 0.903 2.97 5.23 Intr + 181849 181957 109 0 1 109 105 84 0.927 12.06 5.24 Intr + 187620 187744 125 1 2 49 36 70 0.448 -1.80 5.25 Intr + 191902 192034 133 0 1 84 77 61 0.839 4.82 5.26 Term + 193596 193690 95 1 2 85 49 69 0.677 0.69 5.27 PlyA + 198785 198790 6 1.05 6.00 Prom + 212206 212245 40 -1.06 6.01 Init + 214606 214767 162 0 0 85 18 200 0.779 10.44 6.02 Intr + 215024 215185 162 1 0 56 107 352 0.822 34.07 6.03 Intr + 217608 217839 232 2 1 47 41 183 0.224 6.85 6.04 Intr + 218608 218693 86 2 2 64 110 30 0.164 2.34 6.05 Intr + 222427 222635 209 0 2 -7 109 145 0.838 4.98 6.06 Intr + 223806 223923 118 0 1 73 76 28 0.962 0.57 6.07 Intr + 227188 227329 142 0 1 86 63 122 0.998 9.43 6.08 Intr + 229394 229560 167 0 2 92 121 85 0.993 11.88 6.09 Intr + 231009 231108 100 2 1 91 92 138 0.986 14.18 6.10 Term + 241619 241686 68 0 2 126 36 6 0.101 -2.70 6.11 PlyA + 241786 241791 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 146416 146193 224 2 2 89 41 222 0.976 15.45 S.002 Init + 152601 152753 153 2 0 63 94 115 0.956 9.59 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:19433940_19677550|GENSCAN_predicted_peptide_1|273_aa MADDAGAAGGPGGPGGPEMGNRGGFRGGFGSGIRGQGRGRGRGRGQGRGARGGKAEDKEW MPVTKLGRLESEIIDFFLGASLKDEVLKIMPVQKQTRAGQRTRFKAFVAIGDYNGHVGLG VKCSKEVATAIRGAIILAKLSIVPVRRGYWGNKIGKPHTVPCKVTGCCGSVLVRLIPAPR GTGIVSAPVPKKLLMMAGIDDCYTSARGCTATLGNFTKATFDAISKTYSYLTPDLWKETV FTKSPYQEFTDHLVKTHTRVSVQRTQAPAVATT >gi568815581f:19433940_19677550|GENSCAN_predicted_CDS_1|822_bp atggcggatgacgccggtgcagcgggggggcccggaggccctggtggccctgagatgggg aaccgcggtggcttccgcggaggtttcggcagtggcatccggggccagggtcgcggccgt ggacggggccggggccaaggccgcggagctcgcggaggcaaggccgaggataaggagtgg atgcccgtcaccaagttgggccgcttggaatcagagatcattgatttcttcctgggggcc tctctcaaggatgaggttttgaagattatgccagtgcagaagcagacccgtgccggccag cgcaccaggttcaaggcatttgttgctatcggggactacaacggccacgtcggtctgggt gttaagtgctccaaggaggtggccaccgccatccgtggggccatcatcctggccaagctc tccatcgtccccgtgcgcagaggctactgggggaacaagatcggcaagccccacactgtc ccttgcaaggtgacaggctgctgcggctctgtgctggtacgcctcatccctgcacccagg ggcactggcatcgtctccgcacctgtgcctaagaagctgctcatgatggctggtatcgat gactgctacacctcagcccggggctgcactgccaccctgggcaacttcaccaaggccacc tttgatgccatttctaagacctacagctacctaacccccgacctctggaaggagactgta tttaccaagtctccctatcaggaattcactgaccacctcgtcaagacccacaccagagtc tccgtgcagcggactcaggctccagctgtggctacaacatag >gi568815581f:19433940_19677550|GENSCAN_predicted_peptide_2|133_aa MTHDTAPGGPENMYPSCVLWLGGRSSFGSVLLMSLKASARGVKAWGQRLQDSKHCECSVA QLGVGLCDGKKHPKVDVARDGRSHRRGGKDSDRVPRERRVLVIREFSPHDCDPSAQGIMH GWGASALEEQFRD >gi568815581f:19433940_19677550|GENSCAN_predicted_CDS_2|402_bp atgacccatgatacagccccaggaggccctgagaacatgtacccaagttgtgttctctgg ctgggtgggcggagctcctttggctcggtcctgctgatgtccttaaaggcctcagcccgt ggagtcaaagcctggggccagcggctgcaggattctaaacactgtgagtgctcagttgcg cagctgggtgtgggactgtgtgacggcaagaagcatccaaaggtggacgtggccagggat gggaggagccatcggagaggtggtaaggacagcgaccgagttccccgagagaggcgagtc ttagtaattcgggaattcagcccccacgactgtgacccttctgctcagggcatcatgcac ggctggggggcctcagccctagaggagcagttcagagattaa >gi568815581f:19433940_19677550|GENSCAN_predicted_peptide_3|91_aa MLKNAESNGELKGLDVDSLVIEHIQVNKAPKICHWTYRAYGRINPYVSSLCHTEMILTEK EQIVPKPEEEVAQKKKISQKKLKKQKLMARE >gi568815581f:19433940_19677550|GENSCAN_predicted_CDS_3|276_bp atgctaaaaaatgcagagagtaatggtgaacttaagggtttagatgtagattctctggtc attgagcatatccaagtgaacaaagcaccgaagatatgccactggacctacagggcttat ggtcggattaacccatacgtgagctctctctgccacactgagatgatccttactgaaaag gaacagattgttcctaaaccagaagaggaggttgcccagaagaaaaagatatcccagaag aaactgaagaaacaaaaacttatggcacgggagtaa >gi568815581f:19433940_19677550|GENSCAN_predicted_peptide_4|61_aa MAARGTRGRGARGSGLLLSPLLSGDKRGGYGTKHCLVFYDRGFCDLLGFGFSEDVVQFKP M >gi568815581f:19433940_19677550|GENSCAN_predicted_CDS_4|186_bp atggcggcgcgggggacgcgggggcgcggggctcggggctcggggctgctcctttctcct ctcctctcaggggacaagcgcgggggatatggaaccaagcactgtcttgtgttctatgac agaggcttctgtgacctattaggctttggtttttccgaagatgtggttcagttcaaaccc atgtaa >gi568815581f:19433940_19677550|GENSCAN_predicted_peptide_5|1054_aa MVRASHMEAPEEPAPVRGGPEATLEVRGSRCLRLSAFREELRALLVLAGPAVSKVASVAG RYRRAGDLVISASAGDFGGLWGPSELSAGGLRGASCQEPGGHPSSSACVPAAFRSAPPDW GPRGALRERLFLVQLMVFLISFISSVFCGHLGKLELDAVTLAIAVPCCRVKWRYQVPDGE HIRHPVIPLPGLGGTALGSSKLQGLLLTLVKASTPGKPLALVRTSDRLCTPSAPQGPYLW VYLQVINVTGVSVGFGLSSACDTLISQTYGSQNLKHVGVILQRSALVLLLCCFPCWALFL NTQHILLLFRQDPDVSRGSALANLISQYTLALLLFLYILGKKLHQATWGGWSLECLQDWA SFLRLAIPSMLMLCMEWWAYEVGSFLSGILGMVELGAQSIVYELAIIVYMVPAGFSVAAS VRVGNALGAGDMEQARKSSTVSLLITVLFAVAFSVLLLSCKDHVGYIFTTDRDIINLVAQ VVPIYAVSHLFEALACTSGGVLRGSGNQKVGAIVNTIGYYVVGLPIGIALMFATTLGVMG CPENLEGILTNDVGKTGEPQSDQQMRQEEPLPEHPQDGAKLSRKQLVLRRGLLLLGVFLI LLRLRALGWRDGLWGLRGALPPDLRGEAAELAALAGPVFLAQLMIFIISLVSSIFCGHLG KVEPDAVTLAVTVCHTRSVVLLQVVNVTGIAVGTGLASAYDTLMSQSFGVKNLKRVGIIL QRGVLILMLCCFPCWADLVNTERILLLLKQDPKSPGWTRECFQEWGSYIHLVIPSMFMVC TEQWTFEIGNFLAGLIDVMELGTQGIICELASVAYMNYFRLNPQPPISDFTINLAFLTLT ERKILGYIQLRKGPDIVGPYGLLQPFTDAVKLHQRTLMALNIYYYPLYYCSNPGPFYHCP LVPLGLGVAASIRVGNALGEGNVEEAWCSCTTVLLCAAIPTKLFGEFIVYPLYQAAAAKT FARIRFQQTSLNINFSTGQGACGGVLRGTGKQNIGAILNAIGYYVFGFPIGVSLMFATKL RIIDTATIRFLNLFPAFPAFLFHKAAIVILARSQ >gi568815581f:19433940_19677550|GENSCAN_predicted_CDS_5|3165_bp atggtgcgcgcgagtcacatggaagctcctgaggagcccgcgccagtgcgcggaggcccg gaggccacccttgaggtccgtgggtcgcgctgcttgcggctgtccgccttccgagaagag ctgcgggcgctcttggtcctggctggccccgcggtgagtaaggtggcctcagtggcaggc cggtaccggcgggctggggacctggtgatttctgcctccgcgggtgactttggcgggctt tggggaccgagcgagctgtccgccggcgggctccggggagcatcgtgccaggagccgggc gggcaccccagctcctctgcctgcgtcccggccgctttccgctccgcaccacccgactgg gggccccgcggggcactgcgggaacggctgttcttggttcagctgatggtgttcctgatc agcttcataagctccgtgttctgtggccacctgggcaagctggagctggatgcagtcacg ctggcaatcgcggttccctgctgtagagtcaagtggaggtaccaggttccagatggggag cacatcagacatcctgtgataccactccctggtttggggggcactgctctaggctcctcg aaacttcagggtttgctgctgaccctggtgaaggcgagcacacctggaaagcctctggca ctagtccggacttcagaccgcctctgcactcccagtgccccccaggggccttatctttgg gtttatttgcaggttatcaatgtcactggtgtctcagtgggattcggcttatcttctgcc tgtgacaccctcatctcccagacgtacgggagccagaacctgaagcacgtgggcgtgatc ctgcagcggagtgcgctcgtcctgctcctctgctgcttcccctgctgggcgctctttctc aacacccagcacatcctgctgctcttcaggcaggacccagatgtgtccagaggctctgca ctggcaaacttgatttcccagtacaccctggctctactcctctttctctacatcctcggg aaaaaactgcatcaagctacatggggaggctggtccctcgagtgcctgcaggactgggcc tccttcctccgcctggccatccccagcatgctcatgctgtgcatggagtggtgggcctat gaggtcgggagcttcctcagtggcatcctcggcatggtggagctgggcgctcagtccatc gtgtatgaactggccatcattgtgtacatggtccctgcaggcttcagtgtggctgccagt gtccgggtaggaaacgctctgggtgctggagacatggagcaggcacggaagtcctctacc gtttccctgctgattacagtgctctttgctgtagccttcagtgtcctgctgttaagctgt aaggatcacgtggggtacatttttactaccgaccgagacatcattaatctggtggctcag gtggttccaatttatgctgtttcccacctctttgaagctcttgcttgcacgagtggtggt gttctgagggggagtggaaatcagaaggttggagccattgtgaataccattgggtactat gtggttggcctccccatcgggatcgcgctgatgtttgcaaccacacttggagtgatgggg tgccctgaaaaccttgaaggaattttaacgaacgatgttggaaagacaggcgagcctcag tcagatcagcagatgcgccaagaagaacctttgccggaacatccacaggacggcgctaaa ttgtccaggaaacagctggtgctgcggcgagggcttctgctcctgggggtcttcttaatc ttgctgcggctccgggccctcggatggcgtgacggcttgtggggcctgcggggtgcgctg cccccggacctgcggggggaggcggctgagctggcggcgctcgcgggcccagtgtttctt gcgcagttgatgatctttataatcagcctcgtcagctccatcttctgtggacatctgggc aaggtcgagccggacgctgtcacgctcgccgtcacggtctgccacaccaggtctgtggtt ctcttgcaggtggtgaacgttactgggattgcagttggcactggcttagcctcagcttat gacaccctcatgtctcagtcctttggagtcaagaacctcaagcgcgtggggatcatactt cagagaggggtcctcatccttatgctgtgctgctttccctgctgggccgacttggtcaac accgagcgcatcctcctgctcttaaaacaagacccgaagtctccaggttggacgagggag tgcttccaggagtggggctcctacatccacctggttattcccagtatgttcatggtgtgc actgagcagtggacctttgagatcggaaacttccttgcaggactgattgatgtgatggag ctcggcactcagggcatcatctgtgagctggcgtcagtggcctacatgaactatttccga ttaaatccgcaaccccccatctctgactttaccatcaacctagcattccttacactcact gaacggaaaatcttaggctatatacaactacgcaaaggacctgacattgtaggtccctat ggactgcttcaaccattcactgatgcagtaaaacttcaccaaagaacccttatggccctc aacatctactattaccctttatattattgctccaaccctggccctttctatcattgtcct cttgtacccctgggccttggagttgcagccagcatccgagtgggcaatgctctgggggaa gggaatgtagaggaggcttggtgctcctgcaccacagttctcctgtgtgctgccattcct actaagctttttggtgagtttattgtgtatcccttgtatcaggcagcagcagctaaaaca tttgcccgtatacgttttcaacaaacatccctcaacatcaactttagcactgggcagggc gcctgtggtggagtcctgagaggtacaggaaaacaaaacattggcgctatcttgaatgcc attgggtactatgtctttggttttcccattggagtatctctgatgtttgccactaaactc aggataatagacacggcaaccatccgatttctcaatcttttccccgcctttcccgccttt ctattccacaaagccgccattgtcatcctggcccgttctcaatga >gi568815581f:19433940_19677550|GENSCAN_predicted_peptide_6|481_aa MANTFPPSLHPAAPWPVAALGSRTAHSTPYIPARCQSRGEGGGRVGEAVNSGCHDQAMEL EVRRVRQAFLSGRSRPLRFRLQQLEALRRMVQEREKDILTAIAADLCKSEFNVYSQEVIT VLGEIDFMLENLPEWVTAKPVKKNVLTMLDEAYIQPQPLGVVLIIGAWNYPFVLTIQPLI GAIAAGNAVIIKPSELSENTAKILAKLLPQYLDQDLYIVINGGVEETTELLKQRFDHIFY TGNTAVGKIVMEAAAKHLTPVTLELGGKSPCYIDKDCDLDIVCRRITWGKYMNCGQTCIA PDYILCEASLQNQIVWKIKETVKEFYGENIKESPDYERIINLRHFKRILSLLEGQKIAFG GETDEATRYIAPTVLTDVDPKTKVMQEEIFGPILPIVPVKNVDEAINFINEREKPLALYV FSHNHKLIKRMIDETSSGGVTGNDVIMHFTLNSFPFGGVGRILLKNDPVQPPSASTELFL F >gi568815581f:19433940_19677550|GENSCAN_predicted_CDS_6|1446_bp atggccaacaccttccctccatccctacaccccgccgccccctggcccgtggccgcgctc ggctcccgcactgctcactccaccccctacatcccagcccgctgccagagccggggagag ggcgggggccgcgtgggcgaggccgtgaacagcggctgtcacgaccaggccatggagctc gaagtccggcgggtccgacaggcgttcctgtccggccggtcgcgacctctgcggtttcgg ctgcagcagctggaggccctgcggaggatggtgcaggagcgcgagaaggatatcctgacg gccatcgccgccgacctgtgcaagagtgaattcaatgtgtacagtcaggaagtcattact gtccttggggaaattgattttatgcttgagaatcttcctgaatgggttactgctaaacca gttaagaagaacgtgctcaccatgctggatgaggcctatattcagccacagcctctggga gtggtgctgataatcggagcttggaattaccccttcgttctcaccattcagccactgata ggagccatcgctgcaggaaatgctgtgattataaagccttctgaactgagtgaaaataca gccaagatcttggcaaagcttctccctcagtatttagaccaggatctctatattgttatt aatggtggtgttgaggaaaccacggagctcctgaagcagcgatttgaccacattttctat acgggaaacactgcggttggcaaaattgtcatggaagctgctgccaagcatctgacccct gtgactcttgaactgggagggaaaagtccatgttatattgataaagattgtgacctggac attgtttgcagacgcataacctggggaaaatacatgaattgtggccaaacctgcattgca cccgactatattctctgtgaagcatccctccaaaatcaaattgtatggaagattaaggaa acagtgaaggaattttatggagaaaatataaaagagtctcctgattatgaaaggatcatc aatcttcgtcattttaagaggatactaagtttgcttgaaggacaaaagatagcttttggt ggggagactgatgaggccacacgctacatagccccaacagtacttaccgatgttgatcct aaaaccaaggtgatgcaagaagaaatttttggaccaattcttccaatagtgcctgtgaaa aatgtagatgaggccataaatttcataaatgaacgtgaaaagcctctggctctttatgta ttttcgcataaccataagctcatcaaacggatgattgatgagacatccagtggaggtgtc acaggcaatgacgtcattatgcacttcacgctcaactctttcccatttggaggagtgggc agaatattactgaagaatgatcctgttcaacctcctagtgcctctactgaattattcctc ttttaa