GENSCAN 1.0    Date run: 4-Nov-116    Time: 11:01:23

Sequence gi568815583r:64056026_64262986 : 206961 bp : 43.07% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -    564    559    6                               1.05
 1.02 Term -   4965   4661  305  1  2   25   42   238 0.525   8.23
 1.01 Init -   5126   5051   76  2  1   88   56   103 0.995   6.42
 1.00 Prom -   9399   9360   40                              -4.86

 2.06 PlyA -  13144  13139    6                               1.05
 2.05 Term -  17003  16906   98  0  2   89   44   113 0.835   5.13
 2.04 Intr -  19512  19467   46  0  1   89   98    40 0.708   3.08
 2.03 Intr -  25126  25077   50  2  2   63  116    -6 0.443  -1.90
 2.02 Intr -  32826  32662  165  1  0   30  111    86 0.704   5.03
 2.01 Init -  37743  37620  124  0  1   79   46   170 0.850  10.29
 2.00 Prom -  38871  38832   40                              -7.86

 3.00 Prom +  38933  38972   40                              -7.96
 3.01 Init +  39989  40147  159  1  0   81   72   127 0.566  10.49
 3.02 Intr +  56548  56677  130  0  1   82  -34   131 0.320   0.57
 3.03 Intr +  62092  62219  128  2  2   62   78    67 0.674   3.50
 3.04 Intr +  62763  62829   67  2  1   56   97    85 0.883   4.68
 3.05 Intr +  67478  67521   44  0  2  110  109     6 0.959   2.86
 3.06 Intr +  70054  70195  142  0  1   70   87    89 0.997   6.93
 3.07 Intr +  71149  71227   79  2  1   78  115    56 0.999   5.91
 3.08 Intr +  71706  71781   76  0  1   88   88    28 0.999   2.32
 3.09 Intr +  73891  74004  114  0  0   87   68   153 0.881  13.84
 3.10 Intr +  74203  74296   94  0  1   67   87   102 0.995   7.54
 3.11 Intr +  75662  75867  206  0  2  112   70   221 0.999  21.62
 3.12 Intr +  78639  78782  144  2  0   80  105   271 0.989  28.48
 3.13 Intr +  80305  80385   81  0  0  108   78    85 0.987   9.33
 3.14 Intr +  80836  80907   72  0  0   34   91    70 0.342   1.50
 3.15 Intr +  95891  96040  150  1  0   25   62   136 0.651   5.06
 3.16 Intr +  96218  96301   84  1  0   68   76   162 0.998  13.02
 3.17 Intr +  96613  96717  105  0  0   -3   85   100 0.665   1.01
 3.18 Intr +  97220  97314   95  1  2  107   74   109 0.997  10.16
 3.19 Intr +  97910  97977   68  2  2   73   94    43 0.998   1.95
 3.20 Term +  98362  98483  122  2  2   92   43   127 0.999   7.24
 3.21 PlyA +  98715  98720    6                               1.05

 4.16 PlyA -  99826  99821    6                               1.05
 4.15 Term - 100120  99998  123  1  0  120   49   336 0.998  31.28
 4.14 Intr - 100884 100700  185  1  2  136  101   319 0.999  37.51
 4.13 Intr - 104172 104079   94  0  1  118   76   108 0.978  12.14
 4.12 Intr - 106129 106016  114  1  0   70  107    87 0.998   9.44
 4.11 Intr - 106992 106827  166  2  1   75   63   311 0.944  27.26
 4.10 Intr - 107377 107237  141  0  0   71   -6   129 0.231   1.27
 4.09 Intr - 115566 115405  162  2  0   84   69    79 0.258   4.69
 4.08 Intr - 124429 124323  107  1  2  133   59   130 0.910  13.61
 4.07 Intr - 147164 147057  108  2  0  101   91    71 0.992   9.18
 4.06 Intr - 148564 148416  149  2  2   91   87   231 0.902  23.25
 4.05 Intr - 148924 148840   85  1  1  120   75    34 0.757   4.69
 4.04 Intr - 151569 151484   86  1  2  103   63   -23 0.240  -3.76
 4.03 Intr - 158099 157865  235  2  1  103   93   179 0.978  17.16
 4.02 Intr - 160688 160537  152  0  2  124  119   130 0.976  19.58
 4.01 Init - 161113 161041   73  1  1   65   44    28 0.410  -2.67
 4.00 Prom - 164067 164028   40                              -4.76

 5.00 Prom + 165929 165968   40                              -2.46
 5.01 Init + 186310 186329   20  0  2   88  115    14 0.545   3.87
 5.02 Intr + 201978 202081  104  0  2   96  103   101 0.884  12.22

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:64056026_64262986|GENSCAN_predicted_peptide_1|126_aa
MAGCRSRALPRGEAAKARREIKLSAGPASGSECGARRAHTHPELALARKYGPQVPPAAPV
PTRSPGSHPGLSLHTSSQAKGAGSGLSQPRKGLPQCSSGLKGSSSAAKVGTRAEEVPRAS
EGCEGC

>gi568815583r:64056026_64262986|GENSCAN_predicted_CDS_1|381_bp
atggcgggctgcaggtcccgagccctgccccgtggggaggcagctaaggcccggcgagaa
atcaagctcagcgctgggccagccagcggctctgagtgcggggcccgccgagcccacacc
cacccggaactcgcgctggcccgcaagtacggcccgcaagtaccgcccgcagccccggtt
cccacccgcagccccggttcccacccgggcctctccctccacacctcctcgcaagctaag
ggagccggctctggcctcagccagcccaggaaggggctcccacagtgcagcagcgggctg
aagggctcctcaagtgccgccaaagtgggaacccgggcagaggaggtgccgagagcaagc
gagggctgtgagggctgctag

>gi568815583r:64056026_64262986|GENSCAN_predicted_peptide_2|160_aa
MQRVSGLLSWTLSRVLWLSGLSEPGAARQPRIMEEKALEVYDLIRTIRDPEKPNTLEELE
VVSESCVEVQEINEEEYLVIIRFTPTVPHCSLATLIGLCLRVKLQRCLPFKHKLEIYISE
GTHSTEEDINKQINDKERVAAAMENPNLREIVEQCVLEPD

>gi568815583r:64056026_64262986|GENSCAN_predicted_CDS_2|483_bp
atgcagcgggtgtccgggctgctctcctggacgctgagcagagtcctgtggctctccggc
ctctctgagccgggagctgcccggcagccccggatcatggaagagaaagcgctagaagtt
tatgatttgattagaactatccgggacccagaaaagcccaatactttagaagaactggaa
gtggtctcggaaagttgtgtggaagttcaggagataaatgaagaagaatatctggttatt
atcaggttcacgccaacagtacctcattgctctttggcgactcttattgggctgtgctta
agagtaaaacttcagcgatgtttaccatttaaacataagttggaaatctacatttctgaa
ggaacccactcaacagaagaagacatcaataagcagataaatgacaaagagcgagtggca
gctgcaatggaaaaccccaacttacgggaaattgtggaacagtgtgtccttgaacctgac
tga

>gi568815583r:64056026_64262986|GENSCAN_predicted_peptide_3|719_aa
MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTGAAVVSKHQSPK
ITTSLLPINNGSKENGIHEEQDQEPQDLFAGKFGLKNATVELSLDSTQNNQKKVLAKTLI
SLPPQEATNSSKPQPTYEELEEEEQEDQFDLTVGITDPEKIGDGMNAYVAYKVTTQTSLP
LFRSKQFAVKRRFSDFLGLYEKLSEKHSQNGFIVPPPPEKSLIGMTKVKVGKEDSSSAEF
LEKRRAALERYLQRIVNHPTMLQDPDVREFLEKEELPRAVGTQTLSGAGLLKMFNKATDA
VSKMTIKMNESDIWFEEKLQEVECEEQRLRKLHAVVETLVNHRKELALNTAQFAKSLAML
GSSEDNTALSRALSQLAEVEEKIEQLHQEQANNDFFLLAELLSDYIRLLAIVRAAFDQRM
KTWQRWQDAQATLQKKREAEARLLWANKPDKLQQAKDEILEWESRVTQYERDFERISTVV
RKEVIRFEKEKSKDFKNHVIKYLETLLYSQQQWGPGARAWVNERCGLDRRGKPCRGFGGR
LDLPASRTMVRALEGPQGSEGQVFRVEVLCSGRRHTVPRRYSEFHALHKRIKKLYKVPDF
PSKRLPNWRTRGLEQRRQGLEAYIQGILYLNQEVPKELLEFLRLRHFPTDPKASNWGSQQ
HQRPVLSFHVDPYVCNPSPESLPNVVVNGVLQGLYSFSISPDKAQPKAACHPAPLPPMP

>gi568815583r:64056026_64262986|GENSCAN_predicted_CDS_3|2160_bp
atggcgtcgggtggtggtggctgtagcgcttcggagagactgcctccgcccttccccggc
ctggagccggagtccgagggggcggccgggggatcagaacccgaggctggggacagcgac
accgagggggaggacattttcaccggcgccgcggtggtcagtaaacatcagtctccaaag
ataactacatcccttcttcccatcaacaatggctccaaagaaaatgggatccatgaagaa
caagaccaagagccacaggatctctttgcaggcaagtttggactcaaaaatgccacagtg
gagctatccttggacagcacacaaaataatcagaagaaggtgctagccaaaacactcatt
tctcttcctcctcaggaagccacaaattcttcgaagccccagccaacctatgaggagcta
gaggaagaagaacaggaggatcaatttgatttgacagtcggtataactgatcctgagaag
ataggggatggtatgaatgcatatgtagcctacaaagttacaacacagacaagcttacca
ttgttcagaagcaaacagtttgcagtaaaaagaagatttagtgactttctgggtctttat
gagaagctttccgagaagcactctcagaatggcttcattgtccctccgcccccggagaag
agcctcatagggatgacaaaagtgaaagttgggaaggaagattcttcttctgcagaattt
cttgaaaaacggagggccgctttagaaaggtaccttcagaggattgtaaatcatcctacc
atgttacaggaccctgacgtcagagagttcttggaaaaagaagagctgccacgtgccgtg
ggtacccagacattgagtggtgctggtctcctcaagatgttcaacaaagccacagatgcc
gtcagcaaaatgaccatcaagatgaatgaatcagacatttggtttgaggagaagctccag
gaggtagagtgtgaggagcagcgcttacggaaactgcatgctgttgtagaaactctagtc
aaccataggaaagagctagcgctgaacacagcccagtttgcaaagagtctagccatgctt
gggagctctgaggacaacacggcattgtcacgggcactctcccagctggctgaggtggaa
gaaaaaattgagcagctccaccaggaacaggccaacaatgacttcttcctccttgctgag
ctcctgagtgactacattcgcctcctggccatagtccgcgctgccttcgaccagcgcatg
aagacatggcagcgctggcaggatgcccaagccacactgcagaagaagcgggaggccgag
gctcggctgctgtgggccaacaagcctgataagctgcagcaggccaaggacgagatcctc
gagtgggagtctcgggtgactcaatatgaaagggacttcgagaggatttcaacagtggtc
cgaaaagaagtgatacggtttgagaaagagaaatccaaggacttcaagaaccacgtgatc
aagtaccttgagacactcctttactcacagcagcagtggggacccggggcccgggcctgg
gtgaacgagcgctgcgggctggaccgtcgggggaaaccttgtcgagggtttgggggccgc
ttggatttgccagcctcgcggaccatggttcgagctcttgagggaccccagggctccgag
gggcaggtgttccgagtggaggtgctgtgcagcgggcgcagacacacggtgccaaggcgc
tacagcgagttccacgcgctgcacaagcggatcaagaagctgtacaaagtgcccgacttc
ccctcgaaacgcctgcccaactggaggaccagagggttggaacagcgccggcagggcttg
gaggcttacatccagggcatcctgtacctgaaccaggaggtgcccaaggagttactggaa
ttcctgagacttcggcacttccccacagaccccaaggctagcaactggggctcccagcag
caccagcggcctgtcctgagcttccatgtggatccctatgtttgcaacccctccccagag
tcgctgcccaacgtggtggtgaatggtgtgctccagggcctctacagcttcagcatcagc
ccagataaagcccagccaaaggcggcctgtcaccctgctcctctgccaccgatgccctga

>gi568815583r:64056026_64262986|GENSCAN_predicted_peptide_4|659_aa
MAVSIESAHEPDNVTSYVSLAFELSEGLPQVYYFGPCGKYNAMVLELLGPSLEDLFDLCD
RTFTLKTVLMIAIQLLSRMEYVHSKNLIYRDVKPENFLIGRQGNKKEHVIHIIDFGLAKE
YIDPETKKHIPYREHKSLTGTARYMSINTHLGKEQSRRDDLEALGHMFMYFLRGSLPWQG
LKADTLKERYQKIGDTKRNTPIEALCENFPEEMATYLRYVRRLDFFEKPDYEYLRTLFTD
LFEKKGYTFDYAYDWVGRPIPTPVGSVHVDSGASAITRESHTHRDRPSQQQPLRNQVVSS
TNGELNVDDPTGAHSNAPITAHAEVEVVEEANLLLVFTPPSTLSSIKPSELKALSAEGDQ
EAGGKAKKVGGKWQTWAGEEPSPYPRPTYLLTSSASYRRPVEQRSGRGGPRSMPGLVVFS
EPGPVLCEAAFRRCPGCFRLRLWMLRLSERNMKVLLAAALIAGSVFFLLLPGPSAADEKK
KGPKVTVKVYFDLRIGDEDVGRVIFGLFGKTVPKTVDNFVALATGEKGFGYKNSKFHRVI
KDFMIQGGDFTRGDGTGGKSIYGERFPDENFKLKHYGPGWVSMANAGKDTNGSQFFITTV
KTAWLDGKHVVFGKVLEGMEVVRKVESTKTDSRDKPLKDVIIADCGKIEVEKPFAIAKE

>gi568815583r:64056026_64262986|GENSCAN_predicted_CDS_4|1980_bp
atggcagtaagcatagagagtgctcatgaaccagataatgtaacaagttatgttagcctg
gcctttgagctcagtgaaggtctcccacaggtgtattactttggaccatgtgggaaatat
aatgccatggtgctggagctccttggccctagcttggaggacttgtttgacctctgtgac
cgaacatttactttgaagacggtgttaatgatagccatccagctgctttctcgaatggaa
tacgtgcactcaaagaacctcatttaccgagatgtcaagccagagaacttcctgattggt
cgacaaggcaataagaaagagcatgttatacacattatagactttggactggccaaggaa
tacattgaccccgaaaccaaaaaacacataccttatagggaacacaaaagtttaactgga
actgcaagatatatgtctatcaacacgcatcttggcaaagagcaaagccggagagatgat
ttggaagccctaggccatatgttcatgtatttccttcgaggcagcctcccctggcaagga
ctcaaggctgacacattaaaagagagatatcaaaaaattggtgacaccaaaaggaatact
cccattgaagctctctgtgagaactttccagaggagatggcaacctaccttcgatatgtc
aggcgactggacttctttgaaaaacctgattatgagtatttacggaccctcttcacagac
ctctttgaaaagaaaggctacacctttgactatgcctatgattgggttgggagacctatt
cctactccagtagggtcagttcacgtagattctggtgcatctgcaataactcgagaaagc
cacacacatagggatcggccatcacaacagcagcctcttcgaaatcaggtggttagctca
accaatggagagctgaatgttgatgatcccacgggagcccactccaatgcaccaatcaca
gctcatgccgaggtggaggtagtggaggaagctaatttactcctggtttttaccccaccc
tcaaccctctccagcataaaacctagtgagctaaaggctttgtctgcagaaggagatcaa
gaggctgggggtaaggccaagaaggtaggaggaaaatggcagacctgggctggagaagaa
ccttctccgtatcccaggcccacgtatttgctaacctcaagcgccagctaccgcaggcca
gtggaacagcgctccgggcggggcgggccgcggagcatgccggggcttgtggtcttctcg
gagcccgggcccgtgctctgcgaggccgccttccgccgctgtccgggctgctttcgcctc
cgcctgtggatgctgcgcctctccgaacgcaacatgaaggtgctccttgccgccgccctc
atcgcggggtccgtcttcttcctgctgctgccgggaccttctgcggccgatgagaagaag
aaggggcccaaagtcaccgtcaaggtgtattttgacctacgaattggagatgaagatgta
ggccgggtgatctttggtctcttcggaaagactgttccaaaaacagtggataattttgtg
gccttagctacaggagagaaaggatttggctacaaaaacagcaaattccatcgtgtaatc
aaggacttcatgatccagggcggagacttcaccaggggagatggcacaggaggaaagagc
atctacggtgagcgcttccccgatgagaacttcaaactgaagcactacgggcctggctgg
gtgagcatggccaacgcaggcaaagacaccaacggctcccagttcttcatcacgacagtc
aagacagcctggctagatggcaagcatgtggtgtttggcaaagttctagagggcatggag
gtggtgcggaaggtggagagcaccaagacagacagccgggataaacccctgaaggatgtg
atcatcgcagactgcggcaagatcgaggtggagaagccctttgccatcgccaaggagtag

>gi568815583r:64056026_64262986|GENSCAN_predicted_peptide_5|42_aa
MAVLSSWETNLVTVSSSQDNPNSSTSKLDEKNNQSQNKCGCX

>gi568815583r:64056026_64262986|GENSCAN_predicted_CDS_5|126_bp
atggcggtgctgtcttcatgggaaacaaatctggtaactgtcagttcctcacaagacaac
cctaactcctccacaagcaagctggatgagaagaacaaccaatcacagaacaagtgtgga
tgtgnn