GENSCAN 1.0 Date run: 7-Nov-116 Time: 18:33:32 Sequence gi568815596r:28750202_28970277 : 220076 bp : 40.68% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1837 2239 403 0 1 32 2 367 0.006 15.58 1.02 Intr + 26650 26781 132 2 0 52 68 92 0.655 3.30 1.03 Intr + 28608 28838 231 1 0 109 101 129 0.997 13.12 1.04 Intr + 31537 31641 105 2 0 110 59 42 0.869 2.77 1.05 Intr + 33706 33777 72 2 0 77 111 106 0.999 10.26 1.06 Intr + 38457 38608 152 1 2 90 80 70 0.889 5.36 1.07 Intr + 43662 43796 135 2 0 104 80 149 0.999 15.54 1.08 Term + 48998 49102 105 1 0 34 49 183 0.821 6.43 1.09 PlyA + 51048 51053 6 1.05 2.02 PlyA - 51962 51957 6 1.05 2.01 Sngl - 61300 60626 675 1 0 56 45 308 0.999 19.23 2.00 Prom - 65752 65713 40 -4.35 3.00 Prom + 67372 67411 40 -2.45 3.01 Init + 70560 70570 11 2 2 37 77 10 0.086 -5.25 3.02 Intr + 78947 79118 172 2 1 53 94 104 0.378 6.52 3.03 Intr + 89971 90268 298 0 1 50 86 215 0.440 13.02 3.04 Term + 98109 98137 29 1 2 77 54 36 0.066 -3.64 3.05 PlyA + 98448 98453 6 1.05 4.07 PlyA - 99644 99639 6 1.05 4.06 Term - 101087 100824 264 2 0 14 44 205 0.234 3.22 4.05 Intr - 111107 110917 191 0 2 18 92 101 0.870 1.88 4.04 Intr - 114918 114816 103 0 1 74 110 72 0.860 6.83 4.03 Intr - 116191 116144 48 1 0 60 53 89 0.498 0.56 4.02 Intr - 120209 119378 832 1 1 73 90 427 0.724 31.87 4.01 Init - 125254 125109 146 1 2 35 80 184 0.895 11.94 4.00 Prom - 134531 134492 40 -7.15 5.00 Prom + 139626 139665 40 -9.25 5.01 Init + 142516 142518 3 0 0 113 22 0 0.184 -4.05 5.02 Intr + 144432 144722 291 2 0 113 98 347 0.880 34.61 5.03 Intr + 151786 151923 138 0 0 127 92 60 0.994 10.04 5.04 Intr + 156259 156380 122 0 2 106 89 65 0.765 6.77 5.05 Intr + 162389 162509 121 2 1 32 84 83 0.670 1.88 5.06 Intr + 163868 164007 140 1 2 78 84 156 0.989 12.54 5.07 Intr + 167692 167794 103 1 1 76 84 47 0.564 2.36 5.08 Intr + 172718 172782 65 1 2 71 106 14 0.571 -1.90 5.09 Intr + 174781 174952 172 1 1 117 76 82 0.991 8.92 5.10 Intr + 177368 177499 132 1 0 80 80 104 0.990 8.72 5.11 Intr + 179378 179509 132 1 0 100 101 101 0.998 12.52 5.12 Intr + 185320 185406 87 0 0 111 78 23 0.801 2.85 5.13 Intr + 191260 191373 114 0 0 69 89 39 0.728 1.82 5.14 Intr + 192111 192180 70 2 1 93 98 83 0.991 7.64 5.15 Intr + 196249 196298 50 2 2 103 81 111 0.890 9.38 5.16 Term + 196399 196578 180 0 0 60 49 477 0.999 37.63 5.17 PlyA + 196770 196775 6 1.05 6.00 Prom + 197941 197980 40 -1.55 6.01 Init + 201191 201309 119 1 2 62 28 157 0.729 6.82 6.02 Intr + 201844 202054 211 1 1 75 78 92 0.792 4.89 6.03 Term + 202162 202269 108 0 0 82 46 75 0.827 0.23 6.04 PlyA + 202569 202574 6 1.05 7.04 PlyA - 203452 203447 6 1.05 7.03 Term - 206539 206303 237 1 0 11 42 275 0.797 10.38 7.02 Intr - 209061 208916 146 1 2 31 54 109 0.217 0.88 7.01 Intr - 217538 217514 25 2 1 99 121 35 0.370 4.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:28750202_28970277|GENSCAN_predicted_peptide_1|444_aa VRERRAVAAASAAEKPLFPLLGRRVCADKMADGELNVDSLITRLLEGECAPGRGTEGGRA PPPTPASPSAAGTRGDPFPAPRRVSGSAARRTNRGGGEEALGAGERPLGARSGEIGGEGA VPADPRGPGPPAEGLRGCRPGKIVQMTEAEVRGLCIKSREIFLSQPILLELEAPLKICGD IHGQYTDLLRLFEYGGFPPEANYLFLGDYVDRGKQSLETICLLLAYKIKYPENFFLLRGN HECASINRIYGFYDECKRRFNIKLWKTFTDCFNCLPIAAIVDEKIFCCHGGLSPDLQSME QIRRIMRPTDVPDTGLLCDLLWSDPDKDVQGWGENDRGVSFTFGADVVSKFLNRHDLDLI CRAHQVVEDGYEFFAKRQLVTLFSAPNYCGEFDNAGGMMSVDETLMCSFQILKPSEKKAK YQYGGLNSGRPVTPPRTANPPKKR >gi568815596r:28750202_28970277|GENSCAN_predicted_CDS_1|1335_bp gtgagagaacgccgagccgtcgccgcagcctccgccgccgagaagcccttgttcccgctg ctgggaaggagagtctgtgccgacaagatggcggacggggagctgaacgtggacagcctc atcacccggctgctggagggtgagtgcgcgcctggccgcgggacagagggaggtcgggca ccgccgccgacccctgcgtccccgtctgccgccggaacgcgaggggacccctttcccgcc ccgagacgagtctctgggagcgcggcgcggcggacgaaccgaggagggggcgaggaggct ctgggcgcgggggagcggcctctgggagcgcggtcaggggagatcgggggagagggggcc gttcccgcggaccctcgggggccaggcccgccggccgaaggcttacgaggatgtcgtcca ggaaagattgtgcagatgactgaagcagaagttcgaggcttatgtatcaagtctcgggag atctttctcagccagcctattcttttggaattggaagcaccgctgaaaatttgtggagat attcatggacaatatacagatttactgagattatttgaatatggaggtttcccaccagaa gccaactatcttttcttaggagattatgtggacagaggaaagcagtctttggaaaccatt tgtttgctattggcttataaaatcaaatatccagagaacttctttctcttaagaggaaac catgagtgtgctagcatcaatcgcatttatggattctatgatgaatgcaaacgaagattt aatattaaattgtggaagaccttcactgattgttttaactgtctgcctatagcagccatt gtggatgagaagatcttctgttgtcatggaggattgtcaccagacctgcaatctatggag cagattcggagaattatgagacctactgatgtccctgatacaggtttgctctgtgatttg ctatggtctgatccagataaggatgtgcaaggctggggagaaaatgatcgtggtgtttcc tttacttttggagctgatgtagtcagtaaatttctgaatcgtcatgatttagatttgatt tgtcgagctcatcaggtggtggaagatggatatgaattttttgctaaacgacagttggta accttattttcagccccaaattactgtggcgagtttgataatgctggtggaatgatgagt gtggatgaaactttgatgtgttcatttcagatattgaaaccatctgaaaagaaagctaaa taccagtatggtggactgaattctggacgtcctgtcactccacctcgaacagctaatccg ccgaagaaaaggtga >gi568815596r:28750202_28970277|GENSCAN_predicted_peptide_2|224_aa MSYGPGTETQQLRSQNSGADDLGDKKRCLMGHKEVGFIKKTPQISIPPTIKAAGTRGDGS ACPGPSLRADGRASGAASGIGPARPSPGTFTRYAAGRRAAKAPTCAATASLARSLSPEPA AGSACVVAAKAAEGAHGRRREDEAGALPSHLRRAVGSPASEPRDRAHSKLEIRLRGPFPG ASTGTCGSPGLRGRGPGNGGQGSVAALLASDGCSSVASQQRYPL >gi568815596r:28750202_28970277|GENSCAN_predicted_CDS_2|675_bp atgagctacgggcctggtactgagacacagcagctcagatctcaaaattctggggcagac gatcttggggacaaaaagaggtgtttaatgggacataaggaggtaggatttatcaaaaag accccccaaatctccattcctcccacaataaaggcggcaggcacacgtggagacgggagc gcctgcccagggccctccctccgagcagacggccgagcttcgggagcagcctccggtatc ggccctgcccgtccttcccctggaaccttcacccgctacgccgccgggcggagggcggcc aaagccccaacctgcgcggccactgcctccctcgccaggtccctcagcccagagcccgct gcggggagcgcgtgtgtcgtcgccgcgaaggcagctgagggcgcccacgggaggcggcgt gaggacgaggctggagcgctgccttctcatctaaggcgggcggtggggtcgccggcgagc gaacccagggaccgggcacactcgaaactggagattcgcctgcgaggccccttcccgggg gcgagcacaggtacctgcggaagcccggggctgcgcgggagagggccgggcaacggcggt caaggctccgtcgcagcgctcctggcctcagacggttgctcgtcggtcgctagccagcag cggtacccgctctaa >gi568815596r:28750202_28970277|GENSCAN_predicted_peptide_3|169_aa MSGRYLANTVEEDEEETKYEIFPWALGKNWRKLFPNFLKLRDQLWDRIDYRAIVSRRCCE EVMAIAPTHYIWQRERSVHHSGAVRNYNRDEVQLPRGPSATPVDCSLCGKKRRYVRLGLS SSSSLSSHTAGVTEKHSQDSYNSLSMDIIGDPSQAYTGSEGYTNSFTGI >gi568815596r:28750202_28970277|GENSCAN_predicted_CDS_3|510_bp atgagtggaaggtatctggctaatacagttgaagaagatgaagaagaaaccaagtacgaa atttttccatgggctttagggaaaaactggagaaaattgttccctaatttcttaaagtta agggaccagctctgggatagaattgactatagggctattgtaagcaggcgatgttgtgag gaggttatggccattgcaccaacccattatatctggcaaagagaacgttctgttcatcac agtggagctgtcagaaactacaacagagatgaagttcagctgccccggggacctagtgcc acaccagtagattgttcactctgtggtaaaaaaagaagatatgttagactgggattgtct tcatcatcatctttatccagtcatacagcaggggtgacagaaaaacattctcaggactca tacaactcactgtcaatggacataataggtgatccttctcaagcttatactggttctgaa ggatacaccaattccttcacgggcatatga >gi568815596r:28750202_28970277|GENSCAN_predicted_peptide_4|527_aa MFEKVLNFSEFGFAVSSNTAVNPRGEVLQNPDSSLAATGDKVKKQEKSRRSRGAVEPHAA AEPSGCCAMRATGKEGVALGLRHSSATAPSRNTMLMAWCRGPVLLCLRQGLGTNSFLHGL GQEPFEGARSLCCRSSPRDLRDGEREHEAAQRKAPGAESCPSLPLSISDIGTGCLSSLEN LRLPTLREESSPRELEDSSGDQGRCGPTHQGSEDPSMLSQAQSATEVEERHVSPSCSTSR ERPFQAGELILAETGEGETKFKKLFRLNNFGLLNSNWGAVPFGKIVGKFPGQILRSSFGK QYMLRRPALEDYVVLMKRGTAITFPKSVEKLSSTKRVPSAQKDINMILSMMDINPGDTVL EAGSGSGGMSLFLSKAVGSQGRVISFEVRKDHHDLAKKNYKHWRDSWKLSHVEEWPDNVD FIHKDISGATEDIKSLTFDAVIELLDGIRTCELALSCEKISEVIVRDWLVCLAKQKNGIL AQKVESKINTDVQLDSQEKIGVKGELFQEDDHGELQFYFMHAVMNGE >gi568815596r:28750202_28970277|GENSCAN_predicted_CDS_4|1584_bp atgtttgagaaggttttaaacttctctgaatttggatttgcagtttctagtaacacagca gtaaatcctagaggagaagttctacagaatccagattcaagtttagcagcaactggggat aaagtgaaaaagcaggagaaaagcaggcgcagtcgcggagctgtagagccccacgcagct gcagagccatcgggctgctgcgccatgcgcgcgactgggaaagaaggggtcgcgctaggc ttgcgtcactcgtctgcgacggcgccttcgcgaaacactatgctaatggcatggtgccgc ggtcctgtcttgctgtgcctgcggcaggggctcggaaccaattcattcctgcacggcctg gggcaggagcccttcgagggagctcggtcactgtgttgcaggtcctcgcctagagacctg cgagatggagaaagagagcacgaggcggcacaaaggaaagccccaggagcagagtcttgc ccatctctccctctgagcatctcggacattgggactggatgtctttcgtcactggaaaac ctcagactgccgacgctgcgggaagagtcatcccctcgagagctcgaggactcgagcgga gaccagggccggtgcggtcccacacaccagggatccgaggatccttcgatgctctcgcag gcccagtccgctaccgaggtcgaagagcgtcacgtctccccttcttgttcaacttccaga gagagaccctttcaggctggggaactgattttagctgagactggggagggagaaacaaaa tttaagaaattatttaggttgaacaacttcggactcttaaatagtaactggggggcagtc ccgttcggcaagatcgtggggaagttccccggccagatactgaggagttccttcggtaag cagtacatgctgaggaggccagccttggaagactatgtagtattgatgaaaagagggact gccataacattcccaaagtctgtggaaaaactgtcttccacgaaacgggtccctagtgcc caaaaggatattaatatgattctctcaatgatggatatcaacccaggtgatactgttttg gaagctggctcaggctctggtggaatgagcttatttttatccaaagcagttggatcacaa ggacgagtcataagttttgaggtacgaaaagaccaccatgatctggctaagaagaattac aaacactggcgtgattcatggaaattaagtcatgtagaagagtggccagacaatgtggat tttattcataaggacatttcaggagcaaccgaagacataaaatctttaacatttgacgca gttattgaacttttagatggaattcgcacctgtgaacttgctctttcatgtgaaaagata agcgaggtcattgtcagagattggttggtttgccttgcaaaacagaaaaatggaatttta gctcaaaaagtagaatctaaaatcaacacagatgtacaactagattctcaagagaaaatt ggagttaaaggtgagctgtttcaagaggatgaccatggtgagcttcagttttactttatg catgcagtaatgaatggagaatga >gi568815596r:28750202_28970277|GENSCAN_predicted_peptide_5|639_aa MAPPRAGAPAHGRTRGCSGARAAMAAGGGGSCDPLAPAGVPCAFSPHSQAYFALASTDGH LRVWETANNRLHQEYVPSAHLSGTCTCLAWAPARLQAKESPQRKKRKSEAVGMSNQTDLL ALGTAVGSILLYSTVKGELHSKLISGGHDNRVNCIQWHQDSGCLYSCSDDKHIVEWNVQT CKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLMFT TIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKSAVMSFTVTDEPVYIDLTL SENKEEPVKLAVVCRDGQVHLFEHILNGYCKKPLTSNCTIQIATPGKGKKSTPKPIPILA AGFCSDKMSLLLVYGSWFQPTIERVVRTPVMNSEAKVLVPGIPGHHAAIKPAPPQTEQVE SKRKSGGNEVSIEERLGAMDIDTHKKGKEDLQTNSFPVLLTQGLESNDFEMLNKVLQTRN VNLIKKTVLRMPLHTIIPLLQELPDLVPQLGTLYQLMESRVKTFQKLSHLHGKLILLITQ VTASEKTKGATSPGQKAKLVYEEESSEEESDDEIADKDSEDNWDEDEEESESEKDEDVEE EDEDAEGKDEENGEDRDTASEKELNGDSDLDPENESEEE >gi568815596r:28750202_28970277|GENSCAN_predicted_CDS_5|1920_bp atggctccgccccgcgccggtgcgcctgcgcacggacgaacacgtggctgcagcggggcc agagcagcaatggcggcgggcggcggcggtagctgcgaccccctggcccctgctggggtc ccttgcgccttctccccgcacagccaggcctacttcgctttggcctctaccgacggtcac ttacgagtatgggagacggccaacaaccggctgcaccaggagtacgtgccttccgcgcac ctcagtggtacctgcacctgtctggcctgggcgccagcgcggctgcaggccaaggaaagt ccccagaggaaaaaaaggaaatcagaagctgtaggaatgagtaaccagactgacttattg gctcttggcacagcagttggtagcattttattatacagcacagtaaaaggagagttacac agtaaattaataagtggtggacatgacaacagagtcaactgcatacagtggcatcaagac agtggctgtttatatagttgttcagatgataaacatattgtggaatggaacgtacagaca tgcaaagtaaagtgcaaatggaaaggcgacaatagcagtgtcagttccctatgtatcagc ccagatggaaagatgttgctttcagctggtcgaacaatcaaactatgggttttggagacc aaagaagtctacaggcatttcacaggacatgcaacgccagtttcgtcactgatgttcact accatcagacctcctaatgagagccagccctttgatggaattacaggtctttatttctta tctggagcagtacatgaccggttacttaatgtctggcaggtccgatcagaaaacaaagaa aagagtgcagtgatgtcatttacagttaccgatgaacctgtctatattgacttaactttg tcagaaaacaaagaagagcctgtcaagttggctgttgtttgcagagatggtcaagtccat ctttttgaacacatattaaatgggtactgcaaaaagcctttgacttcaaactgcacaatt cagatagcaacacctgggaaaggcaagaagtcaacaccaaaacccatccctattctagct gctggtttttgctcagacaaaatgtcattgttgcttgtatatggcagttggtttcagcct actattgagcgagtggtgaggacaccagtgatgaattctgaagcaaaagttctggtgcct gggattcctggtcatcatgcagctatcaagcccgctcctccacaaaccgagcaagtagag agcaagaggaagtcagggggaaatgaggttagcattgaagaacgtctgggagcaatggat atagacacacacaaaaaaggaaaggaagacctccagacgaatagctttccagttcttctt acccagggcttagaaagtaacgattttgaaatgctaaataaagtacttcaaactaggaat gtaaaccttataaagaagactgtattaaggatgcccctgcatactattattccgttgtta caagagttgcctgacctggtaccccagctggggacactctaccagttaatggaaagcaga gtcaaaacttttcagaaactttcacaccttcatggaaagcttattcttctaattacacaa gtaacagcatcagagaagacaaagggagcaacttcccctggacagaaggcaaagttggtg tatgaagaagagtcttctgaagaggagtctgatgatgaaatagcagataaggattctgaa gataattgggatgaagatgaggaggagagtgaaagtgaaaaagatgaggacgttgaagag gaagatgaggatgccgaaggaaaagatgaagaaaatggcgaggacagagatacagcaagt gaaaaagaattaaatggagattctgatttagatcctgaaaatgaaagtgaagaagaatga >gi568815596r:28750202_28970277|GENSCAN_predicted_peptide_6|145_aa MRSRSSVTGLAGVIQTEDARCGQSMGWVVLRLRMVLADSVFDAISQSRIANQPSKRNCVC MLYLFATSNIPLSGAGCSRWVVTLLCTQESCIINQFHNWKIAFGPECGTGVVLLPADHSP FVNRNCKQTVDWLEFGCFESPQAPT >gi568815596r:28750202_28970277|GENSCAN_predicted_CDS_6|438_bp atgagaagccgtagcagtgtaacaggattagcaggcgtcattcagactgaggacgcccgg tgtggacaaagcatggggtgggtagtgcttcgtcttcgcatggtgctggctgactctgta tttgatgcaatttcacagagtcgaattgctaaccagccttccaagagaaattgtgtttgc atgttgtatttatttgcgacaagtaacattcctctcagtggagcaggatgttctcgttgg gtggtgacccttctttgtactcaggagagttgcataataaaccagtttcataactggaag attgcgtttggaccagagtgtggaactggggtcgtcttactgcctgctgaccactctcct tttgtcaacaggaattgtaaacaaacagttgactggctggaatttggatgttttgagtct cctcaggcacccacttag >gi568815596r:28750202_28970277|GENSCAN_predicted_peptide_7|135_aa MRKLTLVEGHEGETSIQASYAQQLCQTKELLQLLLTAGHFFGLASVARWGVWSGPGRVHV ACDQNSTSPYPPEKDAGTGAHKAGTRSPQLHFTEKQEAPQASIDQGMAAVVLRSAAVTQQ QQQHHVGRILWPPSG >gi568815596r:28750202_28970277|GENSCAN_predicted_CDS_7|408_bp atgaggaaattgacactcgtggaaggacatgagggggaaacatcaattcaagcttcctat gctcagcaactgtgtcagactaaagagctgctgcagcttctcttgacagctgggcacttt ttcgggttagcatctgtagcccggtggggagtttggtcaggtccagggagggtacatgta gcctgtgaccagaattccacctctccttacccacctgagaaagatgctggcacaggtgca cataaggcaggcacgaggtcgccgcaacttcactttactgagaaacaggaagcaccccaa gcatctatcgatcagggaatggcagcagtggttctccgaagcgcagccgtaacacagcag cagcagcagcatcacgtgggacgcattctttggcccccatctggctga