GENSCAN 1.0 Date run: 2-Nov-116 Time: 22:05:51 Sequence gi568815587r:105909538_106110917 : 201380 bp : 36.15% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 898 1008 111 0 0 75 115 74 0.863 8.46 1.02 Intr + 9175 9381 207 0 0 99 86 202 0.867 19.45 1.03 Intr + 14862 15232 371 2 2 96 92 247 0.999 18.88 1.04 Intr + 17204 17402 199 2 1 35 89 220 0.968 15.23 1.05 Intr + 24185 24432 248 1 2 83 98 275 0.858 23.33 1.06 Intr + 50402 50612 211 2 1 99 61 96 0.007 5.99 1.07 Intr + 51074 51171 98 1 2 48 50 71 0.151 -2.81 1.08 Intr + 56371 56550 180 1 0 65 64 112 0.225 4.56 1.09 Intr + 62377 62491 115 1 1 28 86 113 0.672 4.63 1.10 Term + 64773 65018 246 2 0 114 50 259 0.967 19.51 1.11 PlyA + 65892 65897 6 1.05 2.00 Prom + 72003 72042 40 -6.75 2.01 Init + 72646 72808 163 0 1 56 110 86 0.069 7.54 2.02 Term + 87595 87800 206 2 2 -4 41 468 0.999 29.45 2.03 PlyA + 87993 87998 6 1.05 3.03 PlyA - 88033 88028 6 1.05 3.02 Term - 100573 99998 576 1 0 66 34 776 0.995 63.48 3.01 Init - 101380 100919 462 1 0 59 100 423 0.909 36.34 3.00 Prom - 111261 111222 40 -4.55 4.00 Prom + 125313 125352 40 -3.35 4.01 Init + 126554 126603 50 1 2 55 111 78 0.913 7.27 4.02 Term + 128580 128652 73 0 1 109 52 91 0.998 4.00 4.03 PlyA + 128674 128679 6 1.05 5.04 PlyA - 128702 128697 6 1.05 5.03 Term - 132752 132564 189 2 0 67 32 107 0.024 -0.53 5.02 Intr - 144918 143367 1552 2 1 121 60 633 0.043 52.17 5.01 Init - 148075 147996 80 1 2 60 62 39 0.096 -0.92 5.00 Prom - 156119 156080 40 -5.95 6.00 Prom + 164135 164174 40 -4.15 6.01 Init + 165501 165648 148 2 1 31 35 138 0.578 3.10 6.02 Intr + 166950 167209 260 1 2 -54 -58 345 0.785 1.96 6.03 Intr + 167326 167495 170 0 2 68 96 78 0.985 4.42 6.04 Intr + 167757 168016 260 0 2 84 24 275 0.958 16.88 6.05 Intr + 168061 168356 296 2 2 -41 34 377 0.553 15.20 6.06 Intr + 169930 170155 226 0 1 57 97 195 0.886 13.94 6.07 Intr + 181020 181141 122 1 2 97 10 25 0.063 -5.11 6.08 Intr + 181779 181940 162 2 0 29 94 117 0.929 5.65 6.09 Intr + 185046 185117 72 2 0 105 58 59 0.822 3.28 6.10 Intr + 187206 187356 151 2 1 101 42 81 0.269 3.61 6.11 Term + 191795 191916 122 0 2 41 53 81 0.035 -2.44 6.12 PlyA + 194648 194653 6 1.05 7.02 PlyA - 194862 194857 6 1.05 7.01 Term - 197854 197601 254 2 2 76 41 317 0.976 20.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:105909538_106110917|GENSCAN_predicted_peptide_1|661_aa VGYWNDMDKLVLIQDVPTLGNDTAAIENRTVVVTTIMESPYVMYKKNHEMFEGNDKYEGY CVDLASEIAKHIGIKYKIAIVPDGKYGARDADTKIWNGMVGELVYGKAEIAIAPLTITLV REEVIDFSKPFMSLGISIMIKKPQKSKPGVFSFLDPLAYEIWMCIVFAYIGVSVVLFLVS RFSPYEWHTEEPEDGKEGPSDQPPNEFGIFNSLWFSLGAFMQQGCDISPRSLSGRIVGGV WWFFTLIIISSYTANLAAFLTVERMVSPIESAEDLAKQTEIAYGTLDSGSTKEFFRRSKI AVYEKMWTYMRSAEPSVFTRTTAEGVARVRKSKGKFAFLLESTMNEYIEQRKPCDTMKVG GNLDSKGYGVATPKGSSLRKMSLKEAGKQPGWLPAPSSRTSHLEGHQPDASRIALAIGCL TTPVGGSHSVGWHREQDLFNEALCPLVERELKQLRQQAAAAGAGCPSPREFRRFKQIPAE RLCAVSYKICFIVSRNAVNLAVLKLNEQGLLDKLKNKWWYDKGECGSGGGDSKVSLNVTK SGTPVNLAVLKLSEAGVLDKLKNKWWYDKGECGPKDSGSKDKTSALSLSNVAGVFYILVG GLGLAMLVALIEFCYKSRAEAKRMKVAKSAQTFNPTSSQNTQNLATYREGYNVYGTESIK I >gi568815587r:105909538_106110917|GENSCAN_predicted_CDS_1|1986_bp gttggttactggaatgatatggataagttagtcttgattcaagatgtaccaactcttggc aatgacacagctgctattgagaacagaacagtggttgtaaccacaattatggaatcccca tatgttatgtacaagaaaaatcatgaaatgtttgaaggaaatgacaagtatgaaggatac tgtgtagatttggcatctgaaattgcaaaacatattggtatcaagtataaaattgccatt gtccctgatggaaaatatggagcaagggatgcagacacaaaaatctggaatgggatggta ggagaacttgtttatgggaaagcagagattgctattgcccctctgacaatcactttggta cgagaggaggtcattgacttttctaagcccttcatgagtttgggcatatctatcatgatc aaaaagcctcagaaatccaaaccaggagtgttttccttcttggatcctctggcctatgag atttggatgtgcatagtctttgcctacattggtgtcagcgtggtcttattcctagttagt agatttagtccatatgagtggcacacagaagagccagaggacggaaaggaaggacccagc gaccagcctcccaatgagtttggcatctttaacagcctctggttttccctgggtgctttt atgcagcaaggatgtgacatttcacccagatccctctcaggtcgaattgttggaggtgtt tggtggttctttacactcatcattatatcatcttatactgctaacctcgctgctttcctg acggttgagcgaatggtctctcccatagaaagtgcagaagacctggccaaacaaacagaa attgcctatggaacactggattcaggatcaacaaaagaattcttcagaagatcaaaaata gcagtgtatgaaaagatgtggacctacatgcgatcagcagagccatcagtattcactagg actacagctgagggagtagctcgtgtccgcaaatccaagggcaaatttgcctttctcctg gagtccactatgaatgaatacattgagcagcgaaagccatgtgacacgatgaaagtggga ggaaatctggattccaaaggctatggagtagcaacgcccaagggttcctcattaaggaag atgtcactcaaggaggctggaaagcagccaggatggctgcctgctccttcttctaggacc tctcaccttgaggggcaccaacctgatgccagtaggatcgctctggcgatagggtgtctg acaacccctgttggagggtctcattcagttgggtggcacagggagcaggacctattcaat gaagcactttgtcccttggtggagagggagctgaaacagcttcgacagcaggcagcagca gctggtgctggttgcccctccccccgggagttccgtaggttcaagcagattccagctgag aggctctgtgcagtttcttacaaaatatgttttatcgtttcaagaaatgctgttaacctc gcagttttaaaactgaatgaacaaggcctcttggacaaattgaaaaacaaatggtggtac gacaaaggagaatgtggcagcgggggaggtgactccaaggttagcctcaatgtcaccaaa agcggaactcctgtaaaccttgccgttttgaaactcagtgaggcaggcgtcttagacaag ctgaaaaacaaatggtggtacgataaaggtgaatgtggacccaaggactctggaagcaag gacaagacgagtgccttgagcctgagcaatgtagcaggcgtcttctacattctggttggc ggcttgggcttggcaatgctggtggctttgatagagttctgttacaagtccagggcagaa gcgaagagaatgaaggtggcaaagagtgcacagacttttaacccaacttcctcgcagaat acccagaatttagcaacctatagagaaggttacaacgtatatggaaccgaaagtattaaa atttag >gi568815587r:105909538_106110917|GENSCAN_predicted_peptide_2|122_aa MVPNLMIIRPMIIQVYDDMHSVQYSISYMRYSTLSNKIGFVLDDFEANISVLNTGGAGEG EGGGGGEAEAEEEEEGEGEEEKEGEEEEEGEEEEGEEEDNKRKRKRKEEEKKEEEEEEKE KK >gi568815587r:105909538_106110917|GENSCAN_predicted_CDS_2|369_bp atggtccccaacttaatgatcattcgacccatgattattcaagtttatgatgatatgcat tcagtacagtactcgataagttacatgagatattcaacacttagtaataaaataggcttt gtgttagatgactttgaggctaatataagcgttctgaacacaggaggagcaggagaagga gaaggaggaggaggaggagaagcagaagcagaagaagaggaggagggggaaggggaagag gaaaaggaaggggaagaggaggaggaaggtgaggaggaagaaggagaagaagaagacaac aagaggaagaggaagaggaaggaggaggagaagaaggaggaggaggaggaggagaaggag aagaaataa >gi568815587r:105909538_106110917|GENSCAN_predicted_peptide_3|345_aa MKQLKRKRKSNFSVQETQTLLKEITKRKEVIFSKQLNTTINVMKRMAWEEIAQCVNAVGE GEQRTGTEVKRRYLDWRALMKRKRMKANIKLVGSGFPLPSSDLDDSLTEEIDEKIGFRND ANFDWQNVADFRDAGGSLTEVKVEEEERDPQSPEFEIEEEEEMLSSVIPDSRRENELPDF PHIDEFFTLNSTPSRSAYDEPHLLVNIEKQKLELEKRRLDIEAERLQVEKERLQIEKERL RHLDMEHERLQLEKERLQIEREKLRLQIVNSEKPSLENELGQGEKSMLQPQDIETEKLKL ERERLQLEKDRLQFLKFESEKLQIEKERLQVEKDRLRIQKEGHLQ >gi568815587r:105909538_106110917|GENSCAN_predicted_CDS_3|1038_bp atgaagcagttgaaaagaaaaaggaaaagcaattttagtgttcaagaaactcagaccctt ttgaaagaaattacgaaaaggaaagaagtcattttttccaagcagctcaatacaacaatt aatgtgatgaagcgaatggcttgggaagagattgcacagtgtgtgaatgctgtaggagaa ggagaacagaggacagggacagaggtgaaaagaaggtaccttgactggcgagcacttatg aagagaaagaggatgaaggccaacattaagctggttggttcaggatttccccttccctcc tctgatttggatgactctctcactgaagagatagatgaaaagattggattccgaaatgat gcaaattttgactggcaaaatgtggcagatttcagggatgcaggtggatccttaactgag gtcaaggtggaagaggaagaaagggatccgcagagtcctgaatttgaaattgaggaggag gaagaaatgttgtcatccgtcataccagattccaggagagaaaatgaacttcccgatttc ccccacattgatgagttttttacccttaactcaacaccatctagatctgcatatgatgag cctcatttgctcgtaaatattgagaaacagaaactagagttggaaaaacgacgactggat atcgaggccgaaaggctgcaggtagaaaaggaacgcctacaaatcgagaaagagaggctg cggcatttagacatggaacatgagcggcttcagctagagaaggagcggctgcagattgaa agagaaaagttgaggttacagatagtcaattcagagaaaccgtccttggaaaatgaactt ggtcaaggagaaaaatccatgcttcaaccacaggacatagaaacagagaagttaaaactt gagcgagaacgcttgcaactggaaaaggataggctgcagtttttgaagtttgaatctgag aagctgcagattgaaaaggaacgcttacaggtagagaaagacagacttcgaattcagaaa gaaggacacttgcagtga >gi568815587r:105909538_106110917|GENSCAN_predicted_peptide_4|40_aa MSRGEEKMPRSTEAADSDWKPHKASPEADAAVLPMKPAEP >gi568815587r:105909538_106110917|GENSCAN_predicted_CDS_4|123_bp atgtcacgtggtgaagaaaagatgccgagaagcactgaggcagcagacagtgattggaag cctcataaggcctccccagaagcagatgctgctgtgcttcctatgaagcctgcagaacca tga >gi568815587r:105909538_106110917|GENSCAN_predicted_peptide_5|606_aa MMTADFHFCEVKGQTHDLAITVLIVSRAMFEVNMKERDDGSVTITNLSSKAVKAFLDYAY TGKTKITDDNVEMFFQLSSFLQVSFLSKACSDFLIKSINLVNCLQLLSISDSYGSTSLFD HALHFVQHHFSLLFKSSDFLEMNFGVLQKCLESDELNVPEEEMVLKVVLSWTKHNLESRQ KYLPHLIEKVRLHQLSEETLQDCLFNEESLLKSTNCFDIIMDAIKCVQGSGGLFPDARPS TTEKYIFIHKTEENGENQYTFCYNIKSDSWKILPQSHLIDLPGSSLSSYGEKIFLTGGCK GKCCRTVRLHIAESYHDATDQTWCYCPVKNDFFLVSTMKTPRTMHTSVMALDRLFVIGGK TRGSRDIKSLLDVESYNPLSKEWISVSPLPRGIYYPEASTCQNVIYVLGSEVEITDAFNP SLDCFFKYNATTDQWSELVAEFGQFFHATLIKAVPVNCTLYICDLSTYKVYSFCPDTCVW KGEGSFECAGFNAGAIGIEDKIYILGGDYAPDEITDEVQVYHSNRSEWEEVSPMPRALTE FYCQLTSLALHMLHLHTSRVMTICVFEKKKVEGYLMESRRDPDLQVLLRTKKKHSNPVAG AQTLLL >gi568815587r:105909538_106110917|GENSCAN_predicted_CDS_5|1821_bp atgatgacagcagatttccatttctgcgaagttaagggtcagacacatgaccttgcaatt actgtgttaattgtatcaagggctatgtttgaagtaaacatgaaagaaagagatgatgga agtgttaccattactaatttgtcctccaaggcagtaaaagcatttctcgattatgcctat actggaaaaacaaaaataacagatgataatgtggaaatgttcttccagttgtcatcattt cttcaagtttccttcctatccaaagcttgcagtgactttttaataaaaagtattaatctt gtcaattgtttacagttattatctatatcagatagctatggctccaccagtttgtttgat catgcattacactttgtacaacatcacttttctttattatttaaatccagtgatttctta gagatgaattttggagtactacagaaatgtctggaatcagatgaattaaatgttcctgaa gaagaaatggtactgaaagttgtccttagttggactaaacataacttagaatcaaggcaa aagtatctgcctcatttgattgaaaaagtgagattacatcagttatctgaggagacactt caggactgtctgttcaatgaagagagtttactcaaaagcacaaactgttttgacataatc atggatgcaattaagtgtgtgcaaggttctggtggactcttccctgatgctcgaccatcc acaactgagaaatacatattcattcacaaaactgaggaaaatggagaaaatcaatataca ttttgctataacattaaatctgattcatggaaaatactgccgcaatcacacctgattgat ttgccaggatctagtctttcgagttacggagagaaaatattcttgacaggtggttgcaaa gggaaatgttgtcgaacggttcgactgcatattgccgagtcatatcatgatgccactgat caaacctggtgctactgtccagtgaaaaatgatttcttcttggtatcaactatgaaaaca ccaagaaccatgcatacatcagttatggctctcgatagattatttgtcataggtggaaaa actagaggatcccgggacattaaaagtctcttagatgttgaatcttacaatcctctttcc aaagaatggatatctgttagcccattacccagaggcatatactatccagaagcaagcaca tgccaaaatgtaatttatgttcttggatcagaggtagagattacagatgcttttaaccca tcacttgattgcttttttaaatacaatgctacaactgatcagtggtctgaactagtagca gagtttgggcaattttttcatgcaacattaattaaagctgtaccagtaaactgtacactg tatatatgtgacctttccacctataaggtttatagtttttgtccagacacttgtgtttgg aaaggcgaaggatcttttgagtgtgcaggctttaatgcaggtgcaattggaattgaagat aaaatttatatattaggtggtgattatgcaccagatgaaatcacagatgaagtgcaggtc taccacagcaacaggtctgaatgggaagaagtttcaccaatgcctagagccttaacagaa ttttactgccagctaacatctctggccttacatatgctacacttacatacgtccagagtt atgacaatctgtgtctttgagaagaaaaaggtggaaggctatcttatggagagtaggagg gatccagacttacaggttctgttgagaacaaagaagaaacactcaaatccagtggctgga gcccagactttgttgttataa >gi568815587r:105909538_106110917|GENSCAN_predicted_peptide_6|662_aa MEEVMSHIGSCVTNVTQTIGGRAGNRIRVPQNPAKKPQISTKQDDHYRLSNCVNYYRWPL ADISGQSEEEGNRLNPAVPHFNRPPGSHLQSSAETNFSLDQNLQGTWIAWAFIRESTTAY TEQLRKQTYINMTLRKCSSPGLAIAKQHLGRFIAPNKQPALAQLLAGSLGSRRGQCCALG IVPLLGLRREGFRPRSSPTCNCSLRSLHPSLPASRGLVAIGLHRCPVNSSASVSGVTVGL LALVPPFARLRSLAYLTSYNAAKRRADLRLRPQLLSPRGVCVATARPLRTDAGKKGVGPR LRPRHQARDSGEVRFQCMVFPAKRFCLVPSMEGVRWAFSCGTWLPSRAEWLLAVRSIQPE EKERIGQFVFARDAKAAMAGRLMIRKLVAEKLNIPWNHIRLQRTAKGKPVLAKDSSNPYP NFNFNISHQGDYAVLAAEPELQVGIDIMKTSFPGRGSIPEFFHIMKRKFTNKEWETIRSF KDEWTQLDMFYRNWALKESFIKAIGVGLGFELQRLEFDLSPLNLDIGQVYKETRLFLDGE EEKEWAFEESKIDEHHFVAVALRKPDGSRHQDVPSQDDSKPTQRQFTILNFNDLMSSAVP MTPEDPSFWDCFCFTEEIPIRNEAFSNRQVKSHKFDAPDSSMSVFGCYLAPFKALHAFSS VA >gi568815587r:105909538_106110917|GENSCAN_predicted_CDS_6|1989_bp atggaagaggttatgtcacacataggatcctgtgtgaccaatgtcacacagacaattggt ggtagagctggaaatagaatacgagtccctcagaatcctgctaagaagccacagatcagc accaagcaagatgatcattacagactgagtaattgtgttaattactaccgctggcctttg gctgatatttccgggcagtcagaggaagaaggaaatcggctgaatcccgcagtaccgcac tttaatcgacctccaggtagtcatttacagtcatctgcagagaccaacttttccctggac caaaacctgcagggcacttggatcgcgtgggctttcatacgagagtcaacaacagcctac acggaacaactaaggaaacagacatacatcaatatgacccttcgcaaatgctctagccca ggcttggcaatagcaaaacagcacctgggtcgcttcatcgccccaaataaacagccagca ttagctcagctgctcgctggcagcctcggatccaggcggggtcagtgttgcgcactgggg atagtgcctctgctcggccttcggagggagggtttcagaccccggagctcacccacctgc aactgcagtctccgaagtctccaccccagtctccctgcctccagaggactggttgcgatt ggcctgcaccgctgtcccgttaactcttccgcaagtgtgagtggcgtaactgtcgggctc ttggccttggtcccgcccttcgctcgcctccgaagcctcgcctacttgacgtcatacaat gccgcaaagcgcagggctgatctccgtctccgcccccagctgctttctccgagaggagtc tgcgtagcgacggcccgtcccctgcgcacggacgccgggaagaagggggtggggccacgt ttgcgtccgcgccatcaggcccgagatagcggcgaggtccgctttcagtgtatggttttc cctgccaaacggttctgcttggtgccatccatggagggcgtgcgctgggccttttcctgc ggcacttggctgccgagccgagccgaatggctgctggcagtgcgatcgattcagcccgag gagaaggagcgcattggccagttcgtctttgcccgggacgctaaggcagccatggctggt cgtctgatgataaggaaattagttgcagagaaattgaatatcccttggaatcatattcgt ttgcaaagaactgcaaaaggaaaaccagttcttgcaaaggactcatcgaatccttacccg aatttcaactttaacatctctcatcaaggagactatgcagtgcttgctgctgaacctgag ctgcaagttggaattgatataatgaagactagttttccaggtcgtggttcaattccagaa ttctttcatattatgaaaagaaagtttaccaacaaagaatgggaaacaatcagaagcttt aaggatgagtggactcagctggatatgttttataggaattgggcacttaaggaaagcttc ataaaagccattggtgttggactaggatttgaattgcagcggcttgaatttgatctatct ccattaaacttggatataggccaagtttataaagaaacacgtttattcctggatggagag gaagaaaaagaatgggcatttgaggaaagcaaaatagatgagcaccattttgttgcagtt gctcttaggaaacccgatggatctagacatcaggatgttccatctcaggatgattccaaa ccaacccagaggcaatttactattctcaactttaatgatttaatgtcatctgccgttccc atgacacctgaagatccttcattttgggactgtttttgcttcacagaagaaattccaata cgaaatgaagccttcagcaataggcaagtgaaatcccataaatttgatgctcctgatagt tccatgagtgttttcggttgttacttagcaccctttaaagctcttcatgctttttccagt gttgcatga >gi568815587r:105909538_106110917|GENSCAN_predicted_peptide_7|84_aa XCGNDVRQRANSSVFLFEFKMGCKAAKTTRNISNTYGPGTANECTVQWWFKKFCKGDESL EDEECTGQPLEVDNDQLRATIEPS >gi568815587r:105909538_106110917|GENSCAN_predicted_CDS_7|255_bp nactgtggaaacgatgttagacaaagagcaaattcgagcgttttcttattcgagttcaaa atgggttgtaaagcagcaaagacaactcgcaatatcagcaacacatatggcccaggaact gctaatgaatgtacagtgcagtggtggttcaagaagttttgcaaaggagacgagagcctt gaagatgaggagtgtaccggccagccattggaagttgacaacgaccaattgagagcaacc atcgaaccctcttaa