GENSCAN 1.0	Date run: 4-Nov-116	Time: 03:04:35

Sequence gi568815593r:151564087_151776188 : 212102 bp : 45.76% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 Intr -   4865   1587 3279  0  0  108   84  3121 0.572 301.08
 1.07 Intr -  18337  18198  140  0  2  104   74    12 0.076   1.51
 1.06 Intr -  20350  20281   70  2  1  102   74    46 0.093   2.84
 1.05 Intr -  35191  34973  219  2  0    1  115    78 0.002   0.07
 1.04 Intr -  42032  41917  116  1  2  102   46    55 0.045   2.79
 1.03 Intr -  47431  47288  144  0  0   55  105    26 0.065   0.40
 1.02 Intr -  55892  55867   26  2  2  126   87     6 0.400   1.12
 1.01 Init -  56616  56533   84  0  0   74   64    65 0.549   3.52
 1.00 Prom -  59844  59805   40                              -5.36

 2.00 Prom +  67361  67400   40                              -6.16
 2.01 Init +  68233  68302   70  0  1   95  119    55 0.946  10.51
 2.02 Intr +  74990  75063   74  0  2   49   91    53 0.138   0.83
 2.03 Intr +  90451  90585  135  0  0   87   92    23 0.356   3.36
 2.04 Intr +  90611  90667   57  1  0   76   60    58 0.023   0.88
 2.05 Term +  97171  97203   33  0  0   71   52    71 0.014  -0.61
 2.06 PlyA +  98062  98067    6                               1.05

 3.10 PlyA -  98376  98371    6                               1.05
 3.09 Term -  99513  99485   29  1  2  127   35    62 0.932   3.04
 3.08 Intr - 100149 100001  149  2  2  151   70   290 0.999  33.38
 3.07 Intr - 102423 102275  149  0  2   96   94   263 0.988  26.73
 3.06 Intr - 103514 103381  134  0  2   60  114   248 0.998  24.96
 3.05 Intr - 105698 105578  121  2  1   58   82   194 0.999  15.87
 3.04 Intr - 107608 107487  122  2  2  123  100   183 0.999  23.21
 3.03 Intr - 109130 109043   88  2  1  117   53   122 0.998  11.14
 3.02 Intr - 110588 110526   63  2  0  104   87    66 0.754   7.01
 3.01 Init - 112102 112046   57  1  0   96   95    91 0.998   9.91
 3.00 Prom - 137069 137030   40                              -1.76

 4.05 PlyA - 137282 137277    6                               1.05
 4.04 Term - 182363 182239  125  0  2  117   43   109 0.991   7.85
 4.03 Intr - 184256 184179   78  0  0   76   77    49 0.812   2.12
 4.02 Intr - 187693 187618   76  1  1   79  105    87 0.353   8.59
 4.01 Init - 194465 194460    6  2  0   79  117     9 0.292   3.28
 4.00 Prom - 201294 201255   40                              -3.26

 5.03 PlyA - 201790 201785    6                               1.05
 5.02 Term - 202007 201798  210  2  0   36   35   216 0.994   8.39
 5.01 Init - 202153 202079   75  1  0  107   23   108 0.775   7.31

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:151564087_151776188|GENSCAN_predicted_peptide_1|1360_aa
MQASPCFGADEKARPQALGLHIKFLIAMGFAQLAGVRALLMWLVHPAVTVSMVSPAPSSL
SSSGLLEHLWSLARGKLSQQVKVWRASISIADGHKTSKNQGKTQKDKYCQAIYQFANLFA
PGWKPDGVCLKIPREEVLGNHSCLTKSNQSGPIKSHRKPVSGWQREPESALTCMDHIRVS
RGCQGMEGVALRVLKEGSLKELNKKTDQNIAPRTEKCCRKEFLMVGITSSIHRPALRMRT
QTDPATLLSFVSADGWVEQLPWLRKVGFREFSTMTIALLGFAIFLLHCATCEKPLEGILS
SSAWHFTHSHYNATIYENSSPKTYVESFEKMGIYLAEPQWAVRYRIISGDVANVFKTEEY
VVGNFCFLRIRTKSSNTALLNREVRDSYTLIIQATEKTLELEALTRVVVHILDQNDLKPL
FSPPSYRVTISEDMPLKSPICKVTATDADLGQNAEFYYAFNTRSEMFAIHPTSGVVTVAG
KLNVTWRGKHELQVLAVDRMRKISEGNGFGSLAALVVHVEPALRKPPAIASVVVTPPDSN
DGTTYATVLVDANSSGAEVESVEVVGGDPGKHFKAIKSYARSNEFSLVSVKDINWMEYLH
GFNLSLQARSGSGPYFYSQIRGFHLPPSKLSSLKFEKAVYRVQLSEFSPPGSRVVMVRVT
PAFPNLQYVLKPSSENVGFKLNARTGLITTTKLMDFHDRAHYQLHIRTSPGQASTVVVID
IVDCNNHAPLFNRSSYDGTLDENIPPGTSVLAVTATDRDHGENGYVTYSIAGPKALPFSI
DPYLGIISTSKPMDYELMKRIYTFRVRASDWGSPFRREKEVSIFLQLRNLNDNQPMFEEV
NCTGSIRQDWPVGKSIMTMSAIDVDELQNLKYEIVSGNELEYFDLNHFSGVISLKRPFIN
LTAGQPTSYSLKITASDGKNYASPTTLNITVVKDPHFEVPVTCDKTGVLTQFTKTILHFI
GLQNQESSDEEFTSLSTYQINHYTPQFEDHFPQSIDVLESVPINTPLARLAATDPDAGFN
GKLVYVIADGNEEGCFDIELETGLLTVAAPLDYEATNFYILNVTVYDLGTPQKSSWKLLT
VNVKDWNDNAPRFPPGGYQLTISEDTEVGTTIAELTTKDADSEDNGRVRYTLLSPTEKFS
LHPLTGELVVTGHLDRESEPRYILKVEARDQPSKGHQLFSVTDLIITLEDVNDNSPQCIT
EHNRLKVPEDLPPGTVLTFLDASDPDLGPAGEVRYVLMDGAHGTFRVDLMTGALILEREL
DFERRAGYNLSLWASDGGRPLARRTLCHVEVIVLDVNENLHPPHFASFVHQGQVQENSPS
GTQVIVVAAQDDDSGLDGELQYFLRAGTGLAAFSINQDTX

>gi568815593r:151564087_151776188|GENSCAN_predicted_CDS_1|4080_bp
atgcaggcttcgccctgtttcggggctgatgagaaagcacggccccaggcactggggctt
catattaaattcttaatagccatgggatttgctcagcttgctggggttagggctttgctt
atgtggttggtgcatccggctgtcaccgtgtctatggtgtcacctgctcctagtagcctt
tcatcttctgggctactggagcatctctggagcttggccagggggaagctgtcacagcag
gtcaaggtgtggagagcctccattagtatagcagatggacacaaaacaagcaaaaaccaa
ggcaaaacccaaaaagataaatattgtcaagctatctaccagtttgcaaatctctttgcc
ccagggtggaagccagatggtgtgtgcctgaagatccccagagaggaggtattgggaaat
cacagctgcctcaccaagtcaaaccagtcaggccccatcaagagccacaggaagccagtt
tccgggtggcagagagagcctgagtcagctctcacctgcatggatcacatcagggtgagc
aggggctgtcaagggatggaaggagtggccctgagagtgctaaaagaagggtctttgaag
gaattaaataagaaaacggaccagaatattgctccaagaacagaaaaatgctgcaggaag
gaattcctaatggtgggaatcaccagttcaatccaccggccagctttgcggatgagaaca
cagacagacccagccactctgttgagttttgtgtctgctgatggctgggtggagcagtta
ccttggctaagaaaagtcgggtttcgggagttttccaccatgactattgccctgctgggt
tttgccatattcttgctccattgtgcgacctgtgagaagcctctagaagggattctctcc
tcctctgcttggcacttcacacactcccattacaatgccaccatctatgaaaattcttct
cccaagacctatgtggagagcttcgagaaaatgggcatctacctcgcggagccacagtgg
gcagtgaggtaccggatcatctctggggatgtggccaatgtatttaaaactgaggagtat
gtggtgggcaacttctgcttcctaagaataaggacaaagagcagcaacacagctcttctg
aacagagaggtgcgagacagctacaccctcatcatccaagccacagagaagaccttggag
ttggaagctttgacccgtgtggtggtccacatcctggaccagaatgacctgaagcctctc
ttctctccaccttcgtacagagtcaccatctctgaggacatgcccctgaagagccccatc
tgcaaggtgactgccacagatgctgatctaggccagaatgctgagttctattatgccttt
aacacaaggtcagagatgtttgccatccatcccaccagcggtgtggtcactgtggctggg
aagcttaacgtcacctggcgaggaaagcatgagctccaggtgctagctgtggaccgcatg
cggaaaatctctgagggcaatgggtttggcagcctggctgcacttgtggttcatgtggag
cctgccctcaggaagcccccagccattgcttcggtggtggtgactccaccagacagcaat
gatggtaccacctatgccactgtactggtcgatgcaaatagctcaggagctgaagtggag
tcagtggaagttgttggtggtgaccctggaaagcacttcaaagccatcaagtcttatgcc
cggagcaatgagttcagtttggtgtctgtcaaagacatcaactggatggagtaccttcat
gggttcaacctcagcctccaggccaggagtgggagcggcccttatttttattcccagatc
aggggctttcacctaccaccttccaaactgtcttccctcaaattcgagaaggctgtttac
agagtgcagcttagtgagttttcccctcctggcagccgcgtggtgatggtgagagtcacc
ccagccttccccaacctgcagtatgttctaaagccatcttcagagaatgtaggatttaaa
cttaatgctcgaactgggttgatcaccaccacaaagctcatggacttccacgacagagcc
cactatcagctacacatcagaacctcaccgggccaggcctccaccgtggtggtcattgac
attgtggactgcaacaaccatgcccccctcttcaacaggtcttcctatgatggtaccttg
gatgagaacatccctccaggcaccagtgttttggctgtgactgccactgaccgggatcat
ggggaaaatggatatgtcacctattccattgctggaccaaaagctttgccattttctatt
gacccctacctggggatcatctccacctccaaacccatggactatgaactcatgaaaaga
atttataccttccgggtaagagcatcagactggggatccccttttcgccgggagaaggaa
gtgtccatttttcttcagctcaggaacttgaatgacaaccagcctatgtttgaagaagtc
aactgtacagggtctatccgccaagactggccagtagggaaatcgataatgactatgtca
gccatagatgtggatgagcttcagaacctaaaatacgagattgtatcaggcaatgaacta
gagtattttgatctaaatcatttctccggagtgatatccctcaaacgcccttttatcaat
cttactgctggtcaacccaccagttattccctgaagattacagcctcagatggcaaaaac
tatgcctcacccacaactttgaatattactgtggtgaaggaccctcattttgaagttcct
gtaacatgtgataaaacaggggtattgacacaattcacaaagactatcctccactttatt
gggcttcagaaccaggagtccagtgatgaggaattcacttctttaagcacatatcagatt
aatcattacaccccacagtttgaggaccacttcccccaatccattgatgtccttgagagt
gtccctatcaacacccccttggcccgcctagcagccactgaccctgatgctggttttaat
ggcaaactggtctatgtgattgcagatggcaatgaggagggctgctttgacatagagctg
gagacagggctgctcactgtagctgctcccttggactatgaagccaccaatttctacatc
ctcaatgtaacagtatatgacctgggcacaccccagaagtcctcctggaagctgctgaca
gtgaatgtgaaagactggaatgacaacgcacccagatttcctcccggtgggtaccagtta
accatctcggaggacacagaagttggaaccacaattgcagagctgacaaccaaagatgct
gactcggaagacaatggcagggttcgctacaccctgctaagtcccacagagaagttctcc
ctccaccctctcactggggaactggttgttacaggacacctggaccgcgaatcagagcct
cggtacatactcaaggtggaggccagggatcagcccagcaaaggccaccagctcttctct
gtcactgacctgataatcacattggaggatgtcaacgacaactctccccagtgcatcaca
gaacacaacaggctgaaggttccagaggacctgccccccgggactgtcttgacatttctg
gatgcctctgatcctgacctgggccccgcaggtgaagtgcgatatgttctgatggatggc
gcccatgggaccttccgggtggacctgatgacaggggcgctcattctggagagagagctg
gactttgagaggcgagctgggtacaatctgagcctgtgggccagtgatggtgggaggccc
ctagcccgcaggactctctgccatgtggaggtgatcgtcctggatgtgaatgagaatctc
caccctccccactttgcctccttcgtgcaccagggccaggtgcaggagaacagcccctcg
ggaactcaggtgattgtagtggctgcccaggacgatgacagtggcttggatggggagctc
cagtacttcctgcgtgctggcactggactcgcagccttcagcatcaaccaagatacagnn

>gi568815593r:151564087_151776188|GENSCAN_predicted_peptide_2|122_aa
MDAWMDNGQLNIFSIQRKDNSDAGLLLLIPTDSEGPIIALSPPKPPLQFSSRISAAMSSC
FQLGSVNKRERMEGGKKGEAKVFLTSTLPFSSQLSDGSSSHRKASHDSNFYQVPDECRRM
TE

>gi568815593r:151564087_151776188|GENSCAN_predicted_CDS_2|369_bp
atggatgcatggatggataatggacagttgaatattttcagcatccagaggaaagataat
tctgatgctggccttctcctcctcattccaactgattctgaggggcccatcatcgcattg
tctccacccaaaccaccactgcagttttccagcaggatctcggcagccatgtcatcttgc
ttccagttgggctcagttaataaaagagagagaatggaaggtggaaagaagggagaagcc
aaggtgtttctcaccagcaccctccccttctcctctcagctctccgatgggtccagttcc
cacaggaaagccagccatgattctaacttctaccaggtacctgatgaatgtcgccgaatg
actgaatga

>gi568815593r:151564087_151776188|GENSCAN_predicted_peptide_3|303_aa
MRAWIFFLLCLAGRALAAPQQEALPDETEVVEETVAEVTEVSVGANPVQVEVGEFDDGAE
ETEEEVVAENPCQNHHCKHGKVCELDENNTPMCVCQDPTSCPAPIGEFEKVCSNDNKTFD
SSCHFFATKCTLEGTKKGHKLHLDYIGPCKYIPPCLDSELTEFPLRMRDWLKNVLVTLYE
RDEDNNLLTEKQKLRVKKIHENEKRLEAGDHPVELLARDFEKNYNMYIFPVHWQFGQLDQ
HPIDGYLSHTELAPLRAPLIPMEHCTTRFFETCDLDNDKYIALDEWAGCFGIKQKDIDKD
LVI

>gi568815593r:151564087_151776188|GENSCAN_predicted_CDS_3|912_bp
atgagggcctggatcttctttctcctttgcctggccgggagggccttggcagcccctcag
caagaagccctgcctgatgagacagaggtggtggaagaaactgtggcagaggtgactgag
gtatctgtgggagctaatcctgtccaggtggaagtaggagaatttgatgatggtgcagag
gaaaccgaagaggaggtggtggcggaaaatccctgccagaaccaccactgcaaacacggc
aaggtgtgcgagctggatgagaacaacacccccatgtgcgtgtgccaggaccccaccagc
tgcccagcccccattggcgagtttgagaaggtgtgcagcaatgacaacaagaccttcgac
tcttcctgccacttctttgccacaaagtgcaccctggagggcaccaagaagggccacaag
ctccacctggactacatcgggccttgcaaatacatccccccttgcctggactctgagctg
accgaattccccctgcgcatgcgggactggctcaagaacgtcctggtcaccctgtatgag
agggatgaggacaacaaccttctgactgagaagcagaagctgcgggtgaagaagatccat
gagaatgagaagcgcctggaggcaggagaccaccccgtggagctgctggcccgggacttc
gagaagaactataacatgtacatcttccctgtacactggcagttcggccagctggaccag
caccccattgacgggtacctctcccacaccgagctggctccactgcgtgctcccctcatc
cccatggagcattgcaccacccgctttttcgagacctgtgacctggacaatgacaagtac
atcgccctggatgagtgggccggctgcttcggcatcaagcagaaggatatcgacaaggat
cttgtgatctaa

>gi568815593r:151564087_151776188|GENSCAN_predicted_peptide_4|94_aa
MPKHEFSVDMTCGGCAEAVSRVLNKLGDEVAEACEKQDPPPTDSLPMVTQQVNGVKYDID
LPNKKVCIESEHSMDTLLATLKKTGKTVSYLGLE

>gi568815593r:151564087_151776188|GENSCAN_predicted_CDS_4|285_bp
atgccgaagcacgagttctctgtggacatgacctgtggaggctgtgctgaagctgtctct
cgggtcctcaataagcttggagatgaggtagctgaggcctgtgagaagcaggaccctcct
cccaccgactccctgcccatggtcacccagcaagtcaatggagttaagtatgacattgac
ctgcccaacaagaaggtctgcattgaatctgagcacagcatggacactctgcttgcaacc
ctgaagaaaacaggaaagactgtttcctaccttggccttgagtag

>gi568815593r:151564087_151776188|GENSCAN_predicted_peptide_5|94_aa
MASISELACVYLALILHDDEVIIMEALANVNIGSLICNVGAGGPALAAGAAPAGGPAPSI
AAASAEEKKMEAKKEESEESDDDMGFGLFTKPVL

>gi568815593r:151564087_151776188|GENSCAN_predicted_CDS_5|285_bp
atggcctccatctccgagcttgcctgtgtctacttggccctcattctgcacgatgacgag
gtgatcatcatggaggccctggccaacgtcaacattggaagcctcatctgcaatgtaggg
gctggtggacctgctctagcagctggtgctgcaccagcaggaggtcctgccccctccatt
gctgctgcttcagctgaggagaagaaaatggaagcaaagaaagaagaatctgaggagtct
gatgatgacatgggctttggtctttttactaaacctgttttataa