GENSCAN 1.0	Date run: 5-Nov-116	Time: 12:23:21

Sequence gi568815590r:81343401_81547386 : 203986 bp : 37.34% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.09 PlyA -    579    574    6                              -0.45
 1.08 Term -    843    717  127  2  1   43   46    99 0.002  -2.03
 1.07 Intr -   6128   6039   90  1  0   72   99    78 0.010   5.49
 1.06 Intr -  17877  17747  131  0  2   56   16   135 0.214   1.67
 1.05 Intr -  20026  19873  154  0  1   56   77    79 0.465   2.75
 1.04 Intr -  22437  22267  171  2  0   68   42   114 0.011   2.94
 1.03 Intr -  43316  43045  272  2  2   34   71   149 0.001   3.22
 1.02 Intr -  46960  46889   72  1  0   90   97    67 0.007   6.48
 1.01 Init -  53347  53303   45  1  0   53  106    22 0.009   1.23
 1.00 Prom -  73782  73743   40                              -4.25

 2.00 Prom +  78188  78227   40                              -2.65
 2.01 Sngl +  84814  85728  915  0  0   42   43   667 0.998  53.66
 2.02 PlyA +  85956  85961    6                               1.05

 3.00 Prom +  86125  86164   40                              -6.15
 3.01 Init +  86218  87255 1038  0  0   44   41   511 0.894  36.03
 3.02 Term +  87543  88409  867  2  0  -14   38   392 0.557  15.68
 3.03 PlyA +  88585  88590    6                               1.05

 4.12 PlyA -  89382  89377    6                               1.05
 4.11 Term -  95288  94887  402  2  0   40   36   163 0.210   0.47
 4.10 Intr - 101201 101100  102  2  0   70   95    75 0.876   5.85
 4.09 Intr - 101589 101417  173  1  2   95   84    84 0.979   7.44
 4.08 Intr - 104013 103914  100  0  1   52   86    66 0.137   1.56
 4.07 Intr - 115033 114989   45  1  0   99   70    31 0.147   0.09
 4.06 Intr - 115303 115202  102  1  0   73  103    50 0.940   4.45
 4.05 Intr - 115937 115765  173  0  2   71   77   162 0.458  12.14
 4.04 Intr - 126163 126106   58  1  1  105   68    -1 0.003  -2.86
 4.03 Intr - 136115 136014  102  2  0   89   89    90 0.976   8.65
 4.02 Intr - 137198 137026  173  0  2   75  100   144 0.917  13.04
 4.01 Init - 139767 139695   73  0  1   46   82    64 0.522   2.88
 4.00 Prom - 150824 150785   40                              -5.05

 5.06 PlyA - 151141 151136    6                               1.05
 5.05 Term - 178951 178280  672  1  0   32   41   575 0.494  39.96
 5.04 Intr - 183721 183620  102  1  0   95   89    97 0.960   9.95
 5.03 Intr - 186210 186038  173  1  2   84  100   116 0.966  11.14
 5.02 Intr - 196343 196218  126  0  0  122   61    81 0.132   8.53
 5.01 Init - 198969 198960   10  0  1  100  113     2 0.645   4.52

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 187915 187843   73  1  1   56   80    74 0.975   4.68
S.002 Term - 196343 196177  167  0  2  122   28   108 0.806   5.30

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:81343401_81547386|GENSCAN_predicted_peptide_1|353_aa
MEHRNKKVHTEESRKTLKTDDVNITRIIARLLSTDAGDLLPTRTVSAERGTVLLGGRTAV
QPPVPGSEDSASSLTSAYHSQCLNVLSGGLKARLPAQLHPLSTRACHSGNYPSQASLPSL
APDYSYWSLRNMDAAGGHYPKRINARTENQILNVLTYKWELNIGYTWTQRRKQQTLVLAR
GWRVEGDTESIPLSLTQLLSKLELLTPLADNSLSPGLLIDNIAPLVRVPRCMQQKLTVSV
YHLMTERPFQRRGFTRKSFAVLKKPLDFPTYSCPNEDKRFQNQRSQASFSYFLINIKPVT
LELQGSAEVQNRPTGLIGKKWRTGNSRQLTVLNVELEPQPVVTSVSPSCRKDA

>gi568815590r:81343401_81547386|GENSCAN_predicted_CDS_1|1062_bp
atggagcacagaaataagaaagtacacactgaagagagtagaaaaaccttgaagactgat
gatgtcaatattactaggattattgcaagattgctaagtacagatgctggagatttgcta
cctacccgcactgtttcagctgagaggggcactgtcctccttggtggcagaactgcagtg
cagccaccagtgcctggatctgaggattctgccagcagcctcacttctgcctaccacagt
cagtgcctgaatgtactatcagggggcctgaaagccagactaccagcccaattgcatccc
ctcagtactagagcatgccattcggggaattacccatctcaggccagtctaccatcattg
gcacctgattactcctactggagtctcaggaacatggatgcagctggaggccattatcct
aaacgaattaatgcaagaacagaaaaccaaatactgaatgtcctcacttataagtgggag
ctaaacattgggtacacatggacacaaagaaggaaacagcagacactggtgcttgcaaga
gggtggagggtggaaggagatactgagagcattcctctatcacttactcagttattgtca
aagcttgagttactgacacccctggctgacaacagtttaagtcctggcttactcattgat
aacattgctcctctagtcagagttcctagatgcatgcaacagaaactgacagtgagtgtc
tatcacttaatgacagaaaggccctttcagaggagaggatttacccgaaagagctttgct
gttctgaagaagcccctggatttccccacatattcctgtccaaacgaggacaagagattc
caaaatcagagatctcaggcatcttttagttacttcttgatcaacatcaagccagttaca
ttagaactacaaggcagtgctgaagttcaaaacagacctacgggtcttatagggaaaaaa
tggagaacagggaacagcaggcagctgactgtgctgaatgtagagctggaaccacaacca
gtagttaccagtgtctcaccctcctgcaggaaggatgcctga

>gi568815590r:81343401_81547386|GENSCAN_predicted_peptide_2|304_aa
MENDFDELREEGFRRSNYSELQEDVQTKGKEVENFEKNLEECITRITNTEKCLKELMELK
TKARELREECRSLRSRCNQLEERVSAMEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVK
RPNLHLIGVPESDGENGTKLENTLQDIIQENFPNLARQAKIHIQEIQRTPQRYSSRRATP
RHIIVRFTKVEMKEKMLRAAREKGRVTLKGKPIRLTADLSAETLQARREWGPIFNILKEK
NFQPRISYPAKLSFISEGEIKYFTDKQMLRDFVTTRPALKELLKEALNMERNNRYQPLQN
HAKM

>gi568815590r:81343401_81547386|GENSCAN_predicted_CDS_2|915_bp
atggagaatgactttgacgagctgagagaagaaggcttcagacgatcaaattactctgag
ctacaggaggacgttcaaaccaaaggcaaagaagttgaaaactttgaaaaaaatttagaa
gaatgtataactagaataaccaatacagagaagtgcttaaaggagctgatggagctgaag
accaaggctcgagaactacgtgaagaatgcagaagcctcaggagccgatgcaatcaactg
gaagaaagggtatcagcaatggaagatgaaatgaatgaaatgaagcgagaagggaagttt
agagaaaaaagaataaaaagaaatgagcaaagcctccaagaaatatgggactatgtgaaa
agaccaaatctacatctgattggtgtacctgaaagtgatggggagaatggaaccaagttg
gaaaacactctgcaggatattatccaggagaacttccccaatctagcaaggcaggccaag
attcacattcaggaaatacagagaacgccacaaagatactcctcgagaagagcaactcca
agacacataattgtcagattcaccaaagttgaaatgaaggaaaaaatgttaagggcagcc
agggagaaaggtcgggttaccctcaaagggaagcccatcagactaacagcggatctctcg
gcagaaaccctacaagccagaagagagtgggggccaatattcaacattcttaaagaaaag
aattttcaacccagaatttcatatccagccaaactaagcttcataagtgaaggagaaata
aaatactttacagacaagcaaatgctgagagattttgtcaccaccaggcctgccctaaaa
gagctcctgaaggaagcgctaaacatggaaaggaacaaccggtaccagccgctgcaaaat
catgccaaaatgtaa

>gi568815590r:81343401_81547386|GENSCAN_predicted_peptide_3|634_aa
MGDFNTPLSTLDRSMRQKVNKDTEELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS
KIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWV
HNKMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK
ELEKQEQTHSKVSRSQEITKTRAELKEIETQKTLQKINESRSWFFERINKIDRLLAILIK
KKREKNQIDVIKNDKGDITTDPTEIETTIREYYKHLHANKLENLEEMDKFLNTYTLPRLN
QEEIESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELHINRAKDKNHMIIS
IDAEKAFDKIQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTR
QGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNL
LKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKPRNPTYKGREG
PLQEELQTTAQGNKRGYKQMEEHSMLMGRKNQYRENGHTAQGNLQIQCHPHQGTNAFLHR
IGKNYLKVDMEPKRSPHRQINPKPKEQSWRHHTT

>gi568815590r:81343401_81547386|GENSCAN_predicted_CDS_3|1905_bp
atgggagactttaacaccccactgtcaacattagacagatcaatgagacagaaagtcaac
aaggataccgaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga
actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc
aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt
ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc
actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta
cataacaaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca
acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa
gaactagaaaagcaagagcaaacacattcaaaagttagcagaagccaagaaataactaaa
accagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatcc
aggagctggttttttgaaaggatcaacaaaattgatagactgctagcaatactaataaag
aaaaaaagagagaagaatcaaatagacgtaataaaaaatgataagggggatatcaccacc
gatcccacagaaatagaaactaccatcagagaatactacaaacacctccacgcaaataaa
ctagaaaatctagaagaaatggataaattcctcaacacatacactctcccaagactaaac
caggaagaaattgaatctctgaatagaccaataacaggatctgaaattgtggcaataatc
aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag
aggtacaaggaggaactgcatataaacagagccaaagacaaaaaccacatgattatctca
atagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaat
aaattaggtattgatgggatgtatctcaaaataataagagctatctatgacaaacccaca
gccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggtacaaga
cagggatgcccgctctcaccactcctattcaacatagtattggaagttctggccagggca
attaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc
ttgtttgcagacgacatgattgtatatctagaaaaccccattgtctcagcccaaaatctc
cttaagctgataagcaactttagcaaagtctcaggatacaaaatcaatgtacaaaaatca
caagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgaactccca
ttcacaattgcttcaaagagaataaaacctaggaatccaacttacaagggacgtgaagga
cctcttcaagaagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatg
gaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcc
caaggtaatttacagattcaatgccatccccatcaaggtaccaatgcctttcttcacaga
attggaaaaaactacttgaaagttgatatggaaccaaaaaggagcccgcatcgccaaatc
aatcctaagccaaaagaacaaagctggaggcatcacactacctga

>gi568815590r:81343401_81547386|GENSCAN_predicted_peptide_4|500_aa
MCDAFVGTWKLVSSENFDDYMKEVGVGFATRKVAGMAKPNMIISVNGDVITIKSESTFKN
TEISFILGQEFDEVTADDRKVKSTITLDGGVLVHVQKWDGKSTTIKRKREDDKLVVVIII
KNLTTKIISGPLGVTGVNFAARNMAGLVKPTVTISVDGKMMTIRTESSFQDTKISFKLGE
EFDETTADNRKVKSTITLENGSMIHVQKWLGKETTIKRKIVDEKMVVECKMNNIVSTRIY
EKNCVELSPITMSNKFLGTWKLVSSENFDDYMKALGVGLATRKLGNLAKPTVIISKKGDI
ITIRTESTFKNTEISFKLGQEFEETTADNRKTKSIVTLQRGSLNQVQRWDGKETTIKRKL
VNGKMVASTQMRGSQKNNSSNITKLVSLTLPKDHTSSPAMDSSQDEISELPEKEFRTLIL
KLIREAPEKSELQLKEIKDVIQDMKGKFCEIGSINKKQSHLLEIKDTLREMQNALETLSN
RIKQAEERSSELEDKAFELI

>gi568815590r:81343401_81547386|GENSCAN_predicted_CDS_4|1503_bp
atgtgtgatgcttttgtaggtacctggaaacttgtctccagtgaaaactttgatgattat
atgaaagaagtaggagtgggctttgccaccaggaaagtggctggcatggccaaacctaac
atgatcatcagtgtgaatggggatgtgatcaccattaaatctgaaagtacctttaaaaat
actgagatttccttcatactgggccaggaatttgacgaagtcactgcagatgacaggaaa
gtcaagagcaccataaccttagatgggggtgtcctggtacatgtgcagaaatgggatgga
aaatcaaccaccataaagagaaaacgagaggatgataaactggtggtggtcatcataata
aaaaatctcacaacaaagataatctctggacctcttggcgtcacaggagtgaatttcgca
gcccggaacatggcagggttagtgaaaccgacagtaactattagtgttgatgggaaaatg
atgaccataagaacagaaagttctttccaggacactaagatctccttcaagctgggggaa
gaatttgatgaaactacagcagacaaccggaaagtaaagagcaccataacattagagaat
ggctcaatgattcacgtccaaaaatggcttggcaaagagacaacaatcaaaagaaaaatt
gtggatgaaaaaatggtagtggaatgtaaaatgaataatattgtcagcaccagaatctac
gaaaagaactgtgttgagctctcacccatcacgatgagcaacaaattcctgggcacctgg
aaacttgtctctagtgagaactttgacgattacatgaaagctctgggtgtggggttagcc
accagaaaactgggaaatttggccaaacccactgtgatcatcagcaagaaaggagatatt
ataactatacgaactgaaagtacctttaaaaatacagaaatctccttcaagctaggccag
gaatttgaagaaaccacagctgacaatagaaagaccaagagcatcgtaaccctgcagaga
ggatcactgaatcaagtgcagagatgggatggcaaagagacaaccataaagagaaagcta
gtgaatgggaaaatggtagcgtccacccaaatgagagggagccagaaaaacaattctagt
aatataacaaaacttgtttctttaacactcccaaaagatcataccagctcaccagcaatg
gattcaagccaagatgaaatctctgaattgccagaaaaagaattcagaacgttgattctt
aagctaatcagggaggcaccagagaaaagtgaactccaacttaaagaaatcaaagacgtg
atacaagatatgaaaggaaaattctgtgaaataggtagcataaataaaaaacaatcacat
cttctggaaataaaggacacacttagagaaatgcaaaatgcactggaaactctcagcaat
agaatcaaacaagcagaagaaagaagttcagagcttgaagacaaggcttttgaattaatc
taa

>gi568815590r:81343401_81547386|GENSCAN_predicted_peptide_5|360_aa
MPQGIPCIILICLNHMIAQKQTAPASQHSTLRAAFSEQSAEILKQSIGRASRKLGRLAKP
TVTISTDGDVITIKTKSIFKNNEISFKLGEEFEEITPGGHKTKSKVTLDKESLIQVQDWD
GKETTITRKLVDGKMVVPGRPPIASRPSPLHRPLRPPQGPPAAAPAPPLQPPLFSRRHGD
PVHLAGAPGLQPGLRGRHQPPHQPGALRLLLLPVHVLYFDRDDVALKNFAKYFLYQSHEE
REHAEKLMKLQNQRGGRIFLQDIKKPDCDDWESGLNAMECALYLEKNVNQSLLELYKLAT
DKNDPHLCDFIETHYLNEQVKAIKELSDHVTNLRKMGAPESGSAEYLFDKHTLGDSDNES

>gi568815590r:81343401_81547386|GENSCAN_predicted_CDS_5|1083_bp
atgccccaaggaattccgtgtattattttgatttgtctgaatcacatgattgctcagaag
cagacagcaccagcaagccagcactcaaccctaagagcagccttcagtgagcagtctgct
gaaattctaaaacagagtataggaagagccagcaggaaactgggccgtttggcaaaaccc
actgtgaccatcagtacagatggagatgtcatcacaataaaaaccaaaagcatctttaaa
aataatgagatctcctttaagctgggagaagagtttgaggaaatcacgccaggtggccac
aaaacaaagagtaaagtaaccttagataaggagtccctgattcaagttcaggactgggat
ggcaaagaaaccaccataacgagaaagctggtggatgggaaaatggtggtgcccggccgg
ccgcccatagccagccgtccgtcacctcttcaccgccccctcagaccgccccaaggtccc
cccgccgccgctccagcgccgccgctgcagccgcctctctttagtcgccgccatggcgac
cccgtccacctcgcaggagcgccaggactacaaccaggactcagaggccgccatcaaccg
ccacatcaacctggagctctgcgcctcctacttttacctgtccatgtcttatactttgac
cgcgatgatgtggctttgaagaactttgccaaatactttctttaccaatctcatgaggag
agggaacatgctgagaaactgatgaagctgcagaaccaacgaggtggccgaatcttcctt
caggatatcaagaaaccagactgtgatgactgggagagcgggctgaatgcgatggagtgt
gcattatatttggaaaaaaatgtgaatcagtcactactggaactgtacaaactggccact
gacaaaaatgacccccatttgtgtgacttcattgagacacattacctgaatgagcaggtg
aaagccatcaaagaattgagtgaccacgtgaccaacttgcgcaagatgggagcgcccgaa
tctggctcggcagaatatctcttcgacaagcacaccctgggagacagtgataatgaaagc
taa