GENSCAN 1.0 Date run: 3-Nov-116 Time: 20:58:27

Sequence gi568815597f:40638844_40870825 : 231982 bp : 48.40% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +   2916   3206  291  2  0  105   38   184 0.968   8.76
 1.02 PlyA +   3279   3284    6                               1.05

 2.07 PlyA -   4287   4282    6                               1.05
 2.06 Term -  13807  13759   49  0  1  100   47    64 0.062   0.28
 2.05 Intr -  21411  21291  121  1  1  111   19    94 0.046   4.35
 2.04 Intr -  24292  24130  163  1  1   52  101    21 0.013  -0.65
 2.03 Intr -  30616  30584   33  1  0  101   92    12 0.011   1.32
 2.02 Intr -  41149  41082   68  2  2   75  103    38 0.098   2.62
 2.01 Init -  53166  53118   49  0  1  101   53   110 0.186   7.92
 2.00 Prom -  67090  67051   40                              -4.36

 3.00 Prom +  68348  68387   40                              -3.46
 3.01 Init +  91712  91763   52  1  1   40  115    40 0.484   3.22
 3.02 Intr +  99993 100105  113  1  2   77   65   127 0.461   9.40
 3.03 Intr + 108691 108762   72  0  0   96   63   102 0.993   8.10
 3.04 Intr + 110730 110843  114  2  0   73   67    82 0.972   5.24
 3.05 Intr + 114308 114403   96  1  0  119   99   137 0.999  18.11
 3.06 Intr + 119278 119451  174  0  0   91   62   207 0.820  18.54
 3.07 Intr + 124045 124203  159  0  0   98  117   168 0.861  20.88
 3.08 Intr + 127753 127860  108  0  0   93   91   122 0.997  13.48
 3.09 Intr + 130513 130572   60  0  0   85   91    71 0.959   6.03
 3.10 Intr + 131866 131970  105  0  0  116   60   129 0.790  13.31
 3.11 Term + 133321 133605  285  0  0   50   45   164 0.607   3.70
 3.12 PlyA + 134123 134128    6                               1.05

 4.00 Prom + 134298 134337   40                              -5.96
 4.01 Init + 145251 145564  314  2  2   77   84   471 0.903  40.40
 4.02 Intr + 170343 170517  175  0  1  131  -13    90 0.034   3.14
 4.03 Intr + 177484 177638  155  0  2   88   96   -17 0.274  -2.03
 4.04 Intr + 178422 178512   91  0  1   80  111   162 0.932  17.70
 4.05 Intr + 179321 179447  127  1  1   65   20   213 0.999  12.45
 4.06 Intr + 179662 179837  176  2  2  100  100   462 0.999  48.26
 4.07 Intr + 180504 180629  126  2  0  110   86   272 0.997  30.08
 4.08 Intr + 181032 181142  111  2  0   94   78    78 0.995   7.98
 4.09 Intr + 181322 181417   96  1  0   86   87   194 0.975  19.31
 4.10 Intr + 182693 182728   36  1  0  107   74    53 0.926   4.26
 4.11 Intr + 183471 183559   89  2  2   52   96    59 0.972   1.87
 4.12 Intr + 185254 185415  162  1  0   94   97   134 0.985  13.99
 4.13 Intr + 192241 192461  221  1  2   46   98   240 0.408  18.75
 4.14 Intr + 194171 194270  100  0  1   83  110   206 0.956  21.47
 4.15 Intr + 196124 196255  132  2  0  108   72   270 0.999  27.16
 4.16 Intr + 198822 198951  130  0  1   91  109   180 0.997  21.10
 4.17 Intr + 199468 199659  192  0  0   97   57   384 0.986  35.89
 4.18 Intr + 199871 200051  181  1  1   71  100   108 0.987   9.74
 4.19 Intr + 201410 201553  144  0  0   80   21    78 0.122   0.55
 4.20 Term + 207088 207125   38  2  2  113   35    20 0.017  -3.30
 4.21 PlyA + 208166 208171    6                               1.05

 5.07 PlyA - 208833 208828    6                               1.05
 5.06 Term - 209593 209475  119  2  2   53   50    76 0.213  -1.00
 5.05 Intr - 209983 209954   30  2  0  139   94     6 0.897   4.40
 5.04 Intr - 218316 218219   98  2  2  -61   71   320 0.832  15.05
 5.03 Intr - 219052 218964   89  1  2   49   59    40 0.589  -4.03
 5.02 Intr - 223301 222742  560  0  2  129   49   838 0.413  76.55
 5.01 Intr - 231238 231178   61  1  1  120  103   -12 0.432   1.81

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  19277  19395  119  1  2   84  100    79 0.852   8.37

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:40638844_40870825|GENSCAN_predicted_peptide_1|96_aa
MATILAPRLLRRFLAVVVPAPPPACWDPQISPLMLELRTTFLEAPDEAGPGSPLNMVPGV
AGPQAALKMGRKQKDPTSDLDESTAVPPLPSCVTLG

>gi568815597f:40638844_40870825|GENSCAN_predicted_CDS_1|291_bp
atggccaccatcttggcacccaggctgctccgccgcttcttggcggtggtggtcccagcc
ccgcccccggcttgctgggatccgcagatttcaccgctaatgctggagctccgcaccaca
ttcctggaggccccagatgaggcaggacctggctccccgttaaacatggtccccggggtg
gcagggcctcaggcagctctgaagatgggcaggaaacaaaaggatcccacatcagacctg
gatgaatctactgcagtcccaccacttcctagctgcgtgaccttgggctag

>gi568815597f:40638844_40870825|GENSCAN_predicted_peptide_2|160_aa
MAAGRTDRATAAAARGNNRYPEEYKPNQLCWSSKQWLQHNPQHLQKAWIKGRSDFYTVLT
DSGAVSYTGISLTSQKSFTSWGVCELKAEVRNKCSLVGGELAGNGPRFISGIAVLPPGQA
RKLKVLLDTPLFPSHTPITSPQQALCTYLLSNCAAVFASL

>gi568815597f:40638844_40870825|GENSCAN_predicted_CDS_2|483_bp
atggcggcgggcaggacggaccgagctacagcagcggccgcgcgaggaaataatagatac
ccagaggagtataagcccaatcaactgtgctggagctctaagcagtggctacagcataat
ccccagcacctgcaaaaggcctggataaaggggaggagtgacttctacactgtcctcact
gacagtggggctgtctcctacactggtatatcactgacctcacagaagagctttaccagc
tggggtgtgtgtgaactgaaggcggaggttagaaacaagtgctctctggttggaggtgag
cttgctgggaatggtccccggttcatctctggcatcgccgtcctcccacctggtcaagct
agaaagctgaaagttctcctggatacgcccctcttcccgtcccacacccctataaccagt
cctcagcaagccctttgtacctacctcctctccaactgcgctgctgtgttcgccagcctc
tga

>gi568815597f:40638844_40870825|GENSCAN_predicted_peptide_3|445_aa
MTEEDMIKMQVTKVSEEVVEMSTEGGFGGTSSSDAQQSLQSFWPRVMEEIRNLTVKDFRV
QELPLARIKKIMKLDEDVKMISAEAPVLFAKAAQIFITELTLRAWIHTEDNKRRTLQRND
IAMAITKFDQFDFLIDIVPRDELKPPKRQEEVRQSVTPAEPVQYYFTLAQQPTAVQVQGQ
QQGQQTTSSTTTIQPGQIIIAQPQQGQTTPVTMQVGEGQQVQIVQAQPQGQAQQAQSGTG
QTMQVMQQIITNTGEIQQIPVQLNAGQLQYIRLAQPVSGTQVVQGQIQTLATNAQQITQT
EVQQGQQQFSQFTDGQQLYQIQQVTMPAGQDLAQPMFIQSANQPSDGQAPQTELCPAWLR
VKKGASLMEKAGSVKAWGKSFQSCLAQAIIQLPAVAMPINKGMASAIENNPGPLSAKSST
AKTYFLLIEAGRIERAFHFIGHRGQ

>gi568815597f:40638844_40870825|GENSCAN_predicted_CDS_3|1338_bp
atgacagaggaagatatgatcaaaatgcaagtcacaaaagtgagtgaagaagttgtcgag
atgtccacagaaggaggatttggtggtactagcagcagtgatgcccagcaaagcctacag
tcgttctggcctcgggtcatggaagaaatccggaatttaacagtgaaagacttccgagtg
caggaactcccactggctcgtattaagaagattatgaaactggatgaagatgtgaagatg
atcagtgcagaagcgcctgtactctttgccaaggcagcccagatttttatcacagagttg
actcttcgagcctggattcacacagaagataacaagcgccggactctacagagaaatgat
atcgccatggcaattacaaaatttgatcagtttgattttctcatcgatattgttccaaga
gatgaactgaaacctccaaagcgtcaggaggaggtgcgccagtctgtaactcctgccgag
ccagtccagtactatttcacgctggctcagcaacccaccgctgtccaagtccagggccag
cagcaaggccagcagaccaccagctccacgaccaccatccagcctgggcagatcatcatc
gcacagcctcagcagggccagaccacacctgtgacaatgcaggttggagaaggtcagcag
gtgcagattgtccaggctcagccacagggtcaagcccaacaggcccagagtggcactgga
cagaccatgcaggtgatgcagcagatcatcactaacacaggagagatccagcagatcccg
gtgcagctgaatgccggccagctgcagtatatccgcttagcccagcctgtatcaggcact
caagttgtgcagggacagatccagacacttgccaccaatgctcaacagattacacagaca
gaggtccagcaaggacagcagcagttcagccagttcacagatggacagcagctctaccag
atccagcaagtcaccatgcctgcgggccaggacctcgcccagcccatgttcatccagtca
gccaaccagccctccgacgggcaggccccccagactgagctgtgtcctgcctggttgagg
gtcaagaagggggcatctctcatggagaaagctggctccgtcaaagcctggggtaaatcc
ttccagtcctgcctggcacaggctatcattcagctacctgctgtggccatgcccattaac
aaaggcatggcatctgccatagagaacaacccaggccccttgtcagccaagagttctaca
gccaagacctacttcctgttaattgaagcagggagaatagagagagcattccacttcatt
ggccacagaggccagtga

>gi568815597f:40638844_40870825|GENSCAN_predicted_peptide_4|931_aa
MAEAPPRRLGLGPPPGDAPRAELVALTAVQSEQGEAGGGGSPRRLGLLGSPLPPGAPLPG
PGSGSGSACGQRSSAAHKRYRRLQNWVYNVLERPRGWAFVYHVFISLAIPPSLRRGPLFL
LHRWGSIGSSSNFATSSLDCLSCSCDLSHHLYPDVPQICISSRPVTSHLALLHPSTYRKA
TSFSPHALGPLPIHWEVGNPGSNLSSVLGTLGDKGFLLVFSCLVLSVLSTIQEHQELANE
CLLILEFVMIVVFGLEYIVRVWSAGCCCRYRGWQGRFRFARKPFCVIDFIVFVASVAVIA
AGTQGNIFATSALRSMRFLQILRMVRMDRRGGTWKLLGSVVYAHSKELITAWYIGFLVLI
FASFLVYLAEKDANSDFSSYADSLWWGTITLTTIGYGDKTPHTWLGRVLAAGFALLGISF
FALPAGILGSGFALKVQEQHRQKHFEKRRMPAANLIQRKLGKPMDAEGLAAWRLYSTDMS
RAYLTATWYYYDSILPSFRELALLFEHVQRARNGGLRPLEVRRAPVPDGAPSRYPPVATC
HRPGSTSFCPGESSRMGIKDRIRMGSSQRRTGPSKQHLAPPTMPTSPSSEQVGEATSPTK
VQKSWSFNDRTRFRASLRLKPRTSAEDAPSEEVAEEKSYQCELTVDDIMPAVKTVIRSIR
ILKFLVAKRKFKETLRPYDVKDVIEQYSAGHLDMLGRIKSLQTRVDQIVGRGPGDRKARE
KGDKGPSDAEVVDEISMMGRVVKVEKQVQSIEHKLDLLLGFYSRCLRSGTSASLGAVQVP
LFDPDITSDYHSPVDHEDISVSAQTLSISRSEREMPGRTEGSSSGRPAASGPPSALPTPS
RPYVAHLAGAQPREWERALGPWALTQLPAMQESHPSTHSETAVERARRMGLPYDQGDMGR
SPPSFHDRGSAINSVLGYAMSIVAIFMSMCN

>gi568815597f:40638844_40870825|GENSCAN_predicted_CDS_4|2796_bp
atggccgaggcccccccgcgccgcctcggcctgggtcccccgcccggggacgccccccgc
gcggagctagtggcgctcacggccgtgcagagcgaacagggcgaggcgggcgggggcggc
tccccgcgccgcctcggcctcctgggcagccccctgccgccgggcgcgcccctccctggg
ccgggctccggctcgggctccgcctgcggccagcgctcctcggccgcgcacaagcgctac
cgccgcctgcagaactgggtctacaacgtgctggagcggccccgcggctgggccttcgtc
taccacgtcttcatctctctggctatccctcccagtctgcgtagagggcctctgttcctg
ctgcaccgctggggctccatcgggtcctcctctaactttgctacaagctctctggactgt
ctcagctgctcctgtgacctcagtcaccacctctaccctgatgtcccccagatctgcatt
tccagccggcctgtgacctctcacctcgctctccttcacccatccacataccgcaaggcc
accagtttctctcctcatgcccttggtcctcttcccatccactgggaggttgggaaccct
ggctctaatctcagctctgtcctgggcacactgggcgacaaaggatttttgctggtcttc
agctgcctggtgctgtctgtgctgtccactatccaggagcaccaggaacttgccaacgag
tgtctcctcatcttggaattcgtgatgatcgtggttttcggcttggagtacatcgtccgg
gtctggtccgccggatgctgctgccgctaccgaggatggcagggtcgcttccgctttgcc
agaaagcccttctgtgtcatcgacttcatcgtgttcgtggcctcggtggccgtcatcgcc
gcgggtacccagggcaacatcttcgccacgtccgcgctgcgcagcatgcgcttcctgcag
atcctgcgcatggtgcgcatggaccgccgcggcggcacctggaagctgctgggctcagtg
gtctacgcgcatagcaaggagctgatcaccgcctggtacatcgggttcctggtgctcatc
ttcgcctccttcctggtctacctggctgagaaggacgccaactccgacttctcctcctac
gccgactcgctctggtgggggacgattacattgacaaccatcggctatggtgacaagaca
ccgcacacatggctgggcagggtcctggctgctggcttcgccttactgggcatctctttc
tttgccctgcctgccggcatcctaggctccggctttgccctgaaggtccaggagcagcac
cggcagaagcacttcgagaagcggaggatgccggcagccaacctcatccagaggaagctc
ggcaagcccatggatgctgaggggttggctgcctggcgcctgtactccaccgatatgagc
cgggcctacctgacagccacctggtactactatgacagtatcctcccatccttcagagag
ctggccctcttgtttgagcacgtgcaacgggcccgcaatgggggcctacggcccctggag
gtgcggcgggcgccggtacccgacggagcaccctcccgttacccgcccgttgccacctgc
caccggccgggcagcacctccttctgccctggggaaagcagccggatgggcatcaaagac
cgcatccgcatgggcagctcccagcggcggacgggtccttccaagcagcatctggcacct
ccaacaatgcccacctccccaagcagcgagcaggtgggtgaggccaccagccccaccaag
gtgcaaaagagctggagcttcaatgaccgcacccgcttccgggcatctctgagactcaaa
ccccgcacctctgctgaggatgccccctcagaggaagtagcagaggagaagagctaccag
tgtgagctcacggtggacgacatcatgcctgctgtgaagacagtcatccgctccatcagg
attctcaagttcctggtggccaaaaggaaattcaaggagacactgcgaccgtacgacgtg
aaggacgtcattgagcagtactcagcaggccacctggacatgctgggccggatcaagagc
ctgcaaactcgggtggaccaaattgtgggtcgggggcccggggacaggaaggcccgggag
aagggcgacaaggggccctccgacgcggaggtggtggatgaaatcagcatgatgggacgc
gtggtcaaggtggagaagcaggtgcagtccatcgagcacaagctggacctgctgttgggc
ttctattcgcgctgcctgcgctctggcacctcggccagcctgggcgccgtgcaagtgccg
ctgttcgaccccgacatcacctccgactaccacagccctgtggaccacgaggacatctcc
gtctccgcacagacgctcagcatctcccgctcggagcgtgagatgccaggtcgcacagag
ggcagcagcagcggccgtcccgcggcctctgggccccccagtgccctgcccactccatca
aggccctatgtggcccacctggcaggggcacagccccgggagtgggagcgggcgctgggg
ccctgggccctgacccagcttccagctatgcaagagagccacccctccacccactcagag
acagctgtggagagggccaggagaatgggattaccctatgaccaaggagacatgggaaga
agccctccttccttccacgatcgaggttccgccatcaactcggttctcggatatgcaatg
tcaattgttgccatctttatgtccatgtgtaactaa

>gi568815597f:40638844_40870825|GENSCAN_predicted_peptide_5|318_aa
VFAQILPFSYADIVGAGVHAGSRPAAMADHLMLAEGYRLVQRPPSAAAAHGPHALRTLPP
YAGPGLDSGLRPRGAPLGPPPPRQPGALAYGAFGPPSSFQPFPAVPPPAAGIAHLQPVAT
PYPGRAAAPPNAPGGPPGPQPAPSAAAPPPPAHALGGMDAELIDEEALTSLELELGLHRV
RELPELFLGQSEFDCFSDLGSAPPAGSSLDVSHPEKWCDCGQGSCPQLSSSLEELPGAAI
TIIIVIVIIIVIVIIIIIIVIVIIIVITPDVLNLTKAPSVLRFKGVGHRELATDVFLVLQ
ELKSHQGKEQDCGVQERI

>gi568815597f:40638844_40870825|GENSCAN_predicted_CDS_5|957_bp
gtttttgctcaaatattacctttctcttatgctgatattgttggggctggagtgcatgca
ggcagccggcctgccgccatggccgaccacctgatgctcgccgagggctaccgcctggtg
cagaggccgccgtccgccgcggccgcccatggccctcatgcgctccggactctgccgccg
tacgcgggcccgggcctggacagtgggctgaggccgcggggggctccgctggggccgccg
ccgccccgccaacccggggccctggcgtacggggccttcgggccgccgtcctccttccag
ccctttccggccgtgcctccgccggccgcgggcatcgcgcacctgcagcctgtggcgacg
ccgtaccccggccgcgcggccgcgccccccaacgctccgggaggccccccgggcccgcag
ccggcgccaagcgccgcagccccgccgccgcccgcgcacgccctgggcggcatggacgcc
gaactcatcgacgaggaggcgctgacgtcgctggagctggagctcgggctgcaccgcgtg
cgcgagctgcccgagctcttcctgggccagagcgagttcgactgcttctcggacttgggg
tccgcgccgcccgccggctcctcactggatgtgagccacccagagaagtggtgtgactgt
gggcaaggcagctgtccccagctaagctcatccctggaggagctgccaggcgctgccatc
actatcataattgtcattgtcatcatcatcgtcattgtcatcatcattatcatcatcgtc
attgtcatcatcatcgtcatcactccagatgtgctgaacctaactaaagcaccttcagtg
ttgaggtttaaaggtgtgggccaccgtgaactggccacagatgtgtttcttgtcctccag
gagctgaagtcacaccaggggaaggagcaagattgtggggtccaggagaggatttag