GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:08:35 Sequence gi568815597f:52656457_52921626 : 265170 bp : 43.82% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 11013 11123 111 2 0 -48 92 201 0.836 7.48 1.02 Term + 11242 11307 66 0 0 74 44 64 0.630 -1.46 1.03 PlyA + 12642 12647 6 1.05 2.04 PlyA - 13593 13588 6 1.05 2.03 Term - 31712 31264 449 0 2 122 35 369 0.840 30.38 2.02 Intr - 36411 36271 141 1 0 98 116 220 0.989 26.12 2.01 Init - 41870 41765 106 2 1 102 101 196 0.764 22.68 2.00 Prom - 57095 57056 40 -3.86 3.03 PlyA - 58953 58948 6 1.05 3.02 Term - 70832 70727 106 1 1 67 49 166 0.800 8.48 3.01 Init - 100156 100059 98 1 2 77 113 43 0.681 5.48 3.00 Prom - 104627 104588 40 -4.96 4.00 Prom + 108011 108050 40 -4.96 4.01 Init + 109234 109282 49 0 1 65 56 30 0.231 -3.38 4.02 Intr + 114564 115318 755 1 2 116 72 551 0.028 47.48 4.03 Intr + 123397 123537 141 0 0 62 116 73 0.218 8.05 4.04 Intr + 128421 128597 177 2 0 55 115 64 0.135 5.92 4.05 Intr + 140278 140328 51 0 0 91 110 32 0.961 4.80 4.06 Intr + 145363 145524 162 0 0 98 50 33 0.585 0.67 4.07 Intr + 145636 145683 48 0 0 98 91 5 0.593 0.68 4.08 Intr + 157080 157221 142 2 1 121 51 55 0.596 5.03 4.09 Intr + 186518 186685 168 0 0 45 92 81 0.005 4.12 4.10 Intr + 200542 200740 199 2 1 53 13 145 0.080 1.91 4.11 Intr + 201086 201293 208 2 1 78 9 137 0.168 3.88 4.12 Intr + 204275 204391 117 1 0 87 76 18 0.166 1.16 4.13 Intr + 211271 211321 51 1 0 74 100 27 0.253 1.60 4.14 Intr + 221226 221387 162 2 0 120 59 66 0.783 7.07 4.15 Intr + 225015 225129 115 2 1 60 69 98 0.408 5.12 4.16 Term + 236326 236501 176 2 2 67 46 67 0.055 -1.68 4.17 PlyA + 238091 238096 6 1.05 5.11 PlyA - 238517 238512 6 1.05 5.10 Term - 240141 240064 78 0 0 99 46 82 0.986 2.86 5.09 Intr - 241028 240981 48 2 0 76 91 44 0.656 2.38 5.08 Intr - 242768 242718 51 2 0 118 78 87 0.997 9.80 5.07 Intr - 248377 248190 188 2 2 86 64 246 0.996 21.41 5.06 Intr - 250155 250063 93 1 0 85 84 62 0.825 5.44 5.05 Intr - 251498 251412 87 0 0 82 85 130 0.881 12.04 5.04 Intr - 255197 255110 88 2 1 53 96 79 0.992 4.74 5.03 Intr - 255334 255267 68 2 2 84 94 66 0.957 5.42 5.02 Intr - 260032 260009 24 2 0 131 65 24 0.450 2.40 5.01 Init - 261103 261088 16 1 1 60 93 -5 0.491 -2.44 5.00 Prom - 261474 261435 40 -3.16 6.00 Prom + 261897 261936 40 -11.82 6.01 Init + 263995 264137 143 0 2 64 80 224 0.872 18.81 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 114564 115330 767 1 2 116 42 541 0.920 45.88 S.002 Sngl - 116192 115848 345 2 0 101 37 187 0.912 10.94 S.003 Intr + 139836 139935 100 0 1 131 77 31 0.879 6.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:52656457_52921626|GENSCAN_predicted_peptide_1|58_aa IITITITITITTTTIIIIVIVIIMLLLAHWSLTEPCENPLYHVASISKKQAKNRTYSI >gi568815597f:52656457_52921626|GENSCAN_predicted_CDS_1|177_bp atcatcaccatcaccatcaccatcaccatcaccaccaccaccatcatcatcattgtcatc gtcatcattatgttgcttttagctcactggagcctcacagaaccctgtgagaatccactc tatcacgtggcctctatttcaaagaagcaggccaaaaacaggacatacagcatttga >gi568815597f:52656457_52921626|GENSCAN_predicted_peptide_2|231_aa MAGMVDFQDEEQVKSFLENMEVECNYHCYHEKDPDGCYRLVDYLEGIRKNFDEAAKVLKF NCEENQHSDSCYKLGAYYVTGKGGLTQDLKAAARCFLMACEKPGKKSIAACHNVGLLAHD GQVNEDGQPDLGKARDYYTRACDGGYTSSCFNLSAMFLQGAPGFPKDMDLACKYSMKACD LGHIWACANASRMYKLGDGVDKDEAKAEVLKNRAQQLHKEQQKGVQPLTFG >gi568815597f:52656457_52921626|GENSCAN_predicted_CDS_2|696_bp atggccggcatggtggacttccaggatgaggagcaggtcaagtcctttttggagaacatg gaggtggagtgcaactaccactgctaccacgagaaggacccggacggttgctatcggctg gtggactatttggaagggatccggaagaattttgatgaggctgccaaggtgttgaagttt aactgtgaagagaaccagcacagtgatagctgctacaaactgggggcctactatgtgact ggaaaaggtggtctgacccaggacctgaaagctgccgccaggtgctttttgatggcgtgt gagaagcctggaaagaagtcaatagcagcatgtcacaacgttggcctcctggcacatgat ggacaggttaatgaggatggccagcctgacttgggaaaggccagggactactacacaagg gcctgtgatggtggctatacttccagttgcttcaacctcagtgccatgttcctgcagggt gccccaggctttcccaaggacatggacctggcatgtaaatactccatgaaagcctgtgac ctgggtcatatctgggcctgtgccaatgccagtcgcatgtacaagctgggggatggtgtt gataaggatgaggccaaggccgaggtgctaaaaaatcgagcccagcagctacacaaagaa cagcagaaaggtgtccaacccttaacatttgggtaa >gi568815597f:52656457_52921626|GENSCAN_predicted_peptide_3|67_aa MVRSSRSATSCGNTPGSCRHNVPSCLAEQNFSSSREATVLKLFLLRPLKPVEGNGDSCDG EPSEDIR >gi568815597f:52656457_52921626|GENSCAN_predicted_CDS_3|204_bp atggtccgaagcagtcgatcagccacctcctgtgggaatactccaggttcctgcagacac aatgttccatcttgtctggctgaacagaacttctcaagctcccgggaagcaacagtcctc aaattgttcctgcttagacccctgaagcccgtagaaggaaatggggacagctgcgacggc gaaccgagcgaggacatccgataa >gi568815597f:52656457_52921626|GENSCAN_predicted_peptide_4|906_aa MESCNVAQAGLKLLASGLLNDGTVGIFRGNQMRLKRACIRKAKISAVAFRKAFCHHKLVE LDATGVNADITITDIISGLGSNKWIQQNLQCLVLNSLTLSLEDPYERCFSRLSGLRALSI TNVLFYNEDLAEVASLPRLESLDISNTSITDITALLACKDRLKSLTMHHLKCLKMTTTQI LDVVRELKHLNHLDISDDKQFTSDIALRLLEQKDILPNLVSLDVSGRKHVTDKAVEAFIQ QRPSMQFVGLLATDAGYSEFLTGEGHLKVSGEANETQIAEALKRYSERAFFVREALFHLF SLTHVMEKTKPEILKLVVTGMRNHPMNLPVQLAASACVFNLTKQDLAAGMPVRLLADVTH LLLKAMEHFPNHQQLSTEQTAQLGTELFIVRQLLQIVKQKTNQNSVDTTLKFTLSALWNL TDESPTTCRHFIENQGLELFMRVLESFPTESSIQQKVLGLLNNIAEVQELHSELMWKDFI DHISSLLHSVEVEVSYFAAGIIAHLISRGEHRGAGGDADLGATSSSGERGESPGTLPRHL LGSVARAGLRSRCRAPSRRRRRCGGKLTDRTASIFRGNQMKLKLVNIQKAKISTAAFIKA FCRHKLIELNATAVHADLPVPDIISGLCSNRQLKSDLAFHLLQQKDILPNVVSLDISGGN CITDEAVELFIRLRPAMQFVGLLATDAGSSDFFTTKQGLRVAGGASMSQISEALSRYRNR SCFVKEALHRLFTETFSMELSPEQTAQLEELFMAVKELLAIVKQKTTENLDDVTFLFTLK ALWNLTDGSPAACKHFIENQGLQIFIQVLENNIAEVRELSSKLVTEDVLKHINSLLCSRE MEVSYFAAASKYCKMLVEEEGLQLLCDIQEHSEATPKAQQIAASILDDFRMHFMNYQRPT LCQMPF >gi568815597f:52656457_52921626|GENSCAN_predicted_CDS_4|2721_bp atggagtcttgcaatgttgcccaggctggtctcaaactcctggcctcgggtctattgaat gatggaactgtgggtatttttaggggcaaccagatgcgcttaaagcgagcctgcattcgc aaagcaaagatctctgctgttgctttccggaaagctttctgccaccacaagttagtggaa cttgatgccacaggtgtgaatgctgatatcacgattacagacattatcagtgggcttggc agtaacaaatggatccagcagaatctccagtgcctggtgctgaattcattaactctctcc ctcgaggatccttacgagcgctgcttcagccggctttctggccttcgagctttaagcatc acgaatgttctcttttacaatgaagacctggctgaagttgcctcattgccaagattagag agcttggatatttctaacacctcaatcacagacatcactgctctactggcctgcaaagac cgactcaagtctctaaccatgcaccacttgaaatgtttaaaaatgacaactacccagata ctggatgtagttcgggaactcaaacatctgaatcatcttgatatctcagatgataaacag tttacatcagacatagctcttcgcttactagaacaaaaagacatcctacctaaccttgtt tctctggatgtttctgggagaaagcacgtgacagataaagccgttgaagcctttatacaa caacgtccaagcatgcaatttgtaggtttgctggctactgatgctggttactctgaattc ctcacaggcgaaggacatttgaaggtgtctggggaagccaatgaaactcagattgcagaa gcactgaagcgttacagtgaacgggcattctttgttcgggaagctctatttcatcttttt agtctgactcatgtgatggaaaaaacaaagccagaaattttaaagcttgtggttactggg atgagaaaccaccctatgaatttgccagtgcaactggctgcaagcgcctgtgtatttaac ttaaccaagcaggatcttgctgcagggatgcctgtccgactcctggctgatgtgacccat ttgctgctcaaagccatggaacattttcccaatcaccagcagctttctacagaacaaact gcacagcttggtactgagctcttcattgtcaggcaacttcttcaaatagtgaagcagaaa accaatcaaaattcagtggacactacattgaaatttactttgagtgcactttggaacctc acagatgaatctccaaccacttgtagacactttattgaaaaccaagggttagaactcttc atgagggttctagagtctttcccaactgagtcatccattcagcagaaagttctaggactt ttgaacaatatagctgaagtacaagaattacattctgaattaatgtggaaagattttata gaccacatcagtagtctcctacacagtgtggaagtggaagtcagttactttgcagctgga attattgcccatttaatatccagaggtgagcaccggggtgcgggcggcgacgcggacctc ggcgccacgtccagctccggcgagcgcggcgagtctcctgggacgctgccgaggcacttg ctggggagtgtggcccgcgcggggctgcggtctagatgccgagccccttccaggcgcagg cgtcgctgcggaggcaagctgactgacagaacagccagcattttccgaggcaaccaaatg aaactgaagctggtcaatatccaaaaagctaaaatctctacagctgcattcataaaagcc ttctgccgtcataagctcattgaactgaatgctactgcagtgcacgctgacctcccagtt ccagacatcataagtggactctgcagcaataggcaactcaaatcagacctagcttttcat ttgctacagcagaaggatatcctgcccaatgttgtgtcattggatatttctgggggcaat tgcatcactgatgaagctgtagaactgtttatacgactgcggcctgccatgcaatttgtg ggactattggccacggatgctggctcttctgacttctttactacaaagcaaggcttgagg gttgctggaggagccagtatgagtcagatttcagaagcactgagccgatacaggaacaga tcatgttttgtgaaggaagccctccacaggctgttcacagagacattttcaatggagctc tcacctgagcaaacggcacagcttgaagagcttttcatggcagttaaggaacttctagca atagtaaaacaaaagactactgagaatttagatgatgtcaccttcttgtttactttgaaa gcactttggaatcttacagatgggtctccagctgcctgcaagcacttcattgaaaatcaa ggattgcaaatcttcatccaagtcttggagaacaacatagcagaagtcagagagctctct tccaagctggtgaccgaagatgtgctgaagcatatcaacagtttactctgtagcagggaa atggaagtcagctattttgctgcagccagcaaatactgcaaaatgttagttgaagaagaa ggattgcagcttttgtgtgatatccaggagcacagtgaggcaacccccaaagcacagcag attgcagcctccattctggatgacttcagaatgcatttcatgaattatcagaggcccact ctgtgtcaaatgcccttctga >gi568815597f:52656457_52921626|GENSCAN_predicted_peptide_5|246_aa MGSFEDQEIEGQKGITEILMNRPSARNALGNVFVSELLETLAQLREDRQVRVLLFRSGVK GVFCAGADLKEREQMSEAEVGVFVQRLRGLMNDIAAFPAPTIAAMDGFALGGGLELALAC DLRVAGGTQRLPRCLGVALAKELIFTGRRLSGTEAHVLGLVNHAVAQNEEGDAAYQRARA LAQEILPQAPIAVRLGKVAIDRGTEVDIASGMAIEGMCYAQNIPTRDRLEGMAAFREKRT PKFVGK >gi568815597f:52656457_52921626|GENSCAN_predicted_CDS_5|741_bp atgggctcatttgaagatcaggaaattgaaggccagaaggggatcactgagattctgatg aacagaccttctgcccgcaatgccttggggaatgtcttcgtcagtgagctgctggaaact ctggcccagctgcgggaggaccggcaagtgcgtgtcctgctcttcagaagtggagtgaag ggcgtgttctgtgcaggtgcagacctgaaggagcgggaacagatgagtgaagcagaggtg ggggtgtttgtccagcgactccggggcctgatgaatgacatcgcagccttccctgcaccc accattgcggctatggatgggtttgccttgggcggaggcctagagcttgccctggcctgt gacctccgagtggcaggagggactcagaggctgccccgttgtctgggggtggccctggcg aaggagctcatcttcacgggccgacgactgagtggaactgaggcccacgtactggggctg gtgaatcacgctgtggcccagaacgaggagggggacgccgcctaccagcgggcacgagca ctggcccaggagatcctgccccaggcccccattgccgtgcggctgggcaaagtagccatt gaccgaggaacggaggtggacattgcatctgggatggccattgaagggatgtgctatgcc cagaatattccaacccgggaccggctagagggcatggcagccttcagggagaagcggact cccaaatttgttggcaaatga >gi568815597f:52656457_52921626|GENSCAN_predicted_peptide_6|48_aa MQAKEMDEEDKAFKQKQKEEQKKLEELKWKATGKGPLATSGIKKSGKN >gi568815597f:52656457_52921626|GENSCAN_predicted_CDS_6|144_bp atgcaggccaaggagatggacgaggaagataaggctttcaagcagaaacaaaaagaggag cagaagaaactcgaggagctaaaatggaaggccacggggaaggggcccttggccacaagt ggaattaagaaatctggcaaaaan