GENSCAN 1.0	Date run: 6-Nov-116	Time: 00:43:47

Sequence gi568815582r:58608174_58834228 : 226055 bp : 44.92% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -   1466   1461    6                               1.05
 1.05 Term -  21131  20845  287  0  2   74   40   142 0.343   3.57
 1.04 Intr -  22376  22331   46  2  1   92   32    25 0.029  -4.72
 1.03 Intr -  32953  32804  150  1  0   83  106   188 0.952  20.46
 1.02 Intr -  36819  36739   81  0  0   69  101    97 0.991   8.93
 1.01 Init -  36929  36876   54  2  0  109   90    31 0.982   6.68
 1.00 Prom -  38856  38817   40                              -4.46

 2.11 PlyA -  39100  39095    6                               1.05
 2.10 Term -  59314  59212  103  0  1  119   41   171 0.999  13.25
 2.09 Intr -  61994  61940   55  0  1  112  103    26 0.999   4.54
 2.08 Intr -  63071  62872  200  1  2   66   80   416 0.999  37.59
 2.07 Intr -  64070  63923  148  0  1  120   89   269 0.992  29.49
 2.06 Intr -  67881  67767  115  0  1   71  109   149 0.998  15.32
 2.05 Intr -  68173  68116   58  0  1   85   50    64 0.825   1.19
 2.04 Intr -  69251  69153   99  1  0   72   94   189 0.999  17.13
 2.03 Intr -  70301  70160  142  0  1  105   36   148 0.920  10.71
 2.02 Intr -  70721  70523  199  2  1  100   44   277 0.996  23.42
 2.01 Init -  71953  71684  270  1  0  108  101   360 0.962  36.27
 2.00 Prom -  93059  93020   40                              -3.56

 3.00 Prom +  96550  96589   40                              -2.06
 3.01 Init +  96857  96871   15  1  0   66   80     6 0.419  -1.95
 3.02 Intr +  97162  97245   84  0  0   30   77    73 0.255   0.42
 3.03 Term +  98706  98861  156  2  0  107   43    96 0.471   4.93
 3.04 PlyA +  98932  98937    6                              -0.45

 4.11 PlyA -  98979  98974    6                               1.05
 4.10 Term - 100120  99998  123  1  0   81   35   236 0.935  15.98
 4.09 Intr - 101394 101244  151  2  1   50  109   210 0.713  19.46
 4.08 Intr - 108006 107841  166  1  1   57   58   201 0.837  13.02
 4.07 Intr - 108640 108490  151  1  1   97  103   156 0.999  17.74
 4.06 Intr - 110127 110023  105  0  0   86  100    34 0.964   4.81
 4.05 Intr - 110515 110354  162  1  0   89   89   103 0.778  10.67
 4.04 Intr - 111082 111023   60  1  0   70  105    43 0.844   3.13
 4.03 Intr - 114105 113977  129  0  0   57  102   135 0.999  12.69
 4.02 Intr - 115729 115573  157  0  1   72  105   169 0.879  17.01
 4.01 Init - 126055 125967   89  1  2  109  115   165 0.999  19.43
 4.00 Prom - 133172 133133   40                              -5.06

 5.00 Prom + 133388 133427   40                              -4.46
 5.01 Init + 140518 140660  143  0  2   78   48   200 0.991  12.61
 5.02 Term + 141027 141183  157  0  1   81   37   117 0.951   3.31
 5.03 PlyA + 141468 141473    6                               1.05

 6.02 PlyA - 142551 142546    6                               1.05
 6.01 Sngl - 145974 145552  423  0  0   91   48    96 0.473   2.39
 6.00 Prom - 151081 151042   40                              -2.56

 7.00 Prom + 164129 164168   40                              -3.96
 7.01 Sngl + 169071 169382  312  2  0   61   44   335 0.998  22.33
 7.02 PlyA + 170471 170476    6                               1.05

 8.06 PlyA - 170567 170562    6                               1.05
 8.05 Term - 174164 173505  660  2  0  -92   42   344 0.260   5.71
 8.04 Intr - 174660 174286  375  0  0  -11   41   274 0.182   8.01
 8.03 Intr - 182189 181878  312  2  0    1   64   160 0.106   1.28
 8.02 Intr - 182734 182551  184  0  1   61   58   104 0.109   4.49
 8.01 Init - 182928 182819  110  0  2   82    5   152 0.512   4.17
 8.00 Prom - 214011 213972   40                              -2.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582r:58608174_58834228|GENSCAN_predicted_peptide_1|205_aa
MQANLEMAGNGVTSMGMEPLAIPHIYCCSEGTCNFSNTENHCLRAVLEAYGMIQGIGLNI
YHRYSPWWEARGYQGHYTADMSNLFRRYQFNVATWGAATNEDALLLRWERVGIGFRGLGL
LAGGTVFGVFACFQFFSVYSELPLSPSYAPSTLSKPPRFRVSPDGKGGREQALPYYLLQV
LFHSRAWALTLSLKLAPTYSVRCLQ
>gi568815582r:58608174_58834228|GENSCAN_predicted_CDS_1|618_bp
atgcaggccaatttagaaatggcaggaaatggggtgacttccatgggaatggagcctctg
gccattcctcacatctactgctgctctgagggcacctgcaacttcagcaacacagagaac
cactgcctgagagcggtcctagaagcttatggcatgattcagggcattggcctcaacatt
taccaccgctactcaccctggtgggaggcacgtggctaccaggggcattacactgctgac
atgagcaaccttttccgaagataccagttcaacgtggccacctggggcgcagcgacgaac
gaagacgcattacttctgcgatgggaacgggtgggcatcggcttccgaggcctagggctt
ctggcgggggggaccgtgtttggagtttttgcatgtttccagttcttctccgtgtactcg
gagctgccactctccccgtcttacgcccccagcacgttgtcaaagccccccaggtttcgg
gtgtccccggatgggaagggcgggcgggagcaagcacttccttattacttgctacaggtt
ttgttccactcgcgcgcctgggcgctcacactttctttgaagttggcccccacctactcg
gtccggtgtctgcagtag
>gi568815582r:58608174_58834228|GENSCAN_predicted_peptide_2|462_aa
MAQVSINNDYSEWDLSTDAGERARLLQSPCVDTAPKSEWEASPGGLDRGTTSTLGAIFIV
VNACLGAGLLNFPAAFSTAGGVAAGIALQMGMLVFIISGLVILAYCSQASNERTYQEVVW
AVCGKLTGVLCEVAIAVYTFGTCIAFLIIIGDQQDKIIAVMAKEPEGASGPWYTDRKFTI
SLTAFLFILPLSIPREIGFQKYASFLSVVGTWYVTAIVIIKYIWPDKEMTPGNILTRPAS
WMAVFNAMPTICFGFQCHVSSVPVFNSMQQPEVKTWGGVVTAAMVIALAVYMGTGICGFL
TFGAAVDPDVLLSYPSEDMAVAVARAFIILSVLTSYPILHFCGRAVVEGLWLRYQGVPVE
EDVGRERRRRVLQTLVWFLLTLLLALFIPDIGKVISVIGGLAACFIFVFPGLCLIQAKLS
EMEEVKPASWWVLVSYGVLLVTLGAFIFGQTTANAIFVDLLA
>gi568815582r:58608174_58834228|GENSCAN_predicted_CDS_2|1389_bp
atggcccaggtcagcatcaacaatgactacagcgagtgggacttgagcacggatgccggg
gagcgggctcggctgctgcagagtccctgtgtggacacagcccccaagagtgagtgggaa
gcctctcctgggggtctggacagaggcaccacttccacacttggggccatcttcatcgtc
gtcaacgcgtgcctgggtgcagggttactcaacttcccagcagccttcagcactgcgggg
ggcgtggcagcaggcatcgcactgcagatgggtatgctggttttcatcatcagtggcctt
gtcatcctggcctactgctcccaggccagcaatgagaggacctaccaggaggtggtatgg
gctgtgtgtggcaagctgacaggtgtgctatgtgaggtggccatcgctgtctacaccttt
ggcacctgcattgccttcctaatcatcattggcgaccagcaggacaagattatagctgtg
atggcgaaagagccggagggggccagcggcccttggtacacagaccgcaagttcaccatc
agcctcactgccttcctcttcatcctgcccctctccatccccagggagattggtttccag
aaatatgccagcttcctgagcgtcgtgggtacctggtacgtcacagccatcgttatcatc
aagtacatctggccagataaagagatgaccccagggaacatcctgaccaggccggcttcc
tggatggctgtgttcaatgccatgcccaccatctgcttcggatttcagtgccacgtcagc
agtgtgcccgtcttcaacagcatgcagcagcctgaagtgaagacctggggtggagtggtg
acagctgccatggtcatagccctcgctgtctacatggggacaggcatctgtggcttcctg
acctttggagctgctgtggatcctgacgtgctcctgtcctatccctcggaggacatggcc
gtggccgttgcccgagccttcatcatcctgagcgtgctcacctcctaccctatcctgcac
ttctgtgggcgggcggtggtggaaggcctgtggctgcgctaccagggggtgccagtggag
gaggacgtggggcgggagcggcggcggcgagtgctgcagacgctggtctggttcctgctc
accctgctgctggcgctcttcatccctgacatcggcaaggtgatctcagtcattggaggc
ctggccgcctgcttcatcttcgtcttcccagggctgtgcctcattcaagccaaactctct
gagatggaagaggtcaaaccagccagctggtgggtgctggtcagctacggagtcctcttg
gtcaccctgggagccttcatcttcggccagaccacagccaacgccatctttgtggatctc
ttggcataa
>gi568815582r:58608174_58834228|GENSCAN_predicted_peptide_3|84_aa
MADLWKTIKLQTPHDEWDLVQCKVVVPMFKNNKDPSWDSTCSPQGPRDYTHILSIFGLDK
LNGNMKKTTNLQPIFGIVDTSTGL
>gi568815582r:58608174_58834228|GENSCAN_predicted_CDS_3|255_bp
atggctgatctatggaaaaccatcaagctgcagacaccacatgatgaatgggacttagtc
cagtgtaaggtggtagtacccatgtttaaaaataacaaggatccttcatgggactccact
tgttccccacaaggtccaagggattacacccacattctcagcatcttcggactggacaaa
ctgaatggcaacatgaagaaaacgacgaatcttcagccaatatttggaatagtggataca
tctactggcttatga
>gi568815582r:58608174_58834228|GENSCAN_predicted_peptide_4|430_aa
MALLHSGRVLPGIAAAFHPGLAAAASARASSWWTHVEMGPPDPILGVTEAFKRDTNSKKM
NLGVGAYRDDNGKPYVLPSVRKAEAQIAAKNLDKEYLPIGGLAEFCKASAELALGENSEV
LKSGRFVTVQTISGTGALRIGASFLQRFFKFSRDVFLPKPTWGNHTPIFRDAGMQLQGYR
YYDPKTCGFDFTGAVEDISKIPEQSVLLLHACAHNPTGVDPRPEQWKEIATVVKKRNLFA
FFDMAYQGFASGDGDKDAWAVRHFIEQGINVCLCQSYAKNMGLYGERVGAFTMVCKDADE
AKRVESQLKILIRPMYSNPPLNGARIAAAILNTPDLRKQWLQEVKVMADRIIGMRTQLVS
NLKKEGSTHNWQHITDQIGMFCFTGLKPEQVERLIKEFSIYMTKDGRISVAGVTSSNVGY
LAHAIHQVTK
>gi568815582r:58608174_58834228|GENSCAN_predicted_CDS_4|1293_bp
atggccctgctgcactccggccgcgtcctccccgggatcgccgccgccttccacccgggc
ctcgccgccgcggcctctgccagagccagctcctggtggacccatgtggaaatgggacct
ccagatcccattctgggagtcactgaagcctttaagagggacaccaatagcaaaaagatg
aatctgggagttggtgcctaccgggatgataatggaaagccttacgttctgcctagcgtc
cgcaaggcagaggcccagattgccgcaaaaaatttggacaaggaatacctgcccattggg
ggactggctgaattttgcaaggcatctgcagaactagccctgggtgagaacagcgaagtc
ttgaagagtggccggtttgtcactgtgcagaccatttctggaactggagccttaaggatc
ggagccagttttctgcaaagattttttaagttcagccgagatgtctttctgcccaaacca
acctggggaaaccacacacccatcttcagggatgctggcatgcagctacaaggttatcgg
tattatgaccccaagacttgcggttttgacttcacaggcgctgtggaggatatttcaaaa
ataccagagcagagtgttcttcttctgcatgcctgcgcccacaatcccacgggagtggac
ccgcgtccggaacagtggaaggaaatagcaacagtggtgaagaaaaggaatctctttgcg
ttctttgacatggcctaccaaggctttgccagtggtgatggtgataaggatgcctgggct
gtgcgccacttcatcgaacagggcattaatgtttgcctctgccaatcatatgccaagaac
atgggcttatatggtgagcgtgtaggagccttcactatggtctgcaaagatgcggatgaa
gccaaaagggtagagtcacagttgaagatcttgatccgtcccatgtattccaaccctccc
ctcaatggggcccggattgctgctgccattctgaacaccccagatttgcgaaaacaatgg
ctgcaagaagtgaaagtcatggctgaccgcatcattggcatgcggactcaactggtctcc
aacctcaagaaggagggttccacccacaattggcaacacatcaccgaccaaattggcatg
ttctgtttcacagggctaaagcctgaacaggtggagcggctgatcaaggagttctccatc
tacatgacaaaagatggccgcatctctgtggcaggggtcacctccagcaacgtgggctac
cttgcccatgccattcaccaggtcaccaagtaa
>gi568815582r:58608174_58834228|GENSCAN_predicted_peptide_5|99_aa
MLAALAHPWCLLGLAPTLATLEEPFSPPLHCGSPFLGWTDAGAGSLSLAPGPINRPRAEG
CRHTVGDWQAAPPAAVVRDPLDEASWAPESSGHLENLYV
>gi568815582r:58608174_58834228|GENSCAN_predicted_CDS_5|300_bp
atgctggcagccctcgctcacccttggtgcctcctcggcctcgcgcccactctggccacg
cttgaggagcccttcagcccgccgctgcactgtgggagccccttcctgggatggaccgac
gccggagccggctccctcagcctggcgcctggtcccatcaaccgcccaagggctgagggg
tgccggcacacggtgggggactggcaggcagctccacctgcggccgtggtgcgggatcca
ctggatgaagccagctgggctccggagtctagtgggcatttggagaacctttatgtctag
>gi568815582r:58608174_58834228|GENSCAN_predicted_peptide_6|140_aa
MTLHSLLAESSCPDGKQGQGRASPSCWPEQDDRTDRAINTPPPVRLWTARPKELLDCCNT
PSGASGSQAPLLGHHCVPLSVTHLVQLQDPHGACSCASTWNSQPDPTLAGSQIHFCQELS
KQLQQPWDCARVQARCGLVG
>gi568815582r:58608174_58834228|GENSCAN_predicted_CDS_6|423_bp
atgaccctccactctctgctggcggagagcagctgccccgacgggaagcaggggcagggc
agagccagcccgagctgctggccggagcaggacgacaggactgacagagctattaacacg
ccaccacctgtcaggctgtggacggccagaccaaaagagctattagattgctgtaacacc
ccctctggggcttccgggtctcaggcacccctgcttgggcaccactgtgttcccctcagt
gtgacacacctggtccagctgcaagacccacatggagcctgctcctgtgccagcacttgg
aacagccagccagatcctacacttgctggctcacagatccacttctgccaggagctgagc
aagcagttgcagcagccatgggattgtgccagagtgcaagccaggtgtggcctggtgggc
tga
>gi568815582r:58608174_58834228|GENSCAN_predicted_peptide_7|103_aa
MEITEDLCQYFAETGRHREEQGWRQQLDAECLDSYVNADHDLCYNTHWWVEPPTERPSKR
CQAEMKRLYGNSTAKLQAMEASVQLSFDKHCDQKQPSSGWSCL
>gi568815582r:58608174_58834228|GENSCAN_predicted_CDS_7|312_bp
atggaaatcactgaggatctctgccagtactttgcagaaaccgggaggcacagagaagaa
caagggtggcggcagcagctggatgcagaatgcttggacagctatgtgaatgccgaccac
gacctttgctacaacacccactggtgggtggagcccccgactgagcggcccagcaagcgg
tgccaggctgagatgaagcgcttgtatgggaacagcactgccaagctccaggccatggag
gcttcggtgcagctgagctttgacaagcactgtgaccaaaagcagcccagttctggctgg
tcatgcctgtga
>gi568815582r:58608174_58834228|GENSCAN_predicted_peptide_8|546_aa
MGLVGSTLGTAGSAAGPASPAAAGNEGLSTQASGCGGAQDLQPAMPKPPTPSMGSCAAGA
SQMSAAPCSTAPRPIDHQRAEECGRTAQDWQATPPAAPGCSFTPEANKTTSPPGGKNNSR
RTTLRAVTLTAKVCSFTPEPARPRTHQKEETPNTSEHQKEQTPDTLPLRTVTLTARVRGF
ILEVSETKNPPIPDTPALREAEELVLEKINKIDRLLARLIKKKREKNQKDAIKNDKGDIT
NDPTDIQTTIREYYKHLYENKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIEAT
INSLPTKKCPGPDGFTAEFYQRYKEELNFGPIPLMNINAKILNKIPANQIQQHIKKVIRH
DQVGFIPGMQGWFNIRKSINVIHHINRTKDKNHTIISIDAEKAFDKIQQPFMLKTLNKLG
IDGTYLKIIRATYDKPTANITVNAQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIMQ
EKEIKGIQLGKEEVKLSLFADDMIVHLENPMVSAQNLLKLISNFSKVSGYKINVKNHKHS
YAPITD
>gi568815582r:58608174_58834228|GENSCAN_predicted_CDS_8|1641_bp
atgggcttggtgggctccacactcggaacagccggcagcgcagccggccctgccagccct
gccgccgcgggcaatgaggggcttagcacccaggccagcggctgcggaggggctcaggac
ctgcagcccgccatgcctaagcctcccaccccctccatgggctcctgtgcagctggagcc
tcccagatgagcgcggccccctgctccacggcacccaggcccatcgaccaccaaagggct
gaggagtgcgggcgcacggcacaggactggcaggccactccacctgcagccccgggctgc
agcttcactcctgaagccaacaagaccacgagcccaccgggaggaaagaacaactccaga
cgcaccaccttaagagctgtaacactcaccgcgaaggtctgcagcttcactcctgagcca
gcaagaccacgaacccaccagaaggaagaaactccaaacacatctgaacatcagaaggaa
caaactccagacacgctgcctttaagaactgtaacactcaccgcaagggtccgtggcttc
attctcgaagtcagtgagaccaagaacccaccgattccggacacaccagcacttcgggag
gccgaggagctggtacttgaaaagatcaacaaaattgatagactgctggcaagactaata
aagaagaaaagagagaagaatcaaaaagatgcaataaaaaatgataaaggggatatcacc
aacgatcccacagacatacaaactaccatcagagaatactataaacacctctatgaaaat
aaactagaaaatctagaagaaatggataaattcctggacacatacaccctcccaagacta
aaccaggaagaagttgaatccctgaatagaccaataacaggctctgaaattgaggcaaca
attaatagcctaccaaccaaaaaatgtccaggaccagatggattcacagccgaattctac
cagaggtacaaggaggagctgaattttggaccaatacccctgatgaacatcaatgcaaaa
atcctcaataaaataccggcaaaccaaatccagcagcacatcaaaaaggttatccgccat
gatcaagtgggcttcatccctgggatgcaaggctggttcaacatacgaaaatcaataaac
gtaatccatcatataaacagaaccaaagacaaaaaccacacgattatctcaatagatgca
gaaaaggcctttgacaaaattcaacagcccttcatgctaaaaactctcaataaattaggt
attgatgggacgtatctcaaaataataagagctacttatgacaaacccacagccaatatc
acagtgaatgcgcagaaactggaagcattccctttgaaaactggcacaagacagggatgc
cctctctcaccactcctattcaacatagtgttggaagttctggccagggcaatcatgcag
gagaaagaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgca
gatgacatgattgtgcatttagaaaaccctatggtctcagcccaaaatctccttaagctg
ataagcaacttcagcaaagtctcaggatacaaaatcaatgtgaaaaatcacaagcattct
tatgcaccaataacagactaa