GENSCAN 1.0 Date run: 3-Nov-116 Time: 06:54:00 Sequence gi568815594f:75415453_75627211 : 211759 bp : 38.86% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4726 4929 204 0 0 69 55 154 0.732 8.67 1.02 Term + 7127 7285 159 1 0 116 38 185 0.989 13.26 1.03 PlyA + 7966 7971 6 1.05 2.04 PlyA - 8005 8000 6 1.05 2.03 Term - 8611 8606 6 1 0 98 54 0 0.002 -5.51 2.02 Intr - 15795 15715 81 0 0 63 91 48 0.081 1.52 2.01 Init - 18890 18666 225 2 0 71 106 111 0.434 9.72 2.00 Prom - 27569 27530 40 -3.45 3.02 PlyA - 28330 28325 6 1.05 3.01 Sngl - 30699 28603 2097 0 0 44 42 726 0.519 57.60 3.00 Prom - 30792 30753 40 -6.15 4.02 PlyA - 30961 30956 6 1.05 4.01 Sngl - 32205 31876 330 0 0 88 45 356 0.996 27.07 4.00 Prom - 36918 36879 40 -3.95 5.00 Prom + 42003 42042 40 -2.65 5.01 Init + 42619 42743 125 0 2 89 58 82 0.400 5.00 5.02 Term + 56492 56702 211 2 1 116 41 87 0.275 2.58 5.03 PlyA + 56826 56831 6 1.05 6.06 PlyA - 61166 61161 6 1.05 6.05 Term - 67214 67086 129 2 0 93 49 80 0.251 1.80 6.04 Intr - 75249 75129 121 2 1 63 76 98 0.187 5.68 6.03 Intr - 97725 97650 76 1 1 88 47 83 0.215 1.85 6.02 Intr - 98538 98418 121 0 1 61 89 51 0.718 1.65 6.01 Init - 98834 98703 132 2 0 89 47 194 0.884 15.59 6.00 Prom - 99805 99766 40 -13.11 7.00 Prom + 100118 100157 40 -4.65 7.01 Init + 100568 100575 8 1 2 92 83 10 0.967 0.73 7.02 Intr + 101320 101527 208 1 1 94 72 112 0.967 8.36 7.03 Intr + 106284 106409 126 2 0 79 78 28 0.642 0.86 7.04 Term + 111508 111762 255 0 0 104 44 195 0.998 11.30 7.05 PlyA + 112923 112928 6 1.05 8.04 PlyA - 113079 113074 6 1.05 8.03 Term - 117661 117066 596 2 2 -35 54 275 0.448 5.50 8.02 Intr - 135849 135818 32 2 2 55 103 1 0.031 -4.84 8.01 Init - 136978 136869 110 1 2 70 36 202 0.545 12.94 8.00 Prom - 137604 137565 40 -4.25 9.15 PlyA - 138315 138310 6 1.05 9.14 Term - 138795 138728 68 1 2 71 49 78 0.012 -0.68 9.13 Intr - 140261 140115 147 0 0 57 78 75 0.015 2.69 9.12 Intr - 146759 146640 120 0 0 65 65 83 0.008 3.25 9.11 Intr - 168179 168005 175 2 1 30 30 172 0.012 4.19 9.10 Intr - 176473 176367 107 2 2 63 92 69 0.290 3.81 9.09 Intr - 180888 180791 98 2 2 109 71 32 0.544 2.33 9.08 Intr - 181784 181483 302 2 2 58 80 185 0.922 9.31 9.07 Intr - 182760 182625 136 2 1 63 99 86 0.980 6.85 9.06 Intr - 184917 184829 89 0 2 82 65 54 0.977 0.35 9.05 Intr - 188504 188365 140 0 2 92 84 88 0.982 8.06 9.04 Intr - 190182 190070 113 2 2 82 82 34 0.979 1.30 9.03 Intr - 191909 191731 179 2 2 74 71 106 0.923 5.30 9.02 Intr - 198997 198803 195 1 0 60 101 49 0.755 2.09 9.01 Init - 210536 210369 168 2 0 46 113 167 0.835 14.59 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 21124 21462 339 0 0 74 41 167 0.800 6.38 S.002 Intr + 148662 148800 139 1 1 145 37 103 0.902 9.50 S.003 Init - 168209 168005 205 2 1 89 30 176 0.811 11.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:75415453_75627211|GENSCAN_predicted_peptide_1|120_aa SGPFAAGLLEFAGGPLQSLFALVSPAEAAEQQRLLPVPSSGSFIAERHPLQKLARALLYK VFVNPCWEALNGLDDASPRSTSVRVIVFTQPIDSNTNLFWKHSQTHPEVMFGQPSEHPLA >gi568815594f:75415453_75627211|GENSCAN_predicted_CDS_1|363_bp tcaggcccatttgctgcaggactgctggagtttgctggaggtccactccagtccctgttt gccttggtgtcaccagcagaggctgcagaacagcaaagattgctgcctgttccttcctct ggaagtttcatagcagagaggcacccactccagaagttagccagagctctcctgtataag gtgtttgtcaatccctgctgggaggccctcaacggattggatgatgcctctccacggtcc acatcggtgagggtgattgtctttactcagcctattgattcaaacactaatctcttctgg aaacactctcagactcacccagaagtaatgtttggccagccatcggagcatcctctggct tag >gi568815594f:75415453_75627211|GENSCAN_predicted_peptide_2|103_aa METDMLGFITRLFFSRYDVAVETQSPNPQRTLCSSPSCCIHLRCISLAEEPSEIGVASLL IQISREGIYPWTFSWLKLKGGNYGISARLFPAKQSDSTVGSQA >gi568815594f:75415453_75627211|GENSCAN_predicted_CDS_2|312_bp atggagacagacatgctgggcttcatcacaaggctgtttttctccaggtatgatgttgct gtggaaacacaaagcccaaatcctcaaagaacactctgttctagtcccagctgctgtatc catctgcgttgtatctcattagcagaagagcctagtgagataggagttgcatctctgctc atccagatctctagagaagggatctacccctggactttttcctggcttaagctaaaaggg ggaaattacgggataagtgcaaggctatttcctgccaaacagtcagatagtacagtggga tctcaggcctga >gi568815594f:75415453_75627211|GENSCAN_predicted_peptide_3|698_aa MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSREYTFFSAPHHTYS KIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQIHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRLLARLIK KKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDEFLDTYTLLRLN QEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKE GILPISFYKASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILAKRIQQHIKKLIHHD QVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGI DGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQE KEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFL YTNNRQTESQIMSELTFTIASERIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNI PCSWVGRINIVKMAILPKVIYRFNAIPIKLPMSFFTEL >gi568815594f:75415453_75627211|GENSCAN_predicted_CDS_3|2097_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga actctccaccccaaatcaagagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatactgggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctttgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaatacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaatagagacacaaaaaaccctccaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagacttctagcaagactaataaag aaaaaaagagagaagaatcaaatagatgcaataaaaaatgataaaggggatatcaccact gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaa ctagaaaatctagaagaaatggatgaattcctggacacatacactctcctaagactaaac caggaagaagttgaatctctgaatagaccaataacaggagctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggaattggtaccattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctatctcattttacaaggccagcatcattctgataccaaagccaggcaga gacacaacaaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc ctcaataaaatactggcaaaacgaatccagcagcacatcaaaaagcttatccaccatgat caagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgta atccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaa aaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtatt gatgggacgtatttcaaaataataagagctatctatgacaaacccacagccaatatcata ctgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct ctctcaccactcctattcaacatagtgttggaagttctggccagggcaattaggcaggag aaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagat gacatgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctcata agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattctta tacaccaacaacagacaaacagagagccaaatcatgagtgaactcacattcacaattgct tcagagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag gagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacatt ccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatt tacagattcaatgccatccccatcaagctaccaatgtctttcttcacagaattgtaa >gi568815594f:75415453_75627211|GENSCAN_predicted_peptide_4|109_aa MGKKQNRKTGNSKKQSASPPPKERSSSPATEQSWMENDFDELRGEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRS >gi568815594f:75415453_75627211|GENSCAN_predicted_CDS_4|330_bp atggggaaaaaacagaacagaaaaactggaaactctaaaaagcagagtgcctctcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgactttgac gagctgagaggagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaacttcgaaaaaaatttggaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaatta cgtgaagaatgcagaagcctcaggagctga >gi568815594f:75415453_75627211|GENSCAN_predicted_peptide_5|111_aa MKRSMWLGEIISGREGSQVMGPDHIGPSKEGEREKEKWGEERVDPTLWGGRSSVSDYVGM FNLEQGRTNSVWPWFQATISFPALSSISDSHIYYTIDSELDSKLQKRIIPL >gi568815594f:75415453_75627211|GENSCAN_predicted_CDS_5|336_bp atgaagaggtcgatgtggctgggagaaataataagtgggagagaagggagccaggtaatg gggccagatcatatagggcctagtaaggaaggagagagggagaaagagaagtggggtgaa gaaagggtagatcctaccctgtggggaggaaggtcttcagtcagcgactatgtgggtatg ttcaatttagagcaaggaagaacaaactcagtttggccttggttccaagccacaatctct tttccagctttatcatcaatatctgactctcatatctactacacaattgattcagaactt gattcaaaactccagaagaggattatacccctttaa >gi568815594f:75415453_75627211|GENSCAN_predicted_peptide_6|192_aa MAATAREDGASGQERGQRGCEHYDRGCLLKVTPSMDSSLTSFRQQNPSWEVIRLAQWSLP VGLFVFCCRSRLRQPRACTGVFRSGHGLTNIAPLNWMKLLKNVGHPPSVVGYRCPLCMHS ALDMTRYWRQLDDEVAQTPMPSEYQNMTVDILCNDCNGRSTVQFHILGMKCKICESYNTA QAGGRRISLDQQ >gi568815594f:75415453_75627211|GENSCAN_predicted_CDS_6|579_bp atggcggcgacggcccgggaagatggcgccagcggtcaagagcgaggtcagcggggctgc gagcactatgacagaggatgtctcctaaaggtgacgccttctatggactcttccctgaca agcttccgtcagcaaaacccttcttgggaagtgattaggcttgcacagtggtctctgcca gtgggcttattcgtcttttgctgccgaagtagactgaggcagccccgggcctgtactggg gttttccgttcaggtcatggcttgacgaacattgctccactgaactggatgaagctcctt aaaaatgttggtcatccaccaagtgttgtaggctacagatgtccattatgtatgcactct gctttagatatgaccaggtattggagacagctggatgatgaagtagcacagactcctatg ccatcagaatatcagaacatgactgtggatattctctgcaatgactgtaatggacgatcc actgttcagtttcatatattaggcatgaaatgtaagatttgtgaatcctataatactgct caagctggaggacgtagaatttcactggatcagcaatga >gi568815594f:75415453_75627211|GENSCAN_predicted_peptide_7|198_aa MKEFPTDENIKRKWVLAMKRLDVNAAGIWEPKKGDVLCSRHFKKTDFDRSAPNIKLKPGV IPSIFDSPYHLQGKREKLHCRKNFTLKTVPATNYNHHLVGASSCIEEFQSQFIFEHSYSV MDSPKKLKHKLDHVIGELEDTKESLRNVLDREKRFQKSLRKTIRELKDECLISQETANRL DTFCWDCCQESIEQDYIS >gi568815594f:75415453_75627211|GENSCAN_predicted_CDS_7|597_bp atgaaggaattccccacagatgaaaacatcaaaaggaaatgggtattagcaatgaaaaga cttgatgtgaatgcagccggcatttgggagcctaaaaaaggagatgtgttgtgttcgagg cactttaagaagacagattttgacagaagtgctccaaatattaaactgaaacctggagtc ataccttctatctttgattctccatatcacctacaggggaaaagagaaaaacttcattgt agaaaaaacttcaccctcaaaaccgttccagccactaactacaatcaccatcttgttggt gcttcctcatgtattgaagaattccaatcccagttcatttttgaacatagctacagtgta atggacagtccaaagaaacttaagcataaattagatcatgtgatcggcgagctagaggat acaaaggaaagtctacggaatgttttagaccgagaaaaacgttttcagaaatcattgagg aagacaatcagggaattaaaggatgaatgtctgatcagccaagaaacagcaaatagactg gacactttctgttgggactgttgtcaggagagcatagaacaggactatatttcatga >gi568815594f:75415453_75627211|GENSCAN_predicted_peptide_8|245_aa MVIIADGSSMHVIAPEDLPVEQDVEVEDSDSDDPDPVATSHPASSICGKESIKVQKICSP TCNKKKIPFSEEKFKPAAETCVSNEELNVNPQDNGENVSRACQRSSRQPLPSQVWRPRRK VWFCGPGPGSPCCVQPRDLVPCVPATPAMAERSQHTPWAVASEGASLKPWQLPYDVEPLS AQKSRTEVWASPPKFQMYGNAWMPRQKFVVGPGSSWRTSARAVQKGNVELEPPHRVPTGH CLVEL >gi568815594f:75415453_75627211|GENSCAN_predicted_CDS_8|738_bp atggttatcatagcagatggcagctctatgcatgttattgcccctgaagatcttccagtg gaacaagatgtggaggtggaagacagtgacagtgatgatcctgaccccgtagcaacttcc catcctgcaagctccatttgtggaaaggagagcataaaagttcagaaaatttgcagccca acttgcaataaaaagaaaatcccattttctgaagagaaattcaagccggctgcagaaact tgcgtaagtaatgaggagctgaatgttaatccccaagacaatggggaaaatgtctccagg gcatgtcagcgatcttcaaggcagccccttccatcacaggtctggaggcctaggaggaaa gtatggttttgtgggccaggcccagggtccccgtgttgtgtgcagcccagggacttggtg ccctgtgtcccagccactccagccatggctgaaaggagccaacatacaccttgggccgtg gcttcagagggtgcaagtctcaagccttggcagcttccatatgatgttgagcctttgagt gcacagaagtcaagaactgaggtttgggcatctccgcctaaatttcagatgtatggaaat gcctggatgcccaggcagaagtttgttgtagggccagggtcctcatggagaacctctgct agggcagtgcagaagggaaatgtggagttggaacccccacacagagtccctactgggcac tgcctagtggagctgtga >gi568815594f:75415453_75627211|GENSCAN_predicted_peptide_9|678_aa MEKYENLGLVGEGSYGMVMKCRNKDTGRIVAIKKFLESDDDKMVKKIAMREIKLLKQLRH ENLVNLLEVCKKKKRWYLVFEFVDHTILDDLELFPNGLDYQVVQKYLFQIINGIGFCHSH NIIHRDIKPENILVSQSGVVKLCDFGFARTLAAPGEVYTDYVATRWYRAPELLVGDVKYG KAVDVWAIGCLVTEMFMGEPLFPGDSDIDQLYHIMMCLGNLIPRHQELFNKNPVFAGVRL PEIKEREPLERRYPKLSEVVIDLAKKCLHIDPDKRPFCAELLHHDFFQMDGFAERFSQEL QLKVQKDARNVSLSKKSQNRKKEKEKDDSLVEERKTLVVQDTNADPKIKDYKLFKIKGSK IDGEKAEKGNRASNASCLHDSRTSHNKIVPSTSLKDCSNVSVDHTRNPSVAIPPLTHNLS AVAPSINSGMGTETIPIQGYRVDEKTKKCSIPFVKPNRHSPSGIYNINVTTLVTRNSRLT KKESKILSESRIPSLAAIDLHTPSITLHQMCGLLAKCLQFHIVGAFIVSLGVAAVCKIAV AEPRKKTYADFYRNYDSVKDLEEMGKAVPHTYILFHRRTLEMSVVSPTHLLAMADWTTDR HSTHSLSAYLLKKRNKRERALPGGFLTPVVCSGELGIILRPPTRVYAASLLCFCPGVSDT GQGAGAKDKIPALMELIV >gi568815594f:75415453_75627211|GENSCAN_predicted_CDS_9|2037_bp atggaaaaatatgaaaacctgggtttggttggagaagggagttatggaatggtgatgaag tgtaggaataaagatactggaagaattgtggccataaagaagttcttagaaagtgacgat gacaaaatggttaaaaagattgcaatgcgagaaatcaagttactaaagcaacttaggcat gaaaacttggtgaatctcttggaagtgtgtaagaaaaaaaaacgatggtacctagtcttt gaatttgttgaccacacaattcttgatgacttggagctctttccaaatggactagactac caagtagttcaaaagtatttgtttcagattattaatggaattggattttgtcacagtcac aatatcatacacagagatataaagccagagaatatattagtctcccagtctggcgttgtc aagctatgcgattttggatttgcgcgaacattggcagctcctggggaggtttatactgat tatgtggcaacccgatggtacagagctccagaactattggttggtgatgtcaagtatggc aaggctgttgatgtgtgggccattggttgtctggtaactgaaatgttcatgggggaaccc ctatttcctggagattctgatattgatcagctatatcatattatgatgtgtttaggtaat ctaattccaaggcatcaggagctttttaataaaaatcctgtgtttgctggagtaaggttg cctgaaatcaaggaaagagaacctcttgaaagacgctatcctaagctctctgaagtggtg atagatttagcaaagaaatgcttacatattgaccccgacaaaagacccttctgtgctgag ctcctacaccatgatttctttcaaatggatggatttgctgagaggttttcccaagaacta cagttaaaagtacagaaagatgccagaaatgtttctttatctaaaaaatcccaaaacaga aagaaggaaaaagaaaaagatgattccttagttgaagaaagaaaaacacttgtggtacag gataccaatgctgatcccaaaattaaggattataaactatttaaaataaaaggctcaaaa attgatggagaaaaagctgaaaaaggcaatagagcttcaaatgccagctgtctccatgac agtaggacaagccacaacaaaatagtgccttcaacaagcctcaaagactgcagcaatgtc agcgtggaccacacaaggaatccaagcgtggcaattcccccacttacacacaatctttct gcagttgctcccagcattaattctggaatggggactgagactataccaattcagggttac agagtggatgagaaaactaagaagtgttctattccatttgttaaaccgaacagacattcc ccatcaggcatttataacattaatgtgaccacattagtaactcgaaattccaggctaaca aagaaagagagcaaaattctttcagaatctcgaattccttctctggctgctattgacctg cacacccccagtattacattacatcagatgtgtggtcttctggccaaatgtctgcaattt catattgttggagcctttattgtatccctgggggttgcagctgtctgtaagattgctgtg gctgaaccaagaaagaagacatatgcagatttctacagaaattatgattccgtgaaagat ttggaggagatggggaaggctgtccctcatacatatattctgtttcatcgacgaaccctg gaaatgtcagttgtcagtcctacgcaccttctggctatggctgattggaccactgataga cactcaactcattcactgtctgcttacctacttaaaaagcgaaacaagagggagagggca ttacccggaggcttcctgaccccggtggtttgcagtggagagttggggatcattcttagg cccccaaccagggtttatgctgcttctctgctctgcttttgtccaggagtgtcagacact ggtcaaggtgctggggctaaagataagattcctgctctcatggaacttatagtttag