GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:35:13 Sequence gi568815595r:196371822_196603173 : 231352 bp : 43.85% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9703 10012 310 0 1 99 -54 251 0.006 9.48 1.02 Term + 12316 13151 836 2 2 53 36 341 0.008 18.55 1.03 PlyA + 14550 14555 6 1.05 2.06 PlyA - 16183 16178 6 1.05 2.05 Term - 17407 17339 69 1 0 58 29 50 0.053 -5.96 2.04 Intr - 20104 19992 113 2 2 72 98 92 0.207 8.70 2.03 Intr - 31198 31131 68 0 2 84 103 48 0.185 4.45 2.02 Intr - 60068 59963 106 0 1 120 80 32 0.085 4.87 2.01 Init - 60719 60506 214 2 1 62 111 288 0.946 25.41 2.00 Prom - 74281 74242 40 -5.16 3.14 PlyA - 76544 76539 6 1.05 3.13 Term - 87253 87164 90 1 0 83 39 84 0.487 0.72 3.12 Intr - 92091 91951 141 0 0 103 52 18 0.252 0.25 3.11 Intr - 100951 100079 873 1 0 132 63 298 0.442 23.23 3.10 Intr - 103491 103410 82 2 1 67 95 35 0.511 1.74 3.09 Intr - 112070 111949 122 2 2 60 100 22 0.479 -0.11 3.08 Intr - 115757 115578 180 2 0 14 87 164 0.958 8.96 3.07 Intr - 116862 116786 77 1 2 44 100 83 0.416 4.33 3.06 Intr - 131336 131052 285 0 0 29 92 287 0.100 20.51 3.05 Intr - 135024 134988 37 0 1 81 46 58 0.091 -1.26 3.04 Intr - 136510 136150 361 0 1 52 91 192 0.152 11.12 3.03 Intr - 137848 137699 150 0 0 120 81 119 0.992 13.68 3.02 Intr - 144888 144811 78 2 0 83 38 88 0.729 1.97 3.01 Init - 146301 146177 125 0 2 88 66 51 0.511 2.55 3.00 Prom - 150230 150191 40 -4.56 4.06 PlyA - 152286 152281 6 1.05 4.05 Term - 157226 156915 312 2 0 65 49 165 0.044 5.30 4.04 Intr - 172618 172583 36 1 0 91 76 29 0.105 0.46 4.03 Intr - 174329 174130 200 0 2 75 82 76 0.644 4.77 4.02 Intr - 182986 182473 514 1 1 52 22 159 0.055 -1.76 4.01 Init - 189654 189175 480 0 0 64 64 219 0.527 12.49 4.00 Prom - 191538 191499 40 -3.26 5.00 Prom + 195762 195801 40 -4.06 5.01 Init + 197164 197481 318 0 0 93 100 558 0.778 52.84 5.02 Intr + 205632 205988 357 2 0 87 94 257 0.996 21.65 5.03 Term + 212312 212497 186 1 0 94 48 148 0.966 8.89 5.04 PlyA + 217219 217224 6 1.05 6.00 Prom + 217331 217370 40 -2.76 6.01 Init + 224545 224639 95 0 2 47 86 67 0.226 2.25 6.02 Term + 226369 226834 466 1 1 57 43 146 0.100 1.49 6.03 PlyA + 230158 230163 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 9703 10041 339 0 0 99 34 266 0.946 18.33 S.002 Sngl + 164711 164881 171 1 0 83 36 164 0.929 5.53 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:196371822_196603173|GENSCAN_predicted_peptide_1|381_aa MGRNQSRKDENSKNQSASSPPKDRNSLPATEQRWMENDFDKLTEVVFRRSVITNFSKLKE DVRTHRKEAKHLEKRLDELLTRINSVEKNLNDLMELKTTARELQIQTTTREYYKHLYANK LENLEEMGKFLDTYTLPRLNQEEVESLNRPITGSEIEAIINSLTTKKSPGPDGFTAKFYQ RYKEELVPFLLKLFQSTEKKGILPNSFYEANIILIPKPGRDTTKKENFRPISLMNIDAKI LNKILANRIQQHMKKLIHHDQVGFIPGMQGWFNICKSINIIHHINRTNDKNHMIISIDAE KTFNKIQQPFTLKILNKLGIDGTYLKIIRDIYGKPTANTILNGQKLEAFPLKTGTRQGCP LSPLLFNILLEVLARAIRQEK >gi568815595r:196371822_196603173|GENSCAN_predicted_CDS_1|1146_bp atggggagaaaccagagcagaaaagatgaaaattctaaaaatcagagcgcctcttctcct ccaaaggatcgcaactccttgccagcaacggaacaacgctggatggagaatgactttgac aagttgacagaagtagtcttcagaaggtcggtaataacaaacttctccaagctaaaggag gatgttcgaacccatcgcaaggaagctaaacaccttgaaaaaagattagatgaattgcta actagaataaacagtgtagagaagaacttaaatgacctgatggagctgaaaaccacagca cgagaacttcaaatacaaactaccaccagagaatactataaacacctctatgcaaataaa ctagaaaatctagaagaaatgggcaaattcctggacacatacaccctcccaagactaaac caggaagaagttgaatctctgaacagaccaataacaggctctgaaattgaggcaataatt aacagcctaacaaccaaaaaaagtccaggaccagacggattcacagccaaattctaccag aggtacaaagaggagctggtaccattccttctgaaactattccaatcaacagaaaaaaag ggaatcctccctaactcattttatgaggccaacatcatcctgataccaaagcctggcaga gacacaacaaaaaaggagaattttagaccaatatccctgatgaacattgatgcgaaaatc ctcaataaaatactggcaaaccgaatccagcagcacatgaaaaagcttatccaccatgat caagtcggcttcatccctgggatgcaaggttggttcaacatatgcaaatcaataaacata atccatcatatcaacagaaccaacgacaaaaaccacatgattatctcaatagatgcagaa aagaccttcaacaaaattcaacagcccttcacgctaaaaattctcaataaactaggtatt gatggaacgtatctcaaaataataagagatatttatggcaaacccacagccaataccata ctgaatgggcaaaaactggaagcattccctttgaaaaccggcacaagacaaggatgccct ctctcaccactcctattcaacatactgttggaagttctggccagggcaatcaggcaagag aaataa >gi568815595r:196371822_196603173|GENSCAN_predicted_peptide_2|189_aa MRLRRPLGTGGAGLEGRSMRLRTPHGEVGEGSPCVSVLLFGGGGGGKMAAHGGSAASSAL KGLIQQFTTITGHSAPVPGLPFPFLGPSEPSPWPWRVRALPPSRQGREEVRAPIPQKQEI LVEPEPLFGVRQEQELRNGGAIDKKLTTLADLFRPPIDLMHKGSFETYSGDTTAIERENS SNIKGRKGS >gi568815595r:196371822_196603173|GENSCAN_predicted_CDS_2|570_bp atgcgcctgcgcagaccgctggggacgggaggggcggggctcgaggggcggtcaatgcgc ctgcgcacaccgcacggcgaagtgggggagggcagtccgtgtgtgtctgtgttgttgttc ggcggcggcggcggcggtaagatggctgcccacgggggctccgcggcgtcctcggcgctg aaggggttaattcaacagttcaccaccattaccggtcattctgccccggtccccggcctg cccttcccctttctgggcccctcggagccatcgccgtggccctggcgggttcgggccctc ccgccttcacggcaaggccgagaagaagttcgtgccccaattcctcaaaagcaggaaata ctggtggaaccagaaccattatttggtgttcggcaagaacaagaattaagaaatggagga gctatcgataagaaattaactacccttgcagatctattccggccacccattgatttgatg cataaaggcagctttgaaacatattctggtgatactacagccattgagagagagaatagc agcaatattaaaggcagaaaaggatcttag >gi568815595r:196371822_196603173|GENSCAN_predicted_peptide_3|866_aa MPVHVTSTWLAAMISAHVATKKKQQELPYWVNLALSHQHLHIMFLEDKDFALLITILPVP GIVLLIHSVDHKLQALETQFKELDFTKDNLMQKFEHHSKALASQAAQDEMWTAVRALQLT SMELNILYSYVIEVLICLHTRVLEKLPDLVRGLPTLASVLRRKVKNKRVRVVWESILEEC GLQEGDITALCTFFIARGNKAEHYTAKVRQMYIRDVTFLITNMVKNQALQDSLLRAVQSW DNNSELIKFGNAIPSLSECQCGICMEILVEPVTLPCNHTLCKPCFQSTVEKASLCCPFCR RRVSSWTRYHTRRNSLVNVELWTIIQKHYPRECKLRASGQESEEVADDYQPVRLLSKPGE LRREYEEEISKVAAERRASEEEENKASEEYIQRLLAEEEEEEKRQAEKRRRAMEEQLKSD EELARKLSIDINNFCEGSISASPLNSRKSDPVTPKSEKKSKNKQRNTGDIQKYLTPKSQF GSASHSEAVQEVRKDSVSKDIDSSDRKSPTGQDTEIEDMPTLSPQISLGVGEQGADSSIE SPMPWLCACGAEWYHEGNVKTRPSNHGKELCVLSHERPKTRVPYSKETAVMPCGRTESGC APTSGVTQTNGNNTGETENEESCLLISKEISKRKNQESSFEAVKDPCFSAKRRKVSPESS PDQEETEINFTQKLIDLEHLLFERHKQEEQDRLLALQLQKEVDKEQMVPNRQKGSPDEYH LRATSSPPDKVLNGQRKNPKDGNFKRQTHTKHPTPERGSRDKNRQVSLKMQLKQSVNRRK MPNSTRDHCKMYPGSQKTLAPEETDSQKYLVTYSSSHTSSAYVVWIKIWKTDTKNLEEIK PSDIITKFPELLEGYNPEIIEHQDKI >gi568815595r:196371822_196603173|GENSCAN_predicted_CDS_3|2601_bp atgcctgtacatgttacttctacctggctggctgccatgatatctgcacatgtggcaaca aagaaaaagcagcaggaactgccatattgggtgaacctggctcttagccaccagcattta catattatgttccttgaggacaaggatttcgccttacttatcactatattaccagtgcct ggaatagtgctgctcatccattcagtagaccacaaactccaagcgttagaaacacagttc aaagaactagacttcaccaaggataacctgatgcagaaattcgaacatcatagtaaggct ttggcaagccaagcagcccaagatgagatgtggacagcagttcgggcactccagctcact tcaatggaattgaatattttatacagctacgtcattgaagtacttatctgcttgcatact cgtgtgcttgagaagctgccagacctggtgagaggtcttccaaccttagcctctgtactc agaagaaaagttaagaacaagcgcgttagagttgtatgggagtccatactggaggagtgt gggctgcaagaaggagacatcacagcactttgtaccttctttattgcacgtggtaacaag gcagaacactatactgctaaagtgaggcagatgtacatcagggatgtcacgttcctaatt actaacatggtaaagaaccaggctctgcaggacagtttgctgagggctgtgcagtcatgg gacaacaactcagaactcatcaaattcgggaacgccatcccctcgctgtccgagtgccag tgcgggatctgcatggaaatcctcgtggagcccgtcaccctcccgtgtaaccacacgctg tgtaaaccgtgcttccagtcgaccgtcgaaaaggcgagtttatgctgtcccttctgtcgc cgccgggtatcgtcgtggactcggtaccatacccgaagaaattctctcgtcaacgtggaa ctgtggacgataattcaaaaacactatcccagggagtgcaagcttagagcgtctggccaa gaatcagaggaagtggctgatgactatcagccagttcgtctgctcagtaaacctggggaa ctgagaagagaatatgaagaggaaataagcaaggtggcggcagagcgacgggccagcgag gaagaagaaaacaaagccagtgaagaatacatacagaggttgttggcagaggaggaagaa gaggaaaaaagacaggcagaaaaaaggcgaagagcgatggaagaacaactgaaaagtgat gaggaactggcaagaaagctaagcattgatattaacaatttctgtgagggaagtatctcg gcttctcccttgaattccagaaaatctgatccagttacacccaagtctgaaaagaaaagt aagaacaaacaaagaaacactggagatattcagaagtatttgacaccgaaatctcagttt gggtcagcctcacactctgaagctgtacaagaagtcaggaaagactccgtatctaaggac attgacagtagtgataggaaaagcccaacagggcaagacacagaaatagaagatatgccg acactttctccacagatatcccttggagttggagaacaaggtgcagattcttcaatagag tcccctatgccatggttatgtgcctgtggtgccgaatggtaccatgaaggaaacgtcaaa acaagaccaagcaatcatgggaaagagttatgtgtcttaagtcacgagcgacctaaaacc agagttccctactcgaaagaaactgcagttatgccttgtggcagaacagaaagtgggtgc gcccccacatcaggggtgacacagacaaatggaaacaacacaggtgagacagaaaatgaa gagtcgtgcctactgatcagtaaggagatttccaaaagaaaaaaccaagaatcttccttt gaagcagtcaaggatccatgcttttctgcaaaaagaagaaaagtgtcccccgaatcttcc ccagatcaagaggaaacagaaataaactttacccaaaaactgatagatttggagcatcta ctgtttgagagacataaacaagaagaacaggacaggttattggcattacaacttcagaag gaggtggataaagagcaaatggtgccaaaccggcaaaaaggatccccagatgagtatcac ttacgcgctacatcctcccctccagacaaagtgctaaatggacagaggaagaatcccaaa gatgggaacttcaaaaggcaaactcacacaaagcatccaacaccagagagaggctcaagg gacaaaaataggcaagtgtctttaaagatgcagttgaagcagtcagttaatagaagaaag atgccaaattctactagagatcactgtaagatgtatcctgggtcacaaaagaccttggct ccagaagaaactgattctcagaaataccttgtgacttactcaagttctcatactagctct gcgtatgtggtgtggataaaaatttggaaaacagacactaaaaacttggaggaaattaag ccatcagacatcatcactaaattcccagagctcctagaaggatacaatccagaaataata gaacaccaggataagatttga >gi568815595r:196371822_196603173|GENSCAN_predicted_peptide_4|513_aa MAVKWTGGHSSPVLCLNASKEGLLASGAEGGDLTAWGEDGTPLGHTRFQGADDVTSVLFS PSCPTKLYASHGETISVLDVRSLKDSLDHFHVNEEEINCLSLNQTENLLASADDSGAIKI LDLENKKVIRSLKRHSNICSSVAFRPQRPQSLVSCGLDMQVMLWSLQKARPLWITNLQED ETEEMEGPQSPGQLLNPALAHSISVASCGNIFSCGAEDGKVRIFRVMGVKCEQELGFKGH TSGVSQVCFLPESYLLLTGGNDGKITLWDANSEVEKKQKSPTKRTHRKKPKRGTCTKQGG NTNASVTDEEEHGNILPKLNIEHGEKVNWLLDLSKQKLNLWGNVVSNIKTQDDMQENWEG YRTEVFLYRRGLGRRQQFHTLTGQVTPMRQGSNQIETQAYTFMANRFLAKLSIPGTSVLS SAAIASRSATGVAGSRLRMEEAAASAPWRLGPAPLPRPPPPPSSPRRRPGRSEPDRAEPP RRRCPEWKCGLRSGRRCWEPDGARRRDTAVLRT >gi568815595r:196371822_196603173|GENSCAN_predicted_CDS_4|1542_bp atggcagtcaagtggacgggtgggcattcttctcctgtcctctgcctgaatgcaagtaaa gaagggctgctggcttctggagcagagggcggagatctcacggcttggggtgaagatgga actccattaggacacacgcggttccaaggggctgatgatgttaccagtgtcttattttct ccctcctgtcccaccaagctctatgcctcacatggagaaaccattagtgtactggatgtc aggtccctcaaagattccttggaccattttcatgtgaatgaagaagaaatcaattgtctt tcattgaatcaaacggaaaacctgctggcttctgctgacgactctggggcaatcaaaatc ctagacttggaaaacaagaaagttatcagatccttgaagagacattccaatatctgctcc tcagtggcttttcggcctcagaggcctcagagcctggtgtcatgtggactggatatgcag gtgatgctgtggagtcttcaaaaagcccgaccactctggattacaaatttacaggaggat gaaacagaagaaatggaaggcccacagtcacctggtcagctcttaaaccctgccctagcc cattctatctctgtggcttcgtgtggtaatatttttagttgtggtgcagaagatggtaag gttcgaatctttcgggtgatgggagttaagtgtgaacaggaactgggatttaagggccac acttcaggggtatcccaggtctgctttctcccagaatcctatttgctgcttactggaggg aatgatgggaagatcacgttgtgggatgcaaacagtgaagttgagaaaaaacagaagagt cccacaaaacgtacccacaggaagaaacctaaaagaggaacttgcaccaagcagggtgga aatactaacgcttcagtaacagatgaggaagaacatggcaacattttaccaaagctaaat attgaacatggagaaaaagtgaactggctcttggatctttcaaagcagaaattaaacctc tggggaaatgttgtttctaatataaaaacacaggatgatatgcaggagaactgggaaggc tacaggactgaagtcttcctgtacaggagaggcttggggaggaggcagcagttccatact ctcactggtcaggtgacacccatgcgtcaggggagcaaccagattgaaactcaggcttac acgttcatggcaaacagatttttagcaaagctctccattccggggaccagtgttctgagc tcggccgccatcgcctcacgctcagcgacgggcgtggccgggagccgcctgcggatggag gaagcggccgccagcgccccctggaggctcgggccggctccactcccgcggcccccgcca ccgcccagctccccgcggaggcgcccaggccggtctgagcccgacagggcggaaccaccg cggcggcgctgccctgagtggaaatgcggcctccgcagcgggagaagatgctgggagccg gatggggcccggaggcgcgacaccgcagttctccgtacttga >gi568815595r:196371822_196603173|GENSCAN_predicted_peptide_5|286_aa MAAPAPGAGAASGGAGCSGGGAGAGAGSGSGAAGAGGRLPSRVLELVFSYLELSELRSCA LVCKHWYRCLHGDENSEVWRSLCARSLAEEALRTDILCNLPSYKAKIRAFQHAFSTNDCS RNVYIKKNGFTLHRNPIAQSTDGARTKIGFSEGRHAWEVWWEGPLGTVAVIGIATKRAPM QCQGYVALLGSDDQSWGWNLVDNNLLHNGEVNGSFPQCNNAPKYQIGERIRVILDMEDKT LAFERGYEFLGVAFRGLPKVCLYPAVSAVYGNTEVTLVYLGKPLDG >gi568815595r:196371822_196603173|GENSCAN_predicted_CDS_5|861_bp atggcggcgccggccccgggggctggggcagcctcgggcggcgctggctgtagcggcggc ggcgcgggcgcgggcgcgggctcgggctctggggccgcgggggccgggggccggctgccc agccgggtgctggagttggtgttctcttacctggagctgtccgagctgcggagctgcgcc ctggtgtgcaagcactggtaccgctgcctgcacggcgatgagaacagcgaggtgtggcgg agcctgtgcgcccgcagcctggcagaagaggctctgcgcacggacatcctgtgcaacctg cccagctacaaggccaagatacgtgcttttcaacatgccttcagcactaatgactgctcc aggaatgtctacattaagaagaatggctttactttacatcgaaaccccattgctcagagc actgatggtgcaaggaccaagattggtttcagtgagggccgccatgcatgggaagtgtgg tgggagggccctctgggcactgtggcagtgattggaattgccacaaaacgggcccccatg cagtgccaaggttatgtggcattgctgggcagtgatgaccagagctggggctggaatctg gtggacaataatctactacataatggagaagtcaatggcagttttccacagtgcaacaac gcaccaaaatatcagataggagaaagaattcgagtcatcttggacatggaagataagact ttagcttttgaacgtggatatgagttcctgggggttgcttttagaggacttccaaaggtc tgcttatacccagcagtttctgctgtatatggcaacacagaagtgactttggtttacctt ggaaaacctttggacggatga >gi568815595r:196371822_196603173|GENSCAN_predicted_peptide_6|186_aa MRNSTFEDSPEEKRQRRLKGVKRKKDTEGRASAGTVSVSGIGGFLVSLTSRLKPRTLAVS VTALKGARLEFVPSDVRMCSEFFLSGGLQTFAVSVTALKAAHLELLVPPAGLVVSLASGV KLQTFVVSVTAHKKQRGPSEQQQDLLQKAKEQSFHTMEGDRNGLPLLARAACFYSLIWPH PHPVDW >gi568815595r:196371822_196603173|GENSCAN_predicted_CDS_6|561_bp atgaggaatagcacatttgaggactccccagaagaaaaacgacaaaggaggctgaaagga gtgaagaggaaaaaagatacagaggggagagcaagcgcgggtactgttagtgtgtctgga attggtgggttcttggtctcactgacttcaagactgaagccgcggaccctcgcggtgagt gttacagctcttaagggcgcgcgtctggagtttgttccttctgatgttcggatgtgttcg gagttttttctttctggtgggctgcagactttcgcggtgagtgttacagctcttaaagcg gcgcatctggagttgcttgttcctcccgctgggctcgtggtctcactggcttcaggagtg aagctgcagacttttgtggtgagtgttacagctcataaaaagcagcgtggacccagtgag cagcagcaagatttattgcaaaaagccaaagaacagagcttccacacaatggaaggggac cggaacggcttgccgctgctggctcgggcagcctgcttttattctcttatctggccccac ccacatcctgttgattggtag