GENSCAN 1.0	Date run: 6-Nov-116	Time: 08:55:23

Sequence gi568815592r:152871166_153082959 : 211794 bp : 37.57% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  11381  11545  165  1  0   69   81   153 0.992  12.38
 1.02 Intr +  12312  12495  184  2  1   82   71    95 0.451   5.64
 1.03 Term +  13089  13537  449  1  2   15   43   217 0.352   4.29
 1.04 PlyA +  13765  13770    6                               1.05

 2.00 Prom +  13931  13970   40                              -6.15
 2.01 Sngl +  14065  14679  615  0  0   22   41   269 0.942  11.54
 2.02 PlyA +  14737  14742    6                               1.05

 3.05 PlyA -  15052  15047    6                               1.05
 3.04 Term -  27230  27069  162  2  0   62   44   173 0.047   7.25
 3.03 Intr -  27742  27599  144  1  0   61   91   136 0.123  10.76
 3.02 Intr -  40848  40601  248  1  2   82   41   172 0.126   8.16
 3.01 Init -  50949  50856   94  0  1   86   85    32 0.513   3.29
 3.00 Prom -  52910  52871   40                              -3.65

 4.00 Prom +  57235  57274   40                              -4.55
 4.01 Init +  57386  57400   15  1  0   77   87    16 0.673   0.74
 4.02 Term +  61286  61426  141  1  0   69   34   169 0.929   6.55
 4.03 PlyA +  63013  63018    6                               1.05

 5.02 PlyA -  63831  63826    6                               1.05
 5.01 Sngl -  88613  88404  210  2  0   80   46   228 0.681  12.75
 5.00 Prom -  95683  95644   40                              -3.65

 6.07 PlyA -  96440  96435    6                               1.05
 6.06 Term -  98605  98502  104  2  2   42   42   102 0.315  -1.54
 6.05 Intr - 100249 100036  214  1  1  122   47   120 0.661   8.67
 6.04 Intr - 101289 101107  183  0  0   53   76   140 0.767   8.36
 6.03 Intr - 101971 101881   91  0  1   58  103    25 0.808   0.08
 6.02 Intr - 104456 103742  715  0  1   88  108   387 0.966  30.26
 6.01 Init - 105278 105161  118  2  1   60   74   103 0.960   6.51
 6.00 Prom - 108886 108847   40                              -7.55

 7.04 PlyA - 110724 110719    6                               1.05
 7.03 Term - 116067 115969   99  0  0  100   44   107 0.728   4.65
 7.02 Intr - 116881 116832   50  2  2   66   62    51 0.429  -2.22
 7.01 Init - 117940 117802  139  1  1   62  107    90 0.489   8.66
 7.00 Prom - 118498 118459   40                             -10.25

 8.15 PlyA - 118507 118502    6                               1.05
 8.14 Term - 118930 118730  201  1  0   77   38   163 0.924   6.61
 8.13 Intr - 120156 120020  137  1  2   67   98    65 0.997   4.77
 8.12 Intr - 121809 121692  118  0  1   66   89   142 0.999  11.22
 8.11 Intr - 123511 123348  164  2  2   92   94    83 0.999   8.07
 8.10 Intr - 124129 123971  159  2  0   62   68    98 0.913   4.24
 8.09 Intr - 126711 126516  196  0  1  -24   66   192 0.273   3.87
 8.08 Intr - 127464 127385   80  1  2  123   80    54 0.302   6.45
 8.07 Intr - 131456 131262  195  0  0   21  103   189 0.499  12.16
 8.06 Intr - 132142 131548  595  1  1   -1   53   327 0.199  11.51
 8.05 Intr - 153331 153097  235  0  1   93   83   147 0.773  11.47
 8.04 Intr - 155378 155289   90  1  0   68  101    69 0.646   4.39
 8.03 Intr - 159850 159757   94  2  1   92   81    30 0.652   0.80
 8.02 Intr - 163352 163176  177  0  0   58   74    56 0.277   0.17
 8.01 Init - 166872 166773  100  0  1   72   77   119 0.702   9.67
 8.00 Prom - 171277 171238   40                              -6.05

 9.00 Prom + 179291 179330   40                              -4.85
 9.01 Init + 185197 185241   45  0  0   91   72    14 0.313   0.83
 9.02 Term + 191992 192312  321  0  0   68   43   204 0.933   7.74
 9.03 PlyA + 192391 192396    6                               1.05

10.00 Prom + 202931 202970   40                              -3.45
10.01 Init + 205315 205428  114  0  0   79  105    77 0.892   8.76

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  40848  40547  302  1  2   82   31   177 0.813   5.80
S.002 Init + 144015 144149  135  2  0   50   24   157 0.867   5.69
S.003 Term + 144278 144394  117  1  0   28   45   162 0.855   3.46

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:152871166_153082959|GENSCAN_predicted_peptide_1|265_aa
MNRHFPKEDIYVDNKCDKELNISDYQRNENQNHNEIPSNTSQNGNYKKVKKQQMLPLLMI
PRQTGSTVDLQQIPTDLQLRYLTLRRKTNKQKGIASTSTKRTSTPKAHRSPASKTKARQA
NIQIQEIQRTTQRYSSRRTTPRHIIVRFTKVEMKEKMLRAAREKGRVTHKGKPIRLTADL
SAETLQARREWGPIFNILKEKNFQPIISYPAKLSFKSEGEIKSFTDKQMLRDFVTTRPAL
KELLKEALNMERNNRYQPLQKHAKL
>gi568815592r:152871166_153082959|GENSCAN_predicted_CDS_1|798_bp
atgaacagacacttcccaaaagaagacatttacgtggacaacaaatgtgacaaagagctc
aacatctctgattatcagagaaatgaaaatcaaaaccacaatgagataccatctaacacc
agtcagaatggcaattataaaaaagtcaagaaacaacagatgctgcctctgctgatgata
cccaggcaaacagggtctacagtggacctccagcaaattccaacggacctgcagctgagg
tacctgactcttagaaggaaaactaacaaacagaaaggaatagcatcaacatcaacaaaa
aggacatccacaccaaaagcccacaggtcaccagcatcaaagaccaaagcaaggcaggcc
aacattcaaattcaggaaatacagagaacaacacaaagatactcctcaagaagaacaacc
ccaagacacataattgtcagattcaccaaggttgaaatgaaggaaaaaatgctaagggca
gccagagagaaaggtcgggttacccacaaagggaagcccatcagactaacagcagatctc
tcagcagaaactctacaagccagaagagagtgggggccaatattcaacattcttaaagaa
aagaattttcaacccataatttcatatccagccaaactaagcttcaaaagtgaaggagaa
ataaaatcctttacagacaagcaaatgctgagagattttgtcaccaccaggcctgcctta
aaagagctcctgaaggaagcactaaacatggaaaggaacaaccggtaccagccattgcaa
aaacatgccaaattgtaa
>gi568815592r:152871166_153082959|GENSCAN_predicted_peptide_2|204_aa
MRQKVNKDIQDLNSSLHQPDLIDIYRTLHPKSTEYTFFSAPHCTYSKIDHIIGSKALLSK
CKRTEITTNCLSDHSAIKLELRIKKLTQNHTTTWELNNLLLSDYWVNNKMKAEIKMFFET
NENKDTTYQNLSGTFKAVCRGKFIALNAYKRKQEISKIDTLTSQLKELEKQQQTHSKASR
RQEITKIRAELKEIETQKNPSKNQ
>gi568815592r:152871166_153082959|GENSCAN_predicted_CDS_2|615_bp
atgagacagaaggttaacaaggatatccaggacttgaactcaagtctgcaccaaccggac
ctaatagacatctacagaactctccaccccaaatcaacagaatatacattcttctcagca
ccacattgcacttattccaaaattgaccacataattggaagtaaagcactcctcagcaaa
tgtaaaagaacagaaatcacaacaaactgtctctcagaccacagtgcaatcaaattagaa
ctcaggattaagaaactcactcaaaaccacacgactacatgggaactgaacaacctgctc
ctgagtgactactgggtaaataacaaaatgaaggcagaaataaaaatgttctttgaaacc
aatgagaacaaagacacaacgtaccagaatctctcgggcacatttaaagcagtgtgtaga
gggaaatttatagcactaaatgcctacaagagaaagcaggaaatatctaaaatcgacacc
ctaacatcacaattaaaagaactagagaagcaacagcaaacacattcaaaagctagcaga
aggcaagaaataactaagatcagagcagaactgaaggagatagagacacaaaaaaaccct
tcaaaaaatcaatga
>gi568815592r:152871166_153082959|GENSCAN_predicted_peptide_3|215_aa
MVEGERHVLHYGRQEKRAYTGKLPPYKTIRSEIPEALREYCKHLYACKLENLEEMDTFLE
TYNCPRLNQKETETLNRPIMISKIELVIKSLSTRKSPGPDRFTAEFYWMYEEELVVAMAG
VSGNHHVLGINHLLGELRHYEGLVVLAAVDGQQGEDRHKEVQGVGQLEILQAVTALSLPP
HNVQYRIHQLRTFCVVPIGPVVSGTTLMEDEVVQP
>gi568815592r:152871166_153082959|GENSCAN_predicted_CDS_3|648_bp
atggtggaaggtgaaaggcacgtcttacattatggcaggcaagagaagagagcttataca
gggaaactccccccttataaaaccatcaggtctgaaataccagaagccctcagagagtat
tgtaaacacctctatgcatgcaagctagaaaacctggaagagatggatacattcttggaa
acatacaactgcccaagattgaaccagaaagaaactgaaaccctaaacagaccaataatg
atctccaaaattgaattagtaataaaaagcctatcaacaagaaaaagccctggaccagac
agattcacagctgaattctactggatgtatgaggaagagctggtagtggccatggcaggg
gtctcagggaaccatcatgttcttggcatcaaccatctgctgggtgagctcaggcactat
gagggcctggtagtgctggctgctgtggatggtcagcagggtgaagacaggcataaagaa
gtgcagggagtgggtcagctggaaatcctgcaggcagtcacagctctcagcctccctcct
cacaatgtccagtaccgaatccaccagctccgcaccttctgcgtagtgcccattggccca
gttgtttccggcaccactctgatggaagatgaagttgtccagccgtaa
>gi568815592r:152871166_153082959|GENSCAN_predicted_peptide_4|51_aa
MASWKADGEFEEPIAEEGTGDAVCKLTAGCYQEDNKNNTCHTWRRNSSGQQ
>gi568815592r:152871166_153082959|GENSCAN_predicted_CDS_4|156_bp
atggcgtcatggaaagctgatggagaatttgaggagccgatagctgaagaaggaacaggt
gatgctgtctgtaaattgactgctggatgttaccaagaagacaataaaaacaatacgtgc
catacctggaggcgaaactccagtggacagcagtag
>gi568815592r:152871166_153082959|GENSCAN_predicted_peptide_5|69_aa
MSLGFSDCTTTASPTTMSSGSENHTNTTNPRRKYGPHIGVAEQLWGIPGGKINWRRTELK
KKLISAGRC
>gi568815592r:152871166_153082959|GENSCAN_predicted_CDS_5|210_bp
atgtctttgggattcagcgactgcaccacaacagcaagccccacaaccatgtcctcggga
agtgaaaaccacaccaatacaacaaaccccaggcggaagtacggaccgcatattggcgtc
gccgagcaactgtgggggatcccggggggaaaaatcaactggcgccgaaccgagctgaag
aaaaagctgatcagcgcaggccggtgttga
>gi568815592r:152871166_153082959|GENSCAN_predicted_peptide_6|474_aa
MSRYSSLLSSKVDKRFEYIRRSPGAPIRIKLGDKSCGNSGCKEESSTLSVKMKCDFNCNH
VHSGLKLVKPDDIGRLVSYTPAYLEGSCKDCIKDYERLSCIGSPIVSPRIVQLETESKRL
HNKENQHVQQTLNSTNEIEALETSRLYEDSGYSSFSLQSGLSEHEEGSLLEENFGDSLQS
CLLQIQSPDQYPNKNLLPVLHFEKVVCSTLKKNAKRNPKVDREMLKEIIARGNFRLQNII
GRKMGLECVDILSELFRRGLRHVLATILAQLSDMDLINVSKVSTTWKKILEDDKGAFQLY
SKAIQRVTENNNKFSPHASTREYVMFRTPLASVQKSAAQTSLKKDAQTKLSNQGDQKGST
YSRHNEFSEVAKTLKKNESLKACIRCNSPAKYDCYLQRATCKREGCGFDYCTKCLCNYHT
TKDCSDGKLLKASCKIGPLPGSMLLDIPVHQLMDVWLFTPFPAVTSKVLSVQSN
>gi568815592r:152871166_153082959|GENSCAN_predicted_CDS_6|1425_bp
atgtcccggtactcgagtttactatccagtaaggtagacaaacgctttgaatatattaga
aggtcaccaggagcacccatcagaataaaacttggagataagagctgtgggaattcaggt
tgtaaagaagaaagttctaccctttctgtcaaaatgaagtgtgattttaattgtaaccat
gttcattccggacttaaactggtaaaacctgatgacattggaagactagtttcctacacc
cctgcatatttggaaggttcctgtaaagactgcattaaagactatgaaaggctgtcatgt
attgggtcaccgattgtgagccctaggattgtacaacttgaaactgaaagcaagcgcttg
cataacaaggaaaatcaacatgtgcaacagacacttaatagtacaaatgaaatagaagca
ctagagaccagtagactttatgaagacagtggctattcctcattttctctacaaagtggc
ctcagtgaacatgaagaaggtagcctcctggaggagaatttcggtgacagtctacaatcc
tgcctgctacaaatacaaagcccagaccaatatcccaacaaaaacttgctgccagttctt
cattttgaaaaagtggtttgttcaacattaaaaaagaatgcaaaacgaaatcctaaagta
gatcgggagatgctgaaggaaattatagccagaggaaattttagactgcagaatataatt
ggcagaaaaatgggcctagaatgtgtagatattctcagcgaactctttcgaaggggactc
agacatgtcttagcaactattttagcacaactcagtgacatggacttaatcaatgtgtct
aaagtgagcacaacttggaagaagatcctagaagatgataagggggcattccagttgtac
agtaaagcaatacaaagagttaccgaaaacaacaataaattttcacctcatgcttcaacc
agagaatatgttatgttcagaaccccactggcttctgttcagaaatcagcagcccagact
tctctcaaaaaagatgctcaaaccaagttatccaatcaaggtgatcagaaaggttctact
tatagtcgacacaatgaattctctgaggttgccaagacattgaaaaagaacgaaagcctc
aaagcctgtattcgctgtaattcacctgcaaaatatgattgctatttacaacgggcaacc
tgcaaacgagaaggctgtggatttgattattgtacgaagtgtctctgtaattatcatact
actaaagactgttcagatggcaagctcctcaaagccagttgtaaaataggtcccctgcct
ggcagtatgctgttggatataccagttcaccagttgatggatgtttggctctttactcca
tttcctgctgttacaagtaaggtgctgtcagtgcaatcaaattag
>gi568815592r:152871166_153082959|GENSCAN_predicted_peptide_7|95_aa
MVKSGLMERVATAYSVIGKFPVLPWARAGVPLVDQVLQAKNLTLPTAAQNEDTIPEHLNI
LHKASFVPISSEDIPYSHGSHPEGLLPPQETSDNV
>gi568815592r:152871166_153082959|GENSCAN_predicted_CDS_7|288_bp
atggtgaagtcaggcctaatggagagagtggctacagcctactcagtgattgggaaattc
cctgtcttgccctgggccagagctggtgtaccgctggtggatcaggtcttacaagcaaag
aacctcacactcccaactgcagcacaaaatgaagataccatacctgaacacctgaacatt
cttcacaaggcctcctttgtgcccatttcttcagaagacatcccctacagccatggttct
catccagagggtcttttgcccccgcaggagacatctgacaacgtttag
>gi568815592r:152871166_153082959|GENSCAN_predicted_peptide_8|846_aa
MYRSFGDDLGNRLVLFIPMAMAPSALEKAEHMKGIISVPLQVSGPEEFTRLCRIWLLQAW
QGSVCMLSVKVADAVSSVCITSRFPALTLIGTGLHLANNCPFSTQFNLYVFCEAFLNSLM
QKHSLTVRNEERGENAGRPTHTTKMESIQVLEECQNPTAEEVLSWSQNFDKMMKAPAGRN
LFREFLRTEYSEENLLFWLACEDLKKEQNKKVIEEKARMIYEDYISILSPKEWLSHNTLI
KSLLPRTSPTVDKRNISRKKGGGINTRLSIIKLALEDSAAALHRRLTTCAAVTSALQRTL
DCPALRLWRPRLSAWERNPHYQKTEGQSAALPFRPNTSPINRIPTKRLSEEAPKVSQARP
SPITLRSAPSTFGLREVRSVLTLGRKFPPRLPENPLKSNACAGRGLVRSHPYLQILGVVA
RAAVSGTSGSARRPLSSGSPPLEELFTRGGPLRTFLERQAGSEAHLKVRRPELLAVIKLL
NEKERELRETEHLLHDENEDLRKLAENEITLCQKEITQLKHQFLRSGNKKKEEPNKDCNM
SAYGFPIEALKIACLMRGMSRSTGVGEDSGEAVPGILLKLWLSQNPHKETDENDLILEVT
AGVGGQEAMLFTSEIFDMYQQYAAFKRWHFETLEYFPSELGGLRHASASIGGSEAYRHMK
FEGGVHRVQRVPKTEKQGRVHTSTMTVAILPQPTEINLVINPKDLRIDTKRASGAGGQHV
NTTDSAVRIVHLPTGVVSECQQERSQLKNKELAMTKLRAKLYSMHLEEEINKRQNARKIQ
IGSKGRSEKIRTYNFPQNRVTDHRINKTLHDLETFMQGDYLLDELVQSLKEYADYESLVE
IISQKV
>gi568815592r:152871166_153082959|GENSCAN_predicted_CDS_8|2541_bp
atgtaccggagctttggggatgaccttggtaatagactggtgctgttcatacccatggcc
atggctccctctgccctagagaaggctgaacacatgaaaggtattatttctgtcccttta
caagtctcaggcccagaggaatttaccagactgtgtaggatctggctattgcaggcttgg
caaggttctgtttgcatgttgtcagttaaggtagcggatgctgtttcttcagtctgtatc
acttcgcgcttcccagccctcacattaatcggcacaggtctccatctggcaaacaactgc
ccattttcaacccaatttaatctttacgtcttctgtgaagcctttctcaattctcttatg
caaaagcacagcctcactgtgaggaatgaagaaagaggggaaaatgcgggaagacccaca
cacactacaaaaatggagagtatccaggtcctagaggaatgccaaaaccccactgcagag
gaagtcttgtcctggtctcaaaattttgacaagatgatgaaggccccagcaggaagaaac
cttttcagagagttcctccgaacagaatacagtgaagagaacctacttttctggcttgct
tgtgaagacttaaagaaggagcagaacaaaaaagtaattgaagaaaaggctaggatgata
tatgaagattacatttctatactatcaccaaaagagtggttatctcataatacacttatc
aagtccttattaccacgaacctcacccaccgtggacaagagaaacataagcaggaaaaaa
ggaggaggaataaacacacgcctgtccataataaaactcgctcttgaagactcagcggca
gccctgcaccggagactgacgacttgcgcggctgtgacctccgccctgcagcggaccctc
gactgccctgcactgcggctctggaggccccgactcagtgcatgggaaagaaatcctcac
tatcagaaaacagaggggcaatctgctgctctccctttccggccaaacacgtcacccatc
aaccggatacctaccaagaggctttcagaggaggcgcccaaggtctcccaggcccgcccc
tccccaatcacgctccgctcagccccctcaacttttggcctccgggaagttcgcagcgtt
ctcacgcttggcaggaagttcccgccaaggcttccggaaaatcctttaaaaagcaacgct
tgcgctgggcggggcttggtgcgctctcacccttatctccaaattctgggtgttgtcgcg
agggctgctgtgtccggaacttccggttccgcccgccggcccctgagctccggtagcccg
ccgctggaggagctgttcacccggggcgggcccttgcggaccttcctcgagcgccaggcg
gggtctgaagcccatttgaaggtcaggaggcccgagttgctggcggtgatcaaactgctg
aacgagaaggagcgggagctgcgggagactgagcacttgctgcacgatgagaatgaagat
ttaaggaaacttgcagagaatgaaatcactttgtgtcaaaaagaaataactcagctgaag
catcagtttttaagatcaggaaacaaaaaaaaggaggagccaaataaggactgtaacatg
agtgcctatggatttcccattgaagctctcaaaattgcctgtttgatgagaggaatgagc
aggagcactggtgtaggagaggactctggtgaagctgtccctggtattttgctaaaactc
tggctttctcaaaaccctcataaagaaacagatgaaaatgatttgatcctggaagtaact
gcaggagttggaggtcaggaggcaatgttgtttacatcagagatatttgatatgtatcag
caatatgctgcatttaaaagatggcattttgaaaccctggaatattttccaagtgaacta
ggtggccttagacatgcatctgccagcattgggggttcagaagcctataggcacatgaaa
tttgaaggaggtgttcacagagtacaaagagtgccaaagacagaaaagcaaggccgcgtc
catactagcaccatgactgtagcaatattaccccagcctactgagattaatctggtgatt
aatccgaaagatttgagaattgacactaagcgagccagtggagctggggggcagcatgta
aataccacggacagtgctgtccggatagttcatcttccaacaggtgttgtttctgaatgt
caacaagagagatctcagctgaaaaataaagagctggctatgacaaagttacgtgcaaaa
ctgtacagcatgcatctagaagaagaaataaataaaagacagaatgctagaaaaattcag
attggaagtaaaggaagatcagagaaaataagaacatataattttccacagaaccgggtc
acagatcacagaataaacaagacgctgcatgatcttgaaacttttatgcaaggagattat
ctactggatgaacttgtacagtcattgaaggaatacgccgattatgaatctttagtagaa
attatttcccaaaaagtttaa
>gi568815592r:152871166_153082959|GENSCAN_predicted_peptide_9|121_aa
MHRIRGYAFTCKIPKTQTNIYNYQDTVDSCRKKRTSPNELNKPPRINLEETEIYDLSDRE
FKISVLRKPKEIQENTEKEFRILSDKFNKDNEIIKKNQADILELKNAIGIMKNASESQKQ
N
>gi568815592r:152871166_153082959|GENSCAN_predicted_CDS_9|366_bp
atgcacaggataagaggatatgcattcacttgtaagatcccaaagacacagacaaacatc
tacaactatcaagacactgttgatagttgtaggaaaaaaaggacctcaccaaatgaacta
aataagccaccaaggattaatcttgaagaaacagagatatatgacctttcggacagagaa
ttcaaaatatcagttttgaggaaacccaaagaaattcaagagaacacagagaaggaattc
agaattctatcagataaatttaacaaagataatgaaataattaaaaagaatcaagcagac
attctggagctgaaaaatgcaattggcataatgaagaatgcatctgagtctcaaaagcag
aattga
>gi568815592r:152871166_153082959|GENSCAN_predicted_peptide_10|38_aa
MNIYRVSGSRVLNTIPHKKEPELDGEIVIQGLTQKNPK
>gi568815592r:152871166_153082959|GENSCAN_predicted_CDS_10|114_bp
atgaacatatatagagtatctggatcccgggttctaaataccattccccacaaaaaggaa
ccagagctggatggagaaatagtgattcaaggtctgacacagaaaaatcctaag